On 10. 01. 19, 18:52, Andrei Vagin wrote:
> Currently, exit_ptrace() adds all ptraced tasks in a dead list, than
> zap_pid_ns_processes() waits all tasks in a current pidns, and only
> then tasks from the dead list are released.
> 
> zap_pid_ns_processes() can stuck on waiting tasks from the dead list. In
> this case, we will have one unkillable process with one or more dead
> children.
> 
> Thanks to Oleg for the advice to release tasks in find_child_reaper().
> 
> Fixes: 7c8bd2322c7f ("exit: ptrace: shift "reap dead" code from exit_ptrace() 
> to forget_original_parent()")
> 
> Cc: "Eric W. Biederman" <ebied...@xmission.com>
> Cc: Andrew Morton <a...@linux-foundation.org>
> Signed-off-by: Oleg Nesterov <o...@redhat.com>
> Signed-off-by: Andrei Vagin <ava...@gmail.com>
> ---
> 
> v2: Oleg showed that ptraced tasks can be released in
> find_child_reaper(). This allows to avoid additional
> write_lock/unlock(tasklist), and another list_for_each_entry_safe(dead)
> loop is called only if it is actually needed.
> 
>  kernel/exit.c | 12 ++++++++++--
>  1 file changed, 10 insertions(+), 2 deletions(-)
> 
> diff --git a/kernel/exit.c b/kernel/exit.c
> index 2d14979577ee..5df787a497f5 100644
> --- a/kernel/exit.c
> +++ b/kernel/exit.c
> @@ -558,12 +558,14 @@ static struct task_struct *find_alive_thread(struct 
> task_struct *p)
>       return NULL;
>  }
>  
> -static struct task_struct *find_child_reaper(struct task_struct *father)
> +static struct task_struct *find_child_reaper(struct task_struct *father,
> +                                             struct list_head *dead)
>       __releases(&tasklist_lock)
>       __acquires(&tasklist_lock)
>  {
>       struct pid_namespace *pid_ns = task_active_pid_ns(father);
>       struct task_struct *reaper = pid_ns->child_reaper;
> +     struct task_struct *p, *n;
>  
>       if (likely(reaper != father))
>               return reaper;
> @@ -579,6 +581,12 @@ static struct task_struct *find_child_reaper(struct 
> task_struct *father)
>               panic("Attempted to kill init! exitcode=0x%08x\n",
>                       father->signal->group_exit_code ?: father->exit_code);
>       }
> +
> +     list_for_each_entry_safe(p, n, dead, ptrace_entry) {
> +             list_del_init(&p->ptrace_entry);
> +             release_task(p);
> +     }
> +

Hi,

from our (SUSE) QA we received a report that this patch causes a
performance decline in libmicro pthread_* benchmark as reported in:
https://bugzilla.suse.com/show_bug.cgi?id=1126762

I tried myself from the repo:
https://github.com/redhat-performance/libMicro

I ran
pthread_create -B 8 -C 200 -S

and with the patch applied:
# STATISTICS       usecs/call (raw)          usecs/call (outliers removed)
#                   mean     23.38611                17.29311

Without:
#                   mean     41.36539                39.21347

The values vary, but they are around 23 and 42, respectively.

The benchmark seems to create 8 (-B above) pthreads, does lock/unlock in
them and then the threads exit. The benchmark reaps the threads via
pthread_join. This all happens 200 times (-C above).

Any idea how to restore the performance close to the previous state?

>       zap_pid_ns_processes(pid_ns);
>       write_lock_irq(&tasklist_lock);
>  
> @@ -668,7 +676,7 @@ static void forget_original_parent(struct task_struct 
> *father,
>               exit_ptrace(father, dead);
>  
>       /* Can drop and reacquire tasklist_lock */
> -     reaper = find_child_reaper(father);
> +     reaper = find_child_reaper(father, dead);
>       if (list_empty(&father->children))
>               return;

thanks,
-- 
js
suse labs

Reply via email to