On Tuesday, January 06, 2015 03:51:20 PM Imre Palik wrote:
> From: "Palik, Imre" <im...@amazon.de>
> 
> When file auditing is enabled, during a low memory situation, a memory
> allocation with __GFP_FS can lead to pruning the inode cache.  Which can,
> in turn lead to audit_tree_freeing_mark() being called.  This can call
> audit_schedule_prune(), that tries to fork a pruning thread, and
> waits until the thread is created.  But forking needs memory, and the
> memory allocations there are done with __GFP_FS.
> 
> So we are waiting merrily for some __GFP_FS memory allocations to complete,
> while holding some filesystem locks.  This can take a while ...
> 
> This patch creates a single thread for pruning the tree from
> audit_add_tree_rule(), and thus avoids the deadlock that the on-demand
> thread creation can cause.
> 
> Reported-by: Matt Wilson <m...@amazon.com>
> Cc: Matt Wilson <m...@amazon.com>
> Signed-off-by: Imre Palik <im...@amazon.de>

Thanks for sticking with this and posting a revised patch, my comments are 
inline with the patch below ... also as a FYI, when sending a revised patch it 
is often helpful to put a revision indicator in the subject line, as an 
example:

  "[RFC PATCH v2] audit: make audit less awful"

It's not strictly necessary, but it makes my life just a little bit easier.

> diff --git a/kernel/audit_tree.c b/kernel/audit_tree.c
> index 0caf1f8..0ada577 100644
> --- a/kernel/audit_tree.c
> +++ b/kernel/audit_tree.c

...

> +static int launch_prune_thread(void)
> +{
> +     prune_thread = kthread_create(prune_tree_thread, NULL,
> +                             "audit_prune_tree");
> +     if (IS_ERR(prune_thread)) {
> +             audit_panic("cannot start thread audit_prune_tree");

I'm not certain audit_panic() is warranted here, pr_err() might be a better 
choice.  What is the harm if the thread doesn't start and we return an error 
code?

> +             prune_thread = NULL;
> +             return -ENOSYS;

Out of curiosity, why ENOSYS?

> +     } else {
> +             wake_up_process(prune_thread);
> +             return 0;
> +     }
> +}

See my comments below in audit_schedule_prune().

>  /* called with audit_filter_mutex */
>  int audit_add_tree_rule(struct audit_krule *rule)
>  {
> @@ -663,6 +713,12 @@ int audit_add_tree_rule(struct audit_krule *rule)
>       /* do not set rule->tree yet */
>       mutex_unlock(&audit_filter_mutex);
> 
> +     if (unlikely(!prune_thread)) {
> +             err = launch_prune_thread();
> +             if (err)
> +                     goto Err;
> +     }
> +

Why not put this at the top of audit_add_tree_rule()?

>       err = kern_path(tree->pathname, 0, &path);
>       if (err)
>               goto Err;
> @@ -713,6 +769,9 @@ int audit_tag_tree(char *old, char *new)
>       struct vfsmount *tagged;
>       int err;
> 
> +     if (!prune_thread)
> +             return -ENOSYS;

Help me out - why?

>       err = kern_path(new, 0, &path2);
>       if (err)
>               return err;
> @@ -800,36 +859,11 @@ int audit_tag_tree(char *old, char *new)
>       return failed;
>  }
> 
> -/*
> - * That gets run when evict_chunk() ends up needing to kill audit_tree.
> - * Runs from a separate thread.
> - */
> -static int prune_tree_thread(void *unused)
> -{
> -     mutex_lock(&audit_cmd_mutex);
> -     mutex_lock(&audit_filter_mutex);
> -
> -     while (!list_empty(&prune_list)) {
> -             struct audit_tree *victim;
> -
> -             victim = list_entry(prune_list.next, struct audit_tree, list);
> -             list_del_init(&victim->list);
> -
> -             mutex_unlock(&audit_filter_mutex);
> -
> -             prune_one(victim);
> -
> -             mutex_lock(&audit_filter_mutex);
> -     }
> -
> -     mutex_unlock(&audit_filter_mutex);
> -     mutex_unlock(&audit_cmd_mutex);
> -     return 0;
> -}
> 
>  static void audit_schedule_prune(void)
>  {
> -     kthread_run(prune_tree_thread, NULL, "audit_prune_tree");
> +     BUG_ON(!prune_thread);
> +     wake_up_process(prune_thread);
>  }

First, I probably wasn't clear last time so I'll be more clear now: no 
BUG_ON() here, handle the error.

Second, and closely related to the last sentence, perhaps the right approach 
is to merge the launch_prune_thread() code with audit_schedule_prune() such 
that we only have one function which starts the thread if it isn't present, 
and wakes it up if it is, something like the following:

        static int audit_schedule_prune(void)
        {
                if (!prune_thread) {
                        prune_thread = kthread_create(...);
                        if (IS_ERR(prune_thread)) {
                                pr_err("cannot start thread audit_prune_tree");
                                prune_thread = NULL;
                                return -ENOSYS;
                        }
                }

                wake_up_process(prune_thread);
                return 0;
        }

>  /*
> @@ -896,9 +930,10 @@ static void evict_chunk(struct audit_chunk *chunk)
>       for (n = 0; n < chunk->count; n++)
>               list_del_init(&chunk->owners[n].list);
>       spin_unlock(&hash_lock);
> +     mutex_unlock(&audit_filter_mutex);
>       if (need_prune)
>               audit_schedule_prune();
> -     mutex_unlock(&audit_filter_mutex);
> +
>  }

Remove that trailing empty vertical whitespace please.  If you aren't using it 
already, you should look into scripts/checkpatch.pl to sanity check your 
patches before sending.

-- 
paul moore
www.paul-moore.com

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to