On Thu, 24 May 2001, Mike Galbraith wrote:
> On Sun, 20 May 2001, Rik van Riel wrote:
>
> > Remember that inactive_clean pages are always immediately
> > reclaimable by __alloc_pages(), if you measured a performance
> > difference by freeing pages in a different way I'm pretty sure
> > it's a side effect of something else.  What that something
> > else is I'm curious to find out, but I'm pretty convinced that
> > throwing away data early isn't the way to go.
>
> OK.. let's forget about throughput for a moment and consider
> those annoying reports of 0 order allocations failing :)

Those are ok.  All failing 0 order allocations are either
atomic allocations or GFP_BUFFER allocations.  I guess we
should just remove the printk()  ;)
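
For reference, the failure path at the tail of __alloc_pages()
is shaped roughly like this (a from-memory sketch, not the
literal 2.4 source):

	/*
	 * __GFP_WAIT callers loop and help with reclaim, so in
	 * practice only atomic callers (which must not sleep) and
	 * GFP_BUFFER callers (which may not do I/O to free memory)
	 * ever fall through to here; for them a NULL return is
	 * expected behaviour, not a VM bug.
	 */
	printk(KERN_WARNING "__alloc_pages: %lu-order allocation failed.\n", order);
	return NULL;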

> What do you think of the below (ignore the refill_inactive bit)
> wrt allocator reliability under heavy stress?  The thing does
> kick in and pump up zones even if I set the 'blood donor' level
> to pages_min.

> -             unsigned long water_mark;
> +             unsigned long water_mark = 1 << order;

Makes no sense at all, since water_mark gets assigned again
not 10 lines below, so the initialiser is a dead store.  ;)
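
To spell it out (shape from memory, not the exact lines):

	unsigned long water_mark = 1 << order;	/* dead store ... */
	...
	water_mark = z->pages_min;	/* ... overwritten before any use
					 * (or pages_low/pages_high,
					 * depending on the limit) */

so the 1 << order value can never be observed.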


> +             if (direct_reclaim) {
> +                     int count;
> +
> +                     /* If we're in bad shape.. */
> +                     if (z->free_pages < z->pages_low && z->inactive_clean_pages) {

I'm not sure if we want to fill up the free list all the way
to z->pages_low all the time, since "free memory is wasted
memory".

The reason the current scheme only triggers when we reach
z->pages_min and then refills all the way up to z->pages_low
is memory defragmentation. Since we'll be doing direct
reclaim for just about every allocation in the system, it
only happens occasionally that we throw away all the
inactive_clean pages between z->pages_min and z->pages_low;
and when we do, freeing that whole batch at once gives the
buddy allocator adjacent pages to coalesce into larger
contiguous blocks.
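
Schematically (a sketch of the intended hysteresis, not the
code in the tree; assume reclaim_page() hands back a clean
page which __free_page() then returns to the free lists):

	/*
	 * Do nothing while free_pages sits between pages_min and
	 * pages_low; only when we dip below pages_min do we
	 * batch-reclaim back up to pages_low.
	 */
	if (z->free_pages < z->pages_min) {
		while (z->free_pages < z->pages_low) {
			struct page *page = reclaim_page(z);
			if (!page)
				break;	/* inactive_clean list empty */
			__free_page(page);
		}
	}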

> +                             count = 4 * (1 << page_cluster);
> +                             /* reclaim a page for ourselves if we can afford to.. */
> +                             if (z->inactive_clean_pages > count)
> +                                     page = reclaim_page(z);
> +                             if (z->inactive_clean_pages < 2 * count)
> +                                     count = z->inactive_clean_pages / 2;
> +                     } else count = 0;

What exactly is the reasoning behind this complex "count"
stuff?  Is there a good reason for not just refilling the
free list up to the target, or until the inactive_clean list
is depleted?
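
That is, why not simply (sketch):

	/*
	 * No "count" arithmetic: refill until we hit the target
	 * watermark or the inactive_clean list runs dry.
	 */
	while (z->free_pages < water_mark) {
		struct page *page = reclaim_page(z);
		if (!page)
			break;
		__free_page(page);
	}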

> +                     /*
> +                      * and make a small donation to the reclaim challenged.
> +                      *
> +                      * We don't ever want a zone to reach the state where we
> +                      * have nothing except reclaimable pages left.. not if
> +                      * we can possibly do something to help prevent it.
> +                      */

This comment makes little sense.

> +             if (z->inactive_clean_pages - z->free_pages > z->pages_low
> +                             && waitqueue_active(&kreclaimd_wait))
> +                     wake_up_interruptible(&kreclaimd_wait);

This doesn't make any sense to me at all.  Why wake up
kreclaimd just because the difference between the number
of inactive_clean pages and free pages is large?

Didn't we determine in our last exchange of email that
it would be a good thing under most loads to keep as much
inactive_clean memory around as possible, and not waste^Wfree
memory early?

> -     /*
> -      * First, see if we have any zones with lots of free memory.
> -      *
> -      * We allocate free memory first because it doesn't contain
> -      * any data ... DUH!
> -      */

We want to keep this.  Suppose we have one zone which is
half filled with inactive_clean pages and one zone which
has "too many" free pages.

Allocating from the first zone means we evict some piece
of potentially useful data from the cache; allocating from
the second zone means we keep that data in memory and only
fill up a currently unused page.
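
In other words, the ordering that comment documents (zone
walking heavily simplified; rmqueue() is the function that
actually takes pages off the free lists):

	/*
	 * First pass: prefer zones that still have plenty of
	 * truly free pages; those contain no data, so using
	 * them costs nothing.
	 */
	for (zone = zonelist->zones; *zone; zone++)
		if ((*zone)->free_pages > (*zone)->pages_low)
			return rmqueue(*zone, order);

	/*
	 * Second pass: only now reclaim inactive_clean pages,
	 * which evicts potentially useful cached data.
	 */
	for (zone = zonelist->zones; *zone; zone++)
		if ((*zone)->inactive_clean_pages)
			return reclaim_page(*zone);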


> @@ -824,39 +824,17 @@
>  #define DEF_PRIORITY (6)
>  static int refill_inactive(unsigned int gfp_mask, int user)
>  {

I've heard all kinds of things about this part of the patch,
except an explanation of why and how it is supposed to work ;)


> @@ -976,8 +954,9 @@
>                * We go to sleep for one second, but if it's needed
>                * we'll be woken up earlier...
>                */
> -             if (!free_shortage() || !inactive_shortage()) {
> -                     interruptible_sleep_on_timeout(&kswapd_wait, HZ);
> +             if (current->need_resched || !free_shortage() ||
> +                             !inactive_shortage()) {
> +                     interruptible_sleep_on_timeout(&kswapd_wait, HZ/10);

Makes sense.  Integrated in my tree ;)


regards,

Rik
--
Linux MM bugzilla: http://linux-mm.org/bugzilla.shtml

Virtual memory is like a game you can't win;
However, without VM there's truly nothing to lose...

                http://www.surriel.com/
http://www.conectiva.com/       http://distro.conectiva.com/
