On Tue, Feb 12, 2019 at 12:34:09PM -0800, Andrew Morton wrote:
> On Tue, 12 Feb 2019 13:47:24 -0500 Johannes Weiner <han...@cmpxchg.org> wrote:
> 
> > On Tue, Feb 12, 2019 at 09:56:45AM -0800, Roman Gushchin wrote:
> > > The patchset contains few changes to the vmalloc code, which are
> > > leading to some performance gains and code simplification.
> > > 
> > > Also, it exports a number of pages, used by vmalloc(),
> > > in /proc/meminfo.
> > > 
> > > Patch (1) removes some redundancy on __vunmap().
> > > Patch (2) separates memory allocation and data initialization
> > >   in alloc_vmap_area()
> > > Patch (3) adds vmalloc counter to /proc/meminfo.
> > > 
> > > v2->v1:
> > >   - rebased on top of current mm tree
> > >   - switch from atomic to percpu vmalloc page counter
> > 
> > I don't understand what prompted this change to percpu counters.

I *think*, I see some performance difference, but it's barely measurable
in my setup. Also as I remember, Matthew was asking why not percpu here.
So if everybody prefers a global atomic, I'm fine with either.

> > 
> > All writers already write vmap_area_lock and vmap_area_list, so it's
> > not really saving much. The for_each_possible_cpu() for /proc/meminfo
> > on the other hand is troublesome.
> 
> percpu_counters would fit here.  They have probably-unneeded locking
> but I expect that will be acceptable.
> 
> And they address the issues with for_each_possible_cpu() avoidance, CPU
> hotplug and transient negative values.

Not sure, because percpu_counters are based on dynamic percpu allocations,
which are using vmalloc under the hood.

Thanks!

Reply via email to