On Tue, Oct 29, 2019 at 01:09:01PM +0300, Andrew Rybchenko wrote:
> On 10/28/19 5:01 PM, Olivier Matz wrote:
> > The size returned by rte_mempool_op_calc_mem_size_default() is aligned
> > to the specified page size. Therefore, with big pages, the returned size
> > can be much more than what we really need to populate the mempool.
> > 
> > For instance, populating a mempool that requires 1.1GB of memory with
> > 1GB hugepages can result in allocating 2GB of memory.
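
To make the rounding concrete, here is a small standalone sketch of the
pre-patch computation (the element size and object count below are made
up for illustration, only the order of magnitude matters):

#include <stdio.h>
#include <stdint.h>
#include <inttypes.h>

int main(void)
{
	uint64_t pg_shift = 30;                /* 1 GB hugepages */
	uint64_t pg_sz = 1ULL << pg_shift;
	uint64_t total_elt_sz = 2176;          /* hypothetical obj size incl. header/trailer */
	uint64_t obj_num = 542000;             /* ~1.1 GB worth of objects */
	uint64_t obj_per_page = pg_sz / total_elt_sz;

	/* pre-patch: round the page count up, then multiply by the page size */
	uint64_t pg_num = (obj_num + obj_per_page - 1) / obj_per_page;
	uint64_t old_size = pg_num << pg_shift;

	printf("needed ~%" PRIu64 " MB, old calc asks for %" PRIu64 " MB\n",
	       (obj_num * total_elt_sz) >> 20, old_size >> 20);
	return 0;
}

With these numbers it prints "needed ~1124 MB, old calc asks for 2048 MB".
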
> > 
> > This problem is hidden most of the time due to the allocation method of
> > rte_mempool_populate_default(): when try_iova_contig_mempool=true, it
> > first tries to allocate an iova contiguous area, without the alignment
> > constraint. If it fails, it falls back to an aligned allocation that does
> > not need to be iova-contiguous. This can also fall back to several
> > smaller aligned allocations.
> > 
> > This commit changes rte_mempool_op_calc_mem_size_default() to relax the
> > alignment constraint to a cache line and to return a smaller size.
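
Appending the new formula from the patch below to the sketch above (same
made-up numbers) gives roughly the required size instead of a full extra
page:

	/* new calc: size the last page by its actual objects, plus a
	 * total_elt_sz - 1 margin for a possibly misaligned start */
	uint64_t objs_in_last_page = ((obj_num - 1) % obj_per_page) + 1;
	uint64_t new_size = objs_in_last_page * total_elt_sz;
	new_size += ((obj_num - objs_in_last_page) / obj_per_page) << pg_shift;
	new_size += total_elt_sz - 1;
	/* ~1124 MB here, vs 2048 MB with the pre-patch computation */
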
> > 
> > Signed-off-by: Olivier Matz <olivier.m...@6wind.com>
> 
> One maybe unrelated question below
> 
> Reviewed-by: Andrew Rybchenko <arybche...@solarflare.com>
> 
> [snip]
> 
> > diff --git a/lib/librte_mempool/rte_mempool_ops_default.c b/lib/librte_mempool/rte_mempool_ops_default.c
> > index 4e2bfc82d..f6aea7662 100644
> > --- a/lib/librte_mempool/rte_mempool_ops_default.c
> > +++ b/lib/librte_mempool/rte_mempool_ops_default.c
> > @@ -12,7 +12,7 @@ rte_mempool_op_calc_mem_size_default(const struct rte_mempool *mp,
> >                                  size_t *min_chunk_size, size_t *align)
> >   {
> >     size_t total_elt_sz;
> > -   size_t obj_per_page, pg_num, pg_sz;
> > +   size_t obj_per_page, pg_sz, objs_in_last_page;
> >     size_t mem_size;
> >     total_elt_sz = mp->header_size + mp->elt_size + mp->trailer_size;
> > @@ -33,14 +33,30 @@ rte_mempool_op_calc_mem_size_default(const struct rte_mempool *mp,
> >                     mem_size =
> >                             RTE_ALIGN_CEIL(total_elt_sz, pg_sz) * obj_num;
> >             } else {
> > -                   pg_num = (obj_num + obj_per_page - 1) / obj_per_page;
> > -                   mem_size = pg_num << pg_shift;
> > +                   /* In the best case, the allocator will return a
> > +                    * page-aligned address. For example, with 5 objs,
> > +                    * the required space is as below:
> > +                    *  |     page0     |     page1     |  page2 (last) |
> > +                    *  |obj0 |obj1 |xxx|obj2 |obj3 |xxx|obj4|
> > +                    *  <------------- mem_size ------------->
> > +                    */
> > +                   objs_in_last_page = ((obj_num - 1) % obj_per_page) + 1;
> > +                   /* room required for the last page */
> > +                   mem_size = objs_in_last_page * total_elt_sz;
> > +                   /* room required for other pages */
> > +                   mem_size += ((obj_num - objs_in_last_page) /
> > +                           obj_per_page) << pg_shift;
> > +
> > +                   /* In the worst case, the allocator returns a
> > +                    * non-aligned pointer, wasting up to
> > +                    * total_elt_sz. Add a margin for that.
> > +                    */
> > +                    mem_size += total_elt_sz - 1;
> >             }
> >     }
> > -   *min_chunk_size = RTE_MAX((size_t)1 << pg_shift, total_elt_sz);
> > -
> > -   *align = RTE_MAX((size_t)RTE_CACHE_LINE_SIZE, (size_t)1 << pg_shift);
> > +   *min_chunk_size = total_elt_sz;
> > +   *align = RTE_CACHE_LINE_SIZE;
> 
> Not directly related to the patch, but maybe RTE_MEMPOOL_ALIGN should be
> used?

Yes, and there is another hard-coded cache-line alignment that could be
changed in rte_mempool_populate_iova().

I can add a patch for that.
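
For reference, a rough sketch of what that follow-up could look like
(assuming RTE_MEMPOOL_ALIGN still defaults to RTE_CACHE_LINE_SIZE, so the
behavior would not change):

	/* in rte_mempool_op_calc_mem_size_default() */
	*min_chunk_size = total_elt_sz;
	*align = RTE_MEMPOOL_ALIGN;

	/* and the hard-coded cache-line alignment of the object area in
	 * rte_mempool_populate_iova() could use the same macro, e.g.: */
	off = RTE_PTR_ALIGN_CEIL(vaddr, RTE_MEMPOOL_ALIGN) - vaddr;
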

> 
> >     return mem_size;
> >   }
> 
