On Tue, 24 Nov 2009, Mindaugas Kavaliauskas wrote:

Hi,

> >  * harbour/src/vm/fm.c
> >    * enabled HB_FM_DLMT_ALLOC by default in MT HVM if HB_FM_DL_ALLOC is
> >      also enabled
> BTW, what is HB_FM_DLMT_ALLOC performance in comparison to native
> Linux malloc() (in MT mode of cause)?

It's not trivial to well compare performance of different memory allocators
in MT mode. To give precise answer we should prepare some series of dedicated
tests anyhow using speedtst.prg in different mode we can create basic tests
and the performance seems to be quite good. In practice the same as with
NEDMALLOC or PTMALLOC but this is nearly the same code so it's expected.
DLMALLOC is compiled as part of Harbour core code with strong optimization
switches and well optimized as part of hvmall.c so it's usually a little
bit faster in all tests.
SUSE Linux uses Hoard Memory Allocator which seems to be the best memory
allocator for MT mode. In older SUSE builds I made some tests on x...@32
and it was always faster then DLMALLOC in ST and MT modes. Recently I've
repeated test with SUSE 11.2 x...@64 and in ST mode DLMALLOC is faster but
slower in MT mode. I guess that faster ST mode is mostly result of better
optimizations when DLMALLOC is compiled as part of hvmall.c. In MT mode
is slower but the difference is not very huge in concurrent access when
HB_FM_DLMT_ALLOC is enabled, here are results you can compare:

1) ./speedtst

DLMALLOC ST mode:
   [ total application time: ]....................................15.47
   [ total real time: ]...........................................15.47

Hoard ST mode:
   [ total application time: ]....................................16.76
   [ total real time: ]...........................................16.78


2) ./speedtst

DLMT MT MODE:
   [ total application time: ]....................................19.58
   [ total real time: ]...........................................19.59

Hoard ST mode:
   [ total application time: ]....................................18.50
   [ total real time: ]...........................................18.50


3) ./speedtst --thread

DLMT
   [ total application time: ]....................................26.65
   [ total real time: ]............................................9.73
Hoard:
   [ total application time: ]....................................24.37
   [ total real time: ]............................................8.68


4) ./speedtst --thread=3 --scale

DLMT:
   [   TOTAL   ]_________________________________ 59.95  29.64 ->  2.02
   [ total application time: ]...................................147.05
   [ total real time: ]...........................................89.60
Hoard:
   [   TOTAL   ]_________________________________ 57.93  32.01 ->  1.81
   [ total application time: ]...................................149.78
   [ total real time: ]...........................................89.95


As you can see in the last test DLMT has even better scalability what
allows to reduce the speed difference and DLMT was finally a little bit
faster. Anyhow this test uses only 3 threads which very nicely play with
DLMT and I expect that when much bigger number of threads is used on multi
CPU hardware then Hoard will be more efficient i.e. in test (3) we have
56 threads and here Hoard is still faster. But of course it does not
change the fact that DLMT results are really good.

best regards,
Przemek
_______________________________________________
Harbour mailing list (attachment size limit: 40KB)
Harbour@harbour-project.org
http://lists.harbour-project.org/mailman/listinfo/harbour

Reply via email to