On Tue, 07 Oct 2008, Phil Barnett wrote:

Hi Phil,

> > Can you build Harbour in this system and then compiler and run
> > harbour/tests/speetst.prg f.e.:
> >    hbmk -n -w -es2 -mt speedtst
> >    ./speedtst --exclude=mem --scale --thread:2
> >    ./speedtst --exclude=mem --scale --thread:4
> >    ./speedtst --exclude=mem --scale --thread:6
> >    ./speedtst --exclude=mem --scale --thread:8
> Quad core Phenom 9600, 32 bit Fedora 9
> [EMAIL PROTECTED] tests]$  ./speedtst --exclude=mem --scale 
> *************************************************************************************************************************
[...]
> excluded tests: 029 030 023 025 027 040 041 043 052 053 019 022 031 032 054
>                                                         1 th.  2 th.  factor
> ============================================================================
> [   TOTAL   ]_________________________________________ 24.75  30.95 ->  0.80
> ============================================================================
> [ total application time: ]....................................84.63
> [ total real time: ]...........................................55.70

Many thanks for your tests.
The results are as wrong as in other tests but I forgot about one thing.
Your CPU just like mine strongly interacts with simultaneous assign the
same memory cell and the code which makes it in main HVM loop is covered
by #ifndef HB_GUI ... #endif.
To disable it you will have to rebuild Harbour with C_USR=-DHB_GUI
or at least compile this tests with -gc3 - it causes that except codeblocks
for the rest of code C code is generated not PCODE evaluated by main HVM
loop. I would like to ask you about yet another test. The same but using
Harbour recompiled with export C_USR=-DHB_GUI or with speedtst.prg compiled
with -gc3 switch, f.e.:
   hbmk -n -w -es2 -gc3 -mt speedtst
   for i in 1 2 4 6 8; do ./speedtst --exclude=mem --scale --thread:$i; done

AFAIR Enrico used to compile Harbour with -DHB_GUI so maybe also Intel
CPUs does not like it though Maurilio tests doesn't suggested it.
Before I'll remove this code from main HVM loop I would like to know
the overhead it causes also for Intel.

Viktor,
I've seen your Darwin 64-bit results - thank you. Much better.
If you have multi CPU machine then please compare the results of:
   ./speedtst --exclude=mem --scale --thread:2
when it's compiled with -gc3 and when whole Harbour is compiled with
-DHB_GUI
I would like to see results of 4 tests:
   ./speedtst --exclude=mem --scale --thread:2
done on the same multi Intel-CPU machine for:
   1. standard Harbour build and speedtst compiled with -gc0
   2. standard Harbour build and speedtst compiled with -gc3
   3. Harbour compiled with C_USR=-DHB_GUI and speedtst compiled with -gc0
   4. Harbour compiled with C_USR=-DHB_GUI and speedtst compiled with -gc3

It should give the answer what is the cost simultaneous memory assign by
more then one threads. I know that is very high in my CPU. Maurilio said
that there is small difference in his Intel machine on OS2 but I'm not
sure if it was not local to his tests. I can see in Enrico results that
it behaves better then my Phenom for [ T004: x := S_C ] where such
situation is intentional but in this case the operation is covered by
LOCK x86 machine instruction so the cost can be different then unprotected
memory write.

best regards,
Przemek
_______________________________________________
Harbour mailing list
Harbour@harbour-project.org
http://lists.harbour-project.org/mailman/listinfo/harbour

Reply via email to