Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-15 Thread Andriy Gapon
on 15/04/2010 16:23 Adam Vande More said the following: > Is is possible to add a tunable to the scheduler for it's aggressiveness > in switching cores? No idea; not a scheduler person. -- Andriy Gapon ___ freebsd-stable@freebsd.org mailing list http:/

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-15 Thread Adam Vande More
On Thu, Apr 15, 2010 at 3:54 AM, Andriy Gapon wrote: > This is a good point. > But on the other hand, it means that our scheduler doesn't do a perfect job > here. BTW, I use ULE. > My observation is that when a number of CPU-intensive long running > processes is > less than or equal to number of

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-15 Thread Andriy Gapon
on 14/04/2010 20:47 Adam Vande More said the following: > I'm no expert Andriy, but it seems like if gotoblas > implemented some of the FreeBSD optimizations then we'd be in the same > ballpark. This is a good point. But on the other hand, it means that our scheduler doesn't do a perfect job here.

Re: HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-15 Thread Adrian Chadd
4.289174 > n: 3600 > time : 92.195022 or 25.105942 > Mflops : 37177.621122 > n: 3700 > time : 97.718841 or 25.434243 > Mflops : 39841.319494 > n: 3800 > time : 105.740463 or 27.414029 > Mflops : 40042.592613 > n: 3900 > time : 113.980157 or 29.678505 > Mflops : 39984.6

Re: Linux static linked ver doesn't work on FBSD (Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-15 Thread Garrett Cooper
On Wed, Apr 14, 2010 at 10:26 PM, Maho NAKATA wrote: > From: Pieter de Goeje > Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance > on FreeBSD 8/amd64, Corei7 920 > Date: Wed, 14 Apr 2010 16:05:18 +0200 > >> I think the best test would be to run

Re: HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-15 Thread Garrett Cooper
On Wed, Apr 14, 2010 at 9:21 PM, Ian Smith wrote: > On Wed, 14 Apr 2010, Garrett Cooper wrote: >  > On Wed, Apr 14, 2010 at 7:49 PM, Garrett Cooper wrote: >  > > On Wed, Apr 14, 2010 at 5:46 PM, Maho NAKATA wrote: >  > >> Hi Andry and Adam >  > >> >  > >> My test again. No desktop, etc. I just r

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Andriy Gapon
on 15/04/2010 04:20 Maho NAKATA said the following: > Hi Andriy and Adam, > > I did also the same thing as suggested. > > my conclusion: on Core i7 920, 2.66GHz, TurboBoost on, HyperThreading off, So HyperThreading is off. > then, pinned to each core like following > > % procstat -t 1408 >

Linux static linked ver doesn't work on FBSD (Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-14 Thread Maho NAKATA
From: Pieter de Goeje Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Wed, 14 Apr 2010 16:05:18 +0200 > I think the best test would be to run a statically compiled linux binary on > FreeBSD. That way the compiler settin

Re: HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-14 Thread Ian Smith
On Wed, 14 Apr 2010, Garrett Cooper wrote: > On Wed, Apr 14, 2010 at 7:49 PM, Garrett Cooper wrote: > > On Wed, Apr 14, 2010 at 5:46 PM, Maho NAKATA wrote: > >> Hi Andry and Adam > >> > >> My test again. No desktop, etc. I just run dgemm. > >> Contrary to Adam's result, Hyper Threading make

Re: HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-14 Thread Garrett Cooper
On Wed, Apr 14, 2010 at 7:49 PM, Garrett Cooper wrote: > On Wed, Apr 14, 2010 at 5:46 PM, Maho NAKATA wrote: >> Hi Andry and Adam >> >> My test again. No desktop, etc. I just run dgemm. >> Contrary to Adam's result, Hyper Threading makes the performance worse. >> all tests are done on Core i7 920

Re: HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-14 Thread Garrett Cooper
On Wed, Apr 14, 2010 at 5:46 PM, Maho NAKATA wrote: > Hi Andry and Adam > > My test again. No desktop, etc. I just run dgemm. > Contrary to Adam's result, Hyper Threading makes the performance worse. > all tests are done on Core i7 920 @ 2.67GHz. (TurboBoost @2.8GHz) > > Turbo Boost off, Hyper thr

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Maho NAKATA
Hi Adam, From: Adam Vande More Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Wed, 14 Apr 2010 12:47:31 -0500 > Since this is a full fledged desktop environment, 90% utilization seems > pretty good. No, I don't thi

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Maho NAKATA
66 or 42.380653 Mflops : 40208.611317 n: 4500 time : 175.930335 or 45.422572 Mflops : 40132.139469 Thanks From: Adam Vande More Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Wed, 14 Apr 2010 12:47:31 -0500 > On Wed, Apr 14, 2010 at

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Maho NAKATA
opps I missed this e-mail... From: Adam Vande More Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Wed, 14 Apr 2010 11:45:04 -0500 > On Wed, Apr 14, 2010 at 11:34 AM, Adam Vande More > wrote: > >> >>

HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-14 Thread Maho NAKATA
lops : 39984.635420 n: 4000 time : 122.941569 or 31.946174 Mflops : 40077.412531 n: 4100 ---DETAILS--- From: Adam Vande More Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Wed, 14 Apr 2010 11:34:45 -0500 >> > time : 162.4

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Maho NAKATA
From: Andriy Gapon Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Wed, 14 Apr 2010 16:19:13 +0300 > on 14/04/2010 02:21 Maho NAKATA said the following: >> 2. install ports/math/gotoblas (manual download required) >>

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Adam Vande More
On Wed, Apr 14, 2010 at 11:51 AM, Andriy Gapon wrote: > on 14/04/2010 19:45 Adam Vande More said the following: > > > > also if I run cpuset on the dgemm then the utilization is basically at > > the theoretical max for one core so at least that part is working. > > You can also try procstat -t t

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Andriy Gapon
on 14/04/2010 19:45 Adam Vande More said the following: > > also if I run cpuset on the dgemm then the utilization is basically at > the theoretical max for one core so at least that part is working. You can also try procstat -t to find out thread IDs and cpuset -t to pin the threads to the cor

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Adam Vande More
On Wed, Apr 14, 2010 at 11:34 AM, Adam Vande More wrote: > > > > That's about 67% utilization, turning off HTT drops it more. HTT on the > newer cores is good, not bad. > Well that was completely contrarty to some tests I'd run when I first got the cpu. With HTT off: n: 3000 time : 44.705516 o

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Adam Vande More
On Wed, Apr 14, 2010 at 10:26 AM, Andriy Gapon wrote: > on 14/04/2010 02:21 Maho NAKATA said the following: > > 4. run dgemm. > > % ./dgemm > > n: 3000 > > time : 134.648208 or 16.910525 > > Mflops : 31943.419695 > > n: 3100 > > time : 148.122279 or 18.615284 > > Mflops : 32017.357408 > > n: 3200

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Andriy Gapon
on 14/04/2010 02:21 Maho NAKATA said the following: > 4. run dgemm. > % ./dgemm > n: 3000 > time : 134.648208 or 16.910525 > Mflops : 31943.419695 > n: 3100 > time : 148.122279 or 18.615284 > Mflops : 32017.357408 > n: 3200 > time : 162.45 or 20.430651 > Mflops : 32087.318295 > n: 3300 > ti

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Pieter de Goeje
On Wednesday 14 April 2010 15:19:13 Andriy Gapon wrote: > on 14/04/2010 02:21 Maho NAKATA said the following: > > 2. install ports/math/gotoblas (manual download required) > > make install > > Do you know how gotoblas on Linux was obtained? > Was it built from source? > Has it come pre-packaged?

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Andriy Gapon
on 14/04/2010 02:21 Maho NAKATA said the following: > 2. install ports/math/gotoblas (manual download required) > make install Do you know how gotoblas on Linux was obtained? Was it built from source? Has it come pre-packaged? If so, can you find out details of its build configuration? Thanks!

How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-13 Thread Maho NAKATA
Hi all, thanks for showing interest in this issue. I uploaded my test code so that you can test on your PC. Following is the instruction. 1. download my source codes. http://people.freebsd.org/~maho/dgemm/Makefile http://people.freebsd.org/~maho/dgemm/dgemm.cpp check md5. % md5 Makefile dgemm.cp

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-13 Thread Andriy Gapon
on 13/04/2010 02:33 Maho NAKATA said the following: > From: Andriy Gapon >> Another question is what compilers (what versions of GCC) were used on both >> system to compile the program? > > Hi > > on Ubuntu $ gcc -v Using built-in specs. Target: x86_64-linux-gnu Configured > with: ../src/configu

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Alan Cox
On Tue, Apr 13, 2010 at 12:35 AM, Andrew Snow wrote: > > The statements about the scheduler flipping between cores is also somewhat > false, ULE does the right thing now for long-running computational threads. > > Furthermore, I can't see how a Gflops benchmark which fits in the CPU cache > has a

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Andrew Snow
The statements about the scheduler flipping between cores is also somewhat false, ULE does the right thing now for long-running computational threads. Furthermore, I can't see how a Gflops benchmark which fits in the CPU cache has anything to do with the memory architecture of the operating

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Alan Cox
On Sun, Apr 11, 2010 at 11:12 PM, Maho NAKATA wrote: > Hi FreeBSD developers, > [the original article in Japanese can be found at > http://blog.goo.ne.jp/nakatamaho/e/b5f6fbc3cc6e1ac4947463eb1ca4eb0a ] > > *Abstract* > I compared the peak performance of FreeBSD 8.0/amd64 and Ubuntu 9.10 amd64 > u

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Michael Poole
Maho NAKATA writes: > From: Michael Poole > Subject: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, > Corei7 920 > Date: Mon, 12 Apr 2010 10:06:55 -0400 > >> Nakata-san's theoretical performance numbers assume 4 to 4.2 operations >> per core pe

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Maho NAKATA
Hi Bruce, many thanks. I like FreeBSD, esp. ports, since I'm have been a ports committer for 8 years, so I'll do what I can do...First step might be reproducible results and provide better analysis for ports/math/ports/gotoblas. thanks -- Nakata Maho http://accc.riken.jp/maho/ , http://ja.openoff

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Maho NAKATA
Hi, Many thanks for interested in. I used following program to major the FLOPS. I'll provide more in details. you many need but you can change dgemm_f77 to something else to link agianst GotoBLAS (ports/math/gotoblas). I think you can use math/atlas but it takes too long time to compile... --- #i

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Maho NAKATA
Hi Andriy and Jeremy In my case, % sysctl vm.pmap.pg_ps_enabled vm.pmap.pg_ps_enabled: 1 thanks a lot! From: Jeremy Chadwick Subject: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Mon, 12 Apr 2010 08:00:23 -0700 > On Mon, Apr 12, 2010 at 05:41:35PM +0

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Maho NAKATA
From: Andriy Gapon Subject: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Mon, 12 Apr 2010 17:49:24 +0300 > on 12/04/2010 17:41 Andriy Gapon said the following: >> It would also be get good to learn more about your program. >> How much

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Maho NAKATA
From: Michael Poole Subject: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Mon, 12 Apr 2010 10:06:55 -0400 > Nakata-san's theoretical performance numbers assume 4 to 4.2 operations > per core per cycle at the nominal (2.66 GHz, non-TurboBoost)

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Maho NAKATA
, tho) Thanks From: Antony Mawer Subject: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Mon, 12 Apr 2010 23:58:17 +1000 > This may well be the same sort of issue that was discussed in this thread > here: > > http://lists.freebsd.org/piper

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Maho NAKATA
Hi Bruce, From: Bruce Simpson Subject: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Mon, 12 Apr 2010 10:49:14 +0100 > So, where's the profiling to discover why this is the case? Ok I'll provide better documentation so that everyone can test it

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Bruce Simpson
Hi all, There's a port archivers/pbzip2, and I am inclined to believe this is a good benchmark for multi-core performance in real-world usage (with an appropriate input data set). BZIP2 is a compression algorithm which is readily applicable to multicore, because of the nature in which its wo

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Jeremy Chadwick
On Mon, Apr 12, 2010 at 05:41:35PM +0300, Andriy Gapon wrote: > Perhaps, he talks about support of large pages (2M) and related improvements > in > TLB performance. If so, he (and you) may read about 'superpages' feature of > FreeBSD. > I am not sure if it is enabled by default in 8.0, you can c

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Andriy Gapon
on 12/04/2010 17:41 Andriy Gapon said the following: > It would also be get good to learn more about your program. > How much memory does it typically use, how does it allocate it? > Is it single-threaded or not? If not, how many threads does it have and what > do > they do, how do they communica

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Andriy Gapon
on 12/04/2010 07:12 Maho NAKATA said the following: > Hi FreeBSD developers, > [the original article in Japanese can be found at > http://blog.goo.ne.jp/nakatamaho/e/b5f6fbc3cc6e1ac4947463eb1ca4eb0a ] > > *Abstract* > I compared the peak performance of FreeBSD 8.0/amd64 and Ubuntu 9.10 amd64 > u

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Michael Poole
Antony Mawer writes: > This may well be the same sort of issue that was discussed in this thread > here: > > http://lists.freebsd.org/pipermail/freebsd-hackers/2010-March/031004.html > > In short, the Core i7 CPUs have a feature called "TurboBoost" where > the clock speed of one or more cores

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Antony Mawer
This may well be the same sort of issue that was discussed in this thread here: http://lists.freebsd.org/pipermail/freebsd-hackers/2010-March/031004.html In short, the Core i7 CPUs have a feature called "TurboBoost" where the clock speed of one or more cores is boosted when other cores are id

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Adrian Chadd
Of course, what would be helpful is actually figuring out what is going on rather than some conjecture. :) With what he said, tweaking memory allocation under FreeBSD and/or linux would change the performance characteristics and either validate or disprove his assumptions? Adrian On 12 April 20

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-12 Thread Bruce Simpson
On 04/12/10 05:12, Maho NAKATA wrote: *Abstract* I compared the peak performance of FreeBSD 8.0/amd64 and Ubuntu 9.10 amd64 using dgemm (a linear algebra routine, matrix-matrix multiplication). I obtained only 70% of theoretical peak performance on FreeBSD 8/amd64 and almost 95% on Ubuntu 9.10 /

Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-11 Thread Garrett Cooper
On Sun, Apr 11, 2010 at 9:12 PM, Maho NAKATA wrote: > Hi FreeBSD developers, > [the original article in Japanese can be found at > http://blog.goo.ne.jp/nakatamaho/e/b5f6fbc3cc6e1ac4947463eb1ca4eb0a ] > > *Abstract* > I compared the peak performance of FreeBSD 8.0/amd64 and Ubuntu 9.10 amd64 > us

Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-11 Thread Maho NAKATA
Hi FreeBSD developers, [the original article in Japanese can be found at http://blog.goo.ne.jp/nakatamaho/e/b5f6fbc3cc6e1ac4947463eb1ca4eb0a ] *Abstract* I compared the peak performance of FreeBSD 8.0/amd64 and Ubuntu 9.10 amd64 using dgemm (a linear algebra routine, matrix-matrix multiplication