On Wed, Aug 5, 2009 at 8:29 AM, Johann Kraus<johann.kr...@gmail.com> wrote: > >> Could it be that your CPU has a single floating-point unit shared by 4 >> cores on a single die, and thus only 2 floating-point units total for >> all 8 of your cores? If so, then that fact, plus the fact that each >> core has its own separate ALU for integer operations, would seem to >> explain the results you are seeing. > > Exactly, this would explain the behaviour. But unfortunately it is not > the case. I implemented a small example using Java (Java Threads) and > C (PThreads) and both times I get a linear speedup. See the attached > code below. The cores only share 12 MB cache, but this should be > enough memory for my micro-benchmark. Seeing the linear speedup in > Java and C, I would negate a hardware limitation. > > _ > Johann > > ### C ### >
That's a lot of code for a message, could you please use a pastebin next time? Thanks. I looked briefly at your problem and don't see anything right off the bat. Do you have a profiler and could you try that out? I'm interested. Thanks, Rich --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Clojure" group. To post to this group, send email to clojure@googlegroups.com Note that posts from new members are moderated - please be patient with your first post. To unsubscribe from this group, send email to clojure+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/clojure?hl=en -~----------~----~----~----~------~----~------~--~---