Re: [racket-users] Places code not using all the CPU

George Neuner Fri, 05 Oct 2018 23:03:34 -0700


On 10/5/2018 10:32 AM, Matthew Flatt wrote:

At Fri, 5 Oct 2018 15:36:04 +0200, Paulo Matos wrote: > Again, I amreally surprised that you mention that places are not > separateprocesses. Documentation does say they are separate racket > virtualmachines, how is this accomplished if not by using separate >processes? Each place is an OS thread within the Racket process. Thevirtual machine is essentially instantiated once in each thread, wherethings that look like global variables at the C level are actuallythread-local variables to make them place-specific. Still, there issome sharing among the threads. > My workers are really doing Z3 stylework - number crushing and lots of > searching. No IO (writing todisk) or communication so I would expect > them to really max out allCPUs. My best guess is that it's memory-allocation bottlenecks,probably at the point of using mmap() and mprotect(). Maybe thingsdon't scale well beyond the 4-core machines that I use. On mymachines, the enclosed program can max out CPU use with system timebeing a small fraction. It scales ok from 1 to 4 places (i.e., realtime increased only some). The machine's core are hyperthreaded, andthe example maxes out CPU utilization at 8 --- but it takes twice aslong in real time, so the hardware threads don't help much in thiscase. Running two processes with 4 places takes about the same realtime as running one process with 8 places, as does 2 processes with 2places. Do you see similar effects, or does this little example stopscaling before the number of processes matches the number of cores?


As Matthew said, this may be a case where multiple processes are better.

One thing that likely is vastly different between your two systems isthe memory architecture. On Paulo's many-core machine, each group of[probably] 6 CPUs will have its own physical bank of memory which isclose to it and which it uses preferentially. Access to a differentbank may be very costly. Paulo's machine may be spending a much greaterpercentage of time moving data between VM instances that are located indifferent memory regions ... something Matthew can't see on his quad-core.

Paulo, you might take a look at how memory is being allocated [not surewhat tools you have for this] and see what happens if you restrict theprocess to running on various groups of CPUs. It may be that some banksof your memory are "closer" than others.


Hope this helps,
George

--
You received this message because you are subscribed to the Google Groups "Racket 
Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to racket-users+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Re: [racket-users] Places code not using all the CPU

Reply via email to