I am an undergraduate working on a Julia parallelism project. Several tutorials I have read say that for the best parallel performance I should spawn as many processes as my processor has cores (mine has 2 cores and 4 hardware threads). However, when I timed a test (a Monte Carlo estimate of pi using 1 billion samples), the run with 17 processes was the fastest. Adding more processes beyond 17 did not speed up the calculation any further. Can anyone explain what is happening here?
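
For reference, the kind of test described above can be sketched roughly as follows. This is a minimal illustration, not the exact code used: it assumes the `Distributed` standard library and a `@distributed` reduction, and the function name `estimate_pi` and the smaller sample count are made up for the example.

```julia
using Distributed

# Assumption: worker processes are started beforehand, e.g. with addprocs(4)
# or by launching Julia as `julia -p 4`. With no workers, the loop below
# simply runs on the main process.

# Estimate pi by drawing n random points in the unit square and counting
# how many fall inside the quarter circle of radius 1.
function estimate_pi(n::Integer)
    hits = @distributed (+) for i in 1:n
        Int(rand()^2 + rand()^2 <= 1.0)
    end
    return 4 * hits / n
end

n = 10_000_000                      # illustrative; the test in question used 1 billion
estimate = estimate_pi(n)           # first call also pays the compilation cost
seconds = @elapsed estimate_pi(n)   # time a second call only
println("workers = ", nworkers(), ", estimate = ", estimate, ", time = ", seconds, " s")
```

Repeating the timed call for each process count and comparing the `seconds` values is one way to produce the kind of comparison described.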
Any help would be great.

Regards,
Noel
