Thanks! With blas_set_num_threads(CPU_CORES) the running time is equal.
On Saturday, April 23, 2016 at 9:07:06 PM UTC+3, Jason Eckstein wrote: > > Yes, that is correct. If you want to force each worker to use every core > for linear algebra operations you need to use the following > line: blas_set_num_threads(CPU_CORES) > > I've done this myself and had parallel julia processes each using more > than one core. Make use that line of code is used in each parallel > instance being called. > > On Saturday, April 23, 2016 at 4:29:40 AM UTC-6, Dan Y wrote: >> >> Maybe it is because I use matrix multiplication in my code. >> I guess that base library which is responsible for linear algebra uses >> multiple cores automatically. >> But with "-p 2" it is forced to work only on 1 core for some reason. >> It's only a guess, though. >> >