Thanks! With blas_set_num_threads(CPU_CORES) the running time with "-p 2" is the same as without it.
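
For anyone finding this thread later, here is a minimal sketch of the setup. It assumes the Julia 0.4-era names quoted in this thread (later releases renamed them) and workers added with addprocs rather than "-p 2":

```julia
# Sketch only; assumes Julia 0.4-era names as quoted in this thread.
addprocs(2)   # adds 2 local workers, like starting with "julia -p 2"

# Run the call on the master and on every worker, so that each
# process sets its BLAS thread count to the number of cores.
@everywhere blas_set_num_threads(CPU_CORES)
```

If julia was already started with "-p 2", only the @everywhere line should be needed.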

On Saturday, April 23, 2016 at 9:07:06 PM UTC+3, Jason Eckstein wrote:
>
> Yes, that is correct. If you want each worker to use every core for
> linear algebra operations, you need to run the following line:
> blas_set_num_threads(CPU_CORES)
>
> I've done this myself and had parallel julia processes each using more 
> than one core.  Make sure that line of code is run in each parallel 
> instance being called.
>
> On Saturday, April 23, 2016 at 4:29:40 AM UTC-6, Dan Y wrote:
>>
>> Maybe it is because I use matrix multiplication in my code. 
>> I guess that the base library responsible for linear algebra uses 
>> multiple cores automatically.
>> But with "-p 2" it is forced to work on only 1 core for some reason.
>> It's only a guess, though.
>>
>
