I noticed that once I addprocs(), subsequent "using" statements were extremely slow. I guess in this case its loading the module on each processor, but if it happens in parallel it shouldn't be *that* much more wall time, and here I'm talking about two orders of magnitude difference.
Assuming I've got a file Empty.jl who contents is, module Empty end then single threaded: tic() using Empty toc() elapsed time: 0.024461076 seconds vs. multi-threaded: addprocs() #I've got 8 procs tic() using Empty toc() elapsed time: 2.479418079 seconds Should I submit this as an Issue on Github, or is there something else going on? I've checked both Julia 0.4.5. and 0.5 (01e3c8a). I'm on Ubuntu 16.04 64bit.