Thanks for the thoughts people, much appreciated, gives me some ideas to work with.
I'm going to play around with pure Julia solutions first as my prior experience trying to get MPI.jl running on my cluster in a REPL was painful. This could be the wrong attitude and I may have to change it. Workers will be in the low tens as I only need one per compute node.
