Thanks! SLURM Elastic Computing seems like it might do the trick. I need to
try it out.
xCAT is interesting, too. It seems to be the HPC version of Salt'ed
Cobbler. :) I don't know that it's so important for our problem. We have a
small cluster for testing against the cloud, primarily. I could se
For provisioning, I personally use xCAT, which just started
supporting docker
http://xcat-docs.readthedocs.io/en/stable/advanced/docker/lifecycle_management.html
Together with slurm elastic computing feature
http://xcat-docs.readthedocs.io/en/stable/advanced/docker/lifecycle_ma
The closest thing we have to what you describe is the “orte-dvm” - this allows
one to launch a persistent collection of daemons. You can then run your
applications against it using “mpiexec -hnp ” where the url is that of the
orte-dvm “head” daemon.
If I understand you correctly, however, then
Hi Daniel,
Thanks.
Shifter is also interesting. However, it assumes our users map to a Unix
user id, and therefore the access to the shared file system can be
controlled by normal Unix permissions. That's not scalable, and makes for
quite a bit of complexity. Each node must know about each user s
Did you check shifter?
https://www.nersc.gov/assets/Uploads/cug2015udi.pdf ,
https://www.nersc.gov/assets/Uploads/cug2015udi.pdf ,
http://www.nersc.gov/research-and-development/user-defined-images/ ,
https://github.com/NERSC/shifter
On 06/03/2016 01:58 AM, Rob Na