On 28 February 2012 11:02, Stefano Bridi <[email protected]> wrote:
> Hi list,
> I have a problem on an SGE setup where the home directories are
> shared through glusterfs and some jobs fail to start because of
> latency in filesystem propagation between the login node and the
> compute node.
> What happens is that a script creates a workdir with some support files,
> "cd"s inside and then qsubs a script; sometimes the script starts to run
> on the compute node too quickly and the "workdir" is not yet visible
> on that node. I know it is a glusterfs problem that must be resolved
> elsewhere, but in the meantime, where can I put a "sleep"?
> Is there a prerun hook that I can use for that? For other uses
> (copying files around and cleanup), is there a similar postrun hook?

The prolog and epilog might be usable, provided the output files are sent somewhere else. I am not sure whether they run in the job's working directory.

Otherwise you could set up a client-side JSV that sleeps for a bit. This would slow down every qsub unless you can find a way to make it sleep only if the current directory is newly created.
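As a rough illustration of both ideas, here is a minimal Python sketch, not a tested Grid Engine recipe. `wait_for_dir` polls until a directory becomes visible and could be called from a prolog or at the top of the job script; `is_newly_created` is a heuristic a submit-side wrapper could use to decide whether a delay is worth it. Both function names, the timeouts, and the 30-second "new" threshold are hypothetical choices for illustration. The sketch assumes `SGE_O_WORKDIR` (the submit-time working directory) is available in the environment where it runs, and that clocks on the hosts are reasonably in sync.

```python
import os
import time


def wait_for_dir(path, timeout=60.0, interval=1.0):
    """Poll until `path` is visible as a directory or `timeout` seconds pass.

    Returns True if the directory appeared, False if the wait timed out.
    """
    deadline = time.monotonic() + timeout
    while True:
        if os.path.isdir(path):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


def is_newly_created(path, max_age=30.0):
    """Guess whether `path` was created within the last `max_age` seconds.

    On Unix, st_ctime is the time of the last metadata change rather than
    a true creation time, so this is only a heuristic.
    """
    age = time.time() - os.stat(path).st_ctime
    return age < max_age


if __name__ == "__main__":
    # Assumption: SGE_O_WORKDIR is set in this environment; fall back to
    # the current directory when the script is run by hand.
    workdir = os.environ.get("SGE_O_WORKDIR", os.getcwd())
    if not wait_for_dir(workdir):
        raise SystemExit(f"{workdir} not visible after waiting")
```

Polling with a timeout avoids a fixed sleep on every job: the wait ends as soon as the directory shows up, and a job whose directory never appears fails with a clear message instead of a confusing startup error.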

William

> thanks
> stef

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users
