Hi, Slurm has been running okay until recently my jobs are being terminated before they finish running. At first I thought it was the memory and I allocated —mem=10000, then moved to —mem=20000, but still the jobs run halfway and stop without an error in the slurm.out file. I then tried a job that ran and completed a week ago, and it terminated when it was halfway as well. Has anyone ever experienced and rectified this? I also tried:
scontrol show config | grep InactiveLimit InactiveLimit = 0 sec Regards, Batsirai The views expressed in this email are, unless otherwise stated, those of the author and not those of the National Health Laboratory Service or its management. The information in this e-mail is confidential and is intended solely for the addressee. Access to this e-mail by anyone else is unauthorized. If you are not the intended recipient, any disclosure, copying, distribution or any action taken or omitted in reliance on this, is prohibited and may be unlawful. Whilst all reasonable steps are taken to ensure the accuracy and integrity of information and data transmitted electronically and to preserve the confidentiality thereof, no liability or responsibility whatsoever is accepted if information or data is, for whatever reason, corrupted or does not reach its intended destination.