Hi List, When running qstat, I am sometimes receiving messages like: ''ERROR: failed receiving gdi request response for mid=1 (got syncron message receive timeout error)".
Also, qping - info shows warning/error and high number of qmaster clients (> 40) at times when I receive messages like above. So it seems to me that qmaster is not able to handle higher number of clients for some reason. I am thinking of two possible reasoning: 1. Buggy jsv script (but jsv should not be executed when running just 'qstat' right?) 2. Qmaster spool directory stored on shared NFS storage Could someone tell me more about this? Anyone experienced similar issue? It seems to me that qmaster should handle ~100 clients without any substantial problem (at least machine CPU load is minimal). Thanks, Ondrej ----- The information contained in this e-mail and in any attachments is confidential and is designated solely for the attention of the intended recipient(s). If you are not an intended recipient, you must not use, disclose, copy, distribute or retain this e-mail or any part thereof. If you have received this e-mail in error, please notify the sender by return e-mail and delete all copies of this e-mail from your computer system(s). Please direct any additional queries to: communicati...@s3group.com. Thank You. Silicon and Software Systems Limited (S3 Group). Registered in Ireland no. 378073. Registered Office: South County Business Park, Leopardstown, Dublin 18. _______________________________________________ SGE-discuss mailing list SGE-discuss@liv.ac.uk https://arc.liv.ac.uk/mailman/listinfo/sge-discuss