Re: [OMPI users] Error when using more than 88 processors for a specific executable -Abyss

2011-10-15 Thread Gustavo Correa
Hi Ashwani,
Also, check if there are rogue processes from old jobs on your compute nodes taking up lots of file descriptors. A reboot should fix this easily.
My two cents.
Gus Correa
On Oct 15, 2011, at 10:34 AM, Ralph Castain wrote:
> Okay, let's try spreading them out more, just to avoid put
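
As a rough way to act on the suggestion above without rebooting, the sketch below lists processes by how many file descriptors they hold open. It assumes a Linux node with a mounted /proc filesystem; the function name open_fd_counts is hypothetical and not part of Open MPI or ABySS. Processes owned by other users are skipped unless the script runs with sufficient privileges.

```python
import os


def open_fd_counts(proc_root="/proc"):
    """Return (pid, open_fd_count) pairs, largest count first.

    Assumes a Linux-style /proc layout; returns an empty list if
    proc_root cannot be read.
    """
    counts = []
    try:
        entries = os.listdir(proc_root)
    except OSError:
        return counts
    for entry in entries:
        if not entry.isdigit():
            continue  # not a process directory
        try:
            n = len(os.listdir(os.path.join(proc_root, entry, "fd")))
        except OSError:
            continue  # process exited, or we lack permission to inspect it
        counts.append((int(entry), n))
    counts.sort(key=lambda item: item[1], reverse=True)
    return counts


if __name__ == "__main__":
    # Show the ten processes holding the most descriptors.
    for pid, n in open_fd_counts()[:10]:
        print(pid, n)
```

Running this on each compute node before submitting the job would show whether leftover processes from earlier runs are still holding descriptors.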

Re: [OMPI users] Error when using more than 88 processors for a specific executable -Abyss

2011-10-15 Thread Ralph Castain
Okay, let's try spreading them out more, just to avoid putting more on a node than you actually need. Add -bynode to your cmd line. This will spread the procs across all the nodes. Our default mode is "byslot", which means we fill each node before adding procs to the next one. "bynode" puts one
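
The difference between the two mapping modes described above can be illustrated with a small simulation. This is only a sketch of the placement idea, not Open MPI's actual mapper: the function names, the node names, and the wrap-around behaviour when all slots are full are assumptions made for illustration.

```python
def map_by_slot(num_procs, nodes, slots_per_node):
    """Fill every slot on a node before moving on to the next node."""
    placement = {node: [] for node in nodes}
    for rank in range(num_procs):
        # Assumption: wrap around to the first node once all slots are used.
        node = nodes[(rank // slots_per_node) % len(nodes)]
        placement[node].append(rank)
    return placement


def map_by_node(num_procs, nodes):
    """Place one process per node in turn, cycling through the nodes."""
    placement = {node: [] for node in nodes}
    for rank in range(num_procs):
        placement[nodes[rank % len(nodes)]].append(rank)
    return placement


if __name__ == "__main__":
    nodes = ["node0", "node1", "node2"]  # hypothetical node names
    print(map_by_slot(4, nodes, slots_per_node=2))
    print(map_by_node(4, nodes))
```

With four processes on three two-slot nodes, the by-slot mapping leaves the third node empty, while the by-node mapping puts at least one process on every node. That is why spreading by node lowers the number of processes, and therefore the number of descriptors, used on any single node.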

Re: [OMPI users] Error when using more than 88 processors for a specific executable -Abyss

2011-10-14 Thread Ashwani Kumar Mishra
Hi Ralph,
I have no idea how many file descriptors this program consumes :(
Best Regards,
Ashwani
On Sat, Oct 15, 2011 at 12:08 AM, Ralph Castain wrote:
> Should be plenty for us - does your program consume a lot?
>
> On Oct 14, 2011, at 12:25 PM, Ashwani Kumar Mishra wrote:
> > Hi Ra

Re: [OMPI users] Error when using more than 88 processors for a specific executable -Abyss

2011-10-14 Thread Ralph Castain
Should be plenty for us - does your program consume a lot?
On Oct 14, 2011, at 12:25 PM, Ashwani Kumar Mishra wrote:
> Hi Ralph,
> fs.file-max = 10
> is this ok or less?
>
> Best Regards,
> Ashwani
>
> On Fri, Oct 14, 2011 at 11:45 PM, Ralph Castain wrote:
> Can't offer much about the

Re: [OMPI users] Error when using more than 88 processors for a specific executable -Abyss

2011-10-14 Thread Ashwani Kumar Mishra
Hi Ralph,
fs.file-max = 10
Is this OK, or is it too low?
Best Regards,
Ashwani
On Fri, Oct 14, 2011 at 11:45 PM, Ralph Castain wrote:
> Can't offer much about the qsub job. On the first one, what is your limit
> on the number of file descriptors? Could be your sys admin has it too low.
>
> On Oc
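
Note that fs.file-max is the system-wide ceiling on open files, while each process is also bound by its own limit (what "ulimit -n" reports), and the per-process limit is often the one that is hit first. A minimal sketch for reading both on a Linux node follows; the function names are hypothetical, and the /proc path is the standard Linux location, which is assumed to be present.

```python
import resource


def descriptor_limits():
    """Return the (soft, hard) per-process limit on open file descriptors."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def system_file_max(path="/proc/sys/fs/file-max"):
    """Return the system-wide fs.file-max value, or None if unavailable."""
    try:
        with open(path) as handle:
            return int(handle.read().strip())
    except (OSError, ValueError):
        return None


if __name__ == "__main__":
    soft, hard = descriptor_limits()
    print("per-process soft limit:", soft)
    print("per-process hard limit:", hard)
    print("system-wide fs.file-max:", system_file_max())
```

Since a batch scheduler may apply different limits to jobs than to interactive shells, it is worth running this inside a submitted job as well as from a login session.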

Re: [OMPI users] Error when using more than 88 processors for a specific executable -Abyss

2011-10-14 Thread Ralph Castain
Can't offer much about the qsub job. On the first one, what is your limit on the number of file descriptors? Could be your sys admin has it too low.
On Oct 14, 2011, at 12:07 PM, Ashwani Kumar Mishra wrote:
> Hello,
> When I try to run the following command I receive the following error when I

[OMPI users] Error when using more than 88 processors for a specific executable -Abyss

2011-10-14 Thread Ashwani Kumar Mishra
Hello,
When I try to run the following command, I receive the following error when I submit this job on a cluster of 40 nodes, each with 8 processors and 8 GB RAM:
Both commands work well as long as I use only up to 88 processors in the cluster, but the moment I allocate mor