Thanks Reuti OK - I made duplicates of all of my parallel environments, so that the slow queue has a different PE list than the fast queue. The submitted job now runs on the correct queue.
However, in some sense I'm back to square one. The reason I created two queues and made them non-requestable is that I wanted to assign resources to users, rather than have them choose them. Now, the user can effectively choose which queue to be in by choosing the correct parallel environment. I can't see a way to make the parallel environments non-requestable. Even if this were possible however, if the user doesn't include a -pe line in their submission script, I don't see how they would specify the number of processors they need. Sorry for my basic questions. I'd appreciate any comments you have. On Mon, Jan 23, 2012 at 2:57 PM, Reuti <[email protected]> wrote: > Am 23.01.2012 um 20:34 schrieb Andrew Pearson: > > > Hi. I'm trying to move from load-based to sequence based scheduling, > and I have a problem. First, a little something about my setup: > > > > I have two sets of machines - 176 'fast' cores in 16-core nodes, and 90 > 'slow' cores in 2-core nodes. I have two corresponding queues - slow.q and > fast.q. The queues are non-requestable. fast.q looks at the @fast host > group, which contains only the names of the fast nodes, and slow.q looks at > the @slow host group, which contains only the names of the slow nodes. In > fast.q, I have slots = 16 and processors = 16, while in slow.q I have slots > = 2 and processors = 2. Finally, slow.q is seq_no 1 and fast.q is seq_no 2. > > > > Here's the problem: If I submit a 120 processor job (so it's too large > to fit on the slow cores), it still gets assigned to slow.q. This in > itself is bad - I want such a job to go directly to fast.q. Its gets worse > though - because there aren't enough machines in slow.q, the remaining 30 > threads end up on nodes in fast.q! I don't understand how this second part > is possible. I've done qstat -f, and my 'fast' compute nodes definitely > aren't listed as being members of slow.q. > > > > Any suggestions? Thank you. > > If the same PE is attached to more than one queue, it can collect slots > from any of them: > > http://gridengine.org/pipermail/users/2012-January/002526.html > > -- Reuti > >
_______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
