Hello list,

For about three weeks I have been trying to get GE working, step by step: first the master, then the client.

I installed it on Debian (Linux crosswords.ns42.fr 2.6.32-5-amd64 #1 SMP Sun Sep 23 10:07:46 UTC 2012 x86_64 GNU/Linux) using the default packages:

ii  gridengine-client  6.2u5-1squeeze1  Utilities for Grid Engine queue management
ii  gridengine-common  6.2u5-1squeeze1  Distributed resource management - common files
ii  gridengine-exec    6.2u5-1squeeze1  Distributed resource management - Execution Server
ii  gridengine-master  6.2u5-1squeeze1  Distributed resource management - Master Server
ii  gridengine-qmon    6.2u5-1squeeze1  Graphical utilities for Grid Engine queue management


I read that I need to set several variables in the /etc/profile.d/gridengine.sh file:

SGE_ROOT, which is where the installation lives; on Debian that is /usr/lib/gridengine.
SGE_ARCH, which should be $SGE_ROOT/util/arch. Those folders (util/arch) do not exist. Do I have to create them?
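
To make the question concrete, here is a minimal sketch of what I understand the file should contain. The SGE_ROOT value is the Debian path given above; SGE_CELL is my assumption, taken from the "default" directory name, and SGE_ARCH is left out because I do not know what it should be.

```shell
# Sketch of /etc/profile.d/gridengine.sh (assumed contents, not a verified Debian default)
SGE_ROOT=/usr/lib/gridengine   # installation directory used by the Debian packages
SGE_CELL=default               # assumption: cell name taken from the "default" directory
export SGE_ROOT SGE_CELL
```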

I have already created $SGE_ROOT/default/common and linked /etc/gridengine/bootstrap into it.

I can start gridengine-master (/etc/init.d/gridengine-master) and gridengine-exec (/etc/init.d/gridengine-exec)

# ps aux | grep grid
root      4728  0.0  0.9 220152 10140 pts/0    Sl+  07:27   0:00 vi /etc/profile.d/gridengine.sh
sgeadmin  5751  0.0  0.5 127756  5288 ?        Sl   07:36   0:00 /usr/lib/gridengine/sge_qmaster
sgeadmin  6108  0.0  0.1  50208  1596 ?        Sl   08:08   0:00 /usr/lib/gridengine/sge_execd
root      6114  0.0  0.0   7548   836 pts/1    S+   08:08   0:00 grep grid

When I try to launch something (qmon, for example), it fails with this message:
error: fopen("/usr/lib/gridengine/default/common/act_qmaster") failed: No such file or directory
error: unable to read qmaster name: can't open "/usr/lib/gridengine/default/common/act_qmaster" for reading qmaster hostname
error: fopen("/usr/lib/gridengine/default/common/act_qmaster") failed: No such file or directory
unable to send message to qmaster using port 6444 on host "/usr/lib/gridengine": got unexpected parameters
error: fopen("/usr/lib/gridengine/default/common/act_qmaster") failed: No such file or directory

The only act_qmaster file I can find lives in:
/var/lib/gridengine/default/common/act_qmaster
and it contains crossword (the machine's hostname without the domain).
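
A quick way to compare the two locations is a small shell check like the one below. This is only a sketch: the helper name check_act_qmaster is hypothetical, and it assumes the cell directory is called "default" unless another name is passed.

```shell
# Hypothetical helper: report whether act_qmaster is readable under a given root.
# Usage: check_act_qmaster ROOT [CELL]   (CELL defaults to "default")
check_act_qmaster() {
    root=$1
    cell=${2:-default}
    file="$root/$cell/common/act_qmaster"
    if [ -r "$file" ]; then
        # The file holds the hostname of the active qmaster.
        printf 'found: %s (qmaster host: %s)\n' "$file" "$(cat "$file")"
    else
        printf 'missing: %s\n' "$file"
        return 1
    fi
}

# The two locations mentioned above: where the tools look, and where the file is.
check_act_qmaster /usr/lib/gridengine || true
check_act_qmaster /var/lib/gridengine || true
```

On my machine I would expect the first call to report the file as missing and the second to report it as found, which matches the error message.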

I know that my questions are ridiculous and confusing. If you know of a good tutorial that I can read, I would be very happy with that.

Thanks in advance,
Jacques Foucry
--
Jacques Foucry
*NOVΛSPARKS *
IT Manager
Tel : +33 (0)1 42 68 12 61
[email protected]