Hi Laurent,

Unfortunately, as far as I know, none of the current Open MPI developers has access to a system with POE, so the POE process launcher has fallen into disrepair. Attached is a patch that should allow you to compile (however, you may also need to add #include <signal.h> to pls_poe_module.c).

Though this should allow the compile to succeed, launching with POE may not work (it has not been tested for quite a while). If it doesn't work, you should use the rsh launcher instead (pass -mca pls rsh on the command line, or set the parameter using one of the methods here: http://www.open-mpi.org/faq/?category=tuning#setting-mca-params).

Sorry about this. We have an IBM machine at my institution which I am told will have POE on it 'soon', but I am not sure when. Once it does, we will be working on getting POE well supported again.

I should mention that we do use LoadLeveler on one of our machines and Open MPI seems to work with it quite well. I would be interested in hearing how it works for you.

Hope this helps, let me know if this works.

Thanks,

Tim

On Thursday 10 May 2007 02:57 am, Laurent Nguyen wrote:
> Hello,
>
> I tried to install OpenMPI 1.2 but I saw there some problems when
> compiling files with POE. When OpenMPI 1.2.1 was released, I saw in the
> bug fixes that this problem was fixed. Then I tried, but it still
> doesn't work. The problem comes from orte/mca/pls/poe/pls_poe_module.c.
> A static function "static int pls_poe_cancel_operation(void);" is
> declared but not defined in the files. I don't know if my configuration
> make it bug.
>
> So, if someone achieved to install OpenMPI 1.2.1 on IBM, I would like to
> have some advices.
>
> Thank you for your help,
>
> PS: I attached some output files of my installation

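As a minimal sketch of the rsh fallback mentioned in the reply, assuming a Bourne-style shell: besides passing -mca pls rsh on the command line, the same parameter can be set through the environment, since Open MPI reads MCA parameters from variables named OMPI_MCA_<parameter>. The process count and program name in the commented command are placeholders, not values from this thread.

```shell
# Select the rsh launcher for every subsequent mpirun started from this shell.
# Open MPI maps the environment variable OMPI_MCA_<parameter> to the MCA
# parameter of the same name, so this is equivalent to "-mca pls rsh".
export OMPI_MCA_pls=rsh

# Command-line form of the same setting
# (-np 4 and ./my_app are placeholders for illustration only):
#   mpirun -mca pls rsh -np 4 ./my_app
```

The environment variable is convenient when the launch command is buried inside a batch script that is awkward to edit; the command-line flag is easier for a one-off test.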
Index: orte/mca/pls/poe/pls_poe_module.c
===================================================================
--- orte/mca/pls/poe/pls_poe_module.c	(revision 14640)
+++ orte/mca/pls/poe/pls_poe_module.c	(working copy)
@@ -37,6 +37,7 @@
 #include "opal/mca/base/mca_base_param.h"
 #include "opal/util/argv.h"
 #include "opal/util/opal_environ.h"
+#include "opal/util/output.h"
 
 #include "orte/mca/errmgr/errmgr.h"
 #include "orte/mca/gpr/gpr.h"
@@ -69,7 +70,10 @@
 static int pls_poe_signal_job(orte_jobid_t jobid, int32_t signal, opal_list_t *attrs);
 static int pls_poe_signal_proc(const orte_process_name_t *name, int32_t signal);
 static int pls_poe_finalize(void);
-static int pls_poe_cancel_operation(void);
+static int pls_poe_cancel_operation(void) {
+    return ORTE_ERR_NOT_IMPLEMENTED;
+}
+
 
 orte_pls_base_module_t orte_pls_poe_module = {
     pls_poe_launch_job,