Hi Laurent,

Unfortunately, as far as I know, none of the current Open MPI developers has 
access to a system with POE, so the POE process launcher has fallen into 
disrepair. Attached is a patch that should allow you to compile (however, you 
may also need to add #include <signal.h> to pls_poe_module.c). 

Though this should allow the compile to succeed, launching with POE may not 
work (it has not been tested for quite a while). If it doesn't work, you 
should use the rsh launcher instead (pass -mca pls rsh on the command line, 
or set the parameter using one of the methods here: 
http://www.open-mpi.org/faq/?category=tuning#setting-mca-params). 

Sorry about this. We have an IBM machine at my institution which I am told 
will have POE on it 'soon', but I am not sure when. Once it does, we will be 
working on getting POE well supported again.

I should mention that we do use LoadLeveler on one of our machines and Open 
MPI seems to work with it quite well. I would be interested in hearing how it 
works for you.

Hope this helps, let me know if this works.

Thanks,

Tim

On Thursday 10 May 2007 02:57 am, Laurent Nguyen wrote:
> Hello,
>
> I tried to install OpenMPI 1.2 but I saw there some problems when
> compiling files with POE. When OpenMPI 1.2.1 was released, I saw in the
> bug fixes that this problem was fixed. Then I tried, but it still
> doesn't work. The problem comes from orte/mca/pls/poe/pls_poe_module.c.
> A static function "static int pls_poe_cancel_operation(void);" is
> declared but not defined in the files. I don't know if my configuration
> make it bug.
>
> So, if someone achieved to install OpenMPI 1.2.1 on IBM, I would like to
> have some advices.
>
> Thank you for your help,
>
> PS: I attached some output files of my installation
Index: orte/mca/pls/poe/pls_poe_module.c
===================================================================
--- orte/mca/pls/poe/pls_poe_module.c	(revision 14640)
+++ orte/mca/pls/poe/pls_poe_module.c	(working copy)
@@ -37,6 +37,7 @@
 #include "opal/mca/base/mca_base_param.h"
 #include "opal/util/argv.h"
 #include "opal/util/opal_environ.h"
+#include "opal/util/output.h"

 #include "orte/mca/errmgr/errmgr.h"
 #include "orte/mca/gpr/gpr.h"
@@ -69,7 +70,10 @@
 static int pls_poe_signal_job(orte_jobid_t jobid, int32_t signal, opal_list_t *attrs);
 static int pls_poe_signal_proc(const orte_process_name_t *name, int32_t signal);
 static int pls_poe_finalize(void);
-static int pls_poe_cancel_operation(void);
+static int pls_poe_cancel_operation(void) {
+    return ORTE_ERR_NOT_IMPLEMENTED;
+}
+    

 orte_pls_base_module_t orte_pls_poe_module = {
     pls_poe_launch_job,

Reply via email to