Re: [OMPI users] OpenIB error messages: reporting the default or telling you what's happening?

2011-09-11 Thread Kevin . Buckley
Ralph, > Are you getting those messages from ompi_info? Or from an MPI app >(and if so, what are you doing to get them)? They're coming out of a user's application. Reason I just wanted to check about what the errors are saying is that things are still in tesing mode wrt the IB kit though, as I

Re: [OMPI users] OpenIB error messages: reporting the default or telling you what's happening?

2011-09-11 Thread Ralph Castain
Hi Kevin Are you getting those messages from ompi_info? Or from an MPI app (and if so, what are you doing to get them)? On Sep 11, 2011, at 5:25 PM, kevin.buck...@ecs.vuw.ac.nz wrote: > I have recently seen some OpenIB time out errors and see the > following reported: > > * btl_openib_ib_retr

[OMPI users] OpenIB error messages: reporting the default or telling you what's happening?

2011-09-11 Thread Kevin . Buckley
I have recently seen some OpenIB time out errors and see the following reported: * btl_openib_ib_retry_count - The number of times the sender will attempt to retry (defaulted to 7, the maximum value). * btl_openib_ib_timeout - The local ACK timeout parameter (defaulted to 10). The actual