Re: [OMPI users] btl_tcp_endpoint errors

2007-04-04 Thread Jeff Squyres
On Apr 3, 2007, at 1:22 PM, Heywood, Todd wrote: ssh: connect to host blade45 port 22: No route to host [blade1:05832] ERROR: A daemon on node blade45 failed to start as expected. [blade1:05832] ERROR: There may be more information available from [blade1:05832] ERROR: the remote shell (see ab
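The ssh failure quoted above ("No route to host" on port 22) can be reproduced outside of Open MPI with a plain TCP connection attempt. The following is a minimal sketch, not part of the original thread: `probe_tcp` is a hypothetical helper name, the host `blade45` and port 22 are taken from the quoted error, and the 3-second timeout is an arbitrary assumption.

```python
import errno
import os
import socket


def probe_tcp(host: str, port: int, timeout: float = 3.0) -> int:
    """Return 0 if a TCP connection succeeds, otherwise an error code.

    For connection failures the code is the OS errno value. For name
    resolution failures it is the (negative) getaddrinfo error code.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return 0
    except socket.timeout:
        # Timeouts may carry no errno, so report ETIMEDOUT explicitly.
        return errno.ETIMEDOUT
    except OSError as exc:
        return exc.errno or errno.EIO


if __name__ == "__main__":
    # Host name and port come from the quoted ssh error; adjust as needed.
    for host in ["blade45"]:
        code = probe_tcp(host, 22)
        if code == 0:
            print(f"{host}: port 22 reachable")
        elif code > 0:
            print(f"{host}: connect failed, errno={code} ({os.strerror(code)})")
        else:
            print(f"{host}: name resolution failed (code {code})")
```

Running a probe like this from the launching node against each compute node may help separate a general network or routing problem from one specific to the MPI TCP transport.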

Re: [OMPI users] btl_tcp_endpoint errors

2007-04-03 Thread Heywood, Todd
Hi Adrian, Thanks for that info. The OS is Linux. I was able to get rid of the "connection reset" (104) errors by increasing btl_tcp_endpoint_cache. That leaves the "no route to host" (113) problem. Interestingly, I sometimes (sometimes not) get the same error on daemon startup with ssh when expe

Re: [OMPI users] btl_tcp_endpoint errors

2007-04-03 Thread Adrian Knoth
On Mon, Apr 02, 2007 at 07:15:41PM -0400, Heywood, Todd wrote: Hi, > [blade90][0,1,223][../../../../../ompi/mca/btl/tcp/btl_tcp_endpoint.c:572:mca_btl_tcp_endpoint_complete_connect] connect() failed with errno=113 errno is OS specific, so it's important to know which OS you're using. You can
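Because errno numbers are OS specific, one quick way to decode them is to ask the local system for the symbolic name and message. This is a minimal sketch rather than anything posted in the thread: `describe_errno` is a hypothetical helper, and the results are only meaningful when run on the same OS that produced the error. On Linux, 104 and 113 map to ECONNRESET and EHOSTUNREACH, which matches the "connection reset" and "no route to host" errors discussed in this thread.

```python
import errno
import os


def describe_errno(code: int) -> tuple[str, str]:
    """Return (symbolic name, message) for an errno value on this OS."""
    name = errno.errorcode.get(code, "UNKNOWN")
    return name, os.strerror(code)


# The two values reported in the thread; meanings differ between OSes.
for code in (104, 113):
    name, message = describe_errno(code)
    print(f"errno={code}: {name} ({message})")
```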