Re: need some help with tcp/ip programming

guy keren Mon, 14 May 2007 13:48:51 -0700

Amos Shapira wrote:

On 14/05/07, *guy keren* <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>>wrote:


    Rafi Cohen wrote:
     > Reading some documentation on tcp/ip programming, I had the
    impression
     > that the select mechanism should detect such remote disconnect event,
     > thus enabling me to make a further "read" from this socket which
    should
     > end in reading 0 bytes. Reading 0 bytes should indicate disconnection
     > and let me disconnect propperly from my side and try to reconnect.
     > However, it seems that select does not detect all those disconnect
     > events and even worse, I can not see any rule behind when it does
    detect
     > this and when it does not.

    select does not notice "disconnections". it only notices if the socket
    was closed by the remote side. that's a completely different issue, and
    that's also the only time when you get a 0 return value from the read()
    system call.

I think you are tinkering with semantics and so miss the real issue (doyou work as a consultant? :).

did you write that to rafi or to me? i'm not dealing with semantics - iam dealing with a real problem, that stable applications have to dealwith - when the network breaks, and you never get the close from theother side.

Basically - Rafi expects (as he should) that a "read(fd,...)==0" after aselect(2) call that indicated activity on fd means that the other sidehas closed the connection.


if this is what he expects than, indeed, this is what happens.

Alas - I think that I've just read not longago that there is a bug in Linux' select in implementing just that andit might miss the close from the other side sometimes

what you are describing here sounds astonishing - that such a basicfeature of the sockets implementation is broken? i find this hard tobelieve, without clear evidence.

(sorry, can't finda reference with a quick google, closest I got to might be:http://forum.java.sun.com/thread.jspa?threadID=767657&messageID=4386218<http://forum.java.sun.com/thread.jspa?threadID=767657&messageID=4386218>).I don't remember what was the work-around to that.

you're describing an issue with JVM - not with linux. i neverencountered such a problem when doing socket programming in C or C++.


if you can find something clearer about this, that will be very interesting.

Another point to check - does the read(2) after select(2) return anerror? See select_tut(2) for more details on how to program with select- you should check for errors as well instead of just assuming thatread(2) must succeed ( e.g. interrupt). Also while you are at it - checkwhether pselect(2) can help you improve your program's robustness.
Maybe using poll(2) will help you around that (I also heard that poll isgenerally more efficient because it helps the kernel avoid having tore-interpret the syscall parameters on every call).

it helps avoiding copying too much data to/from kernel space on a sparsesockets list, and it helps avoiding having to scan large sets in thekernel, to initialize its onw internal data structures.


--guy

=================================================================
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]

Re: need some help with tcp/ip programming

Reply via email to