More info/questions: (hung rsync is 17407)
# /usr/proc/bin/pstack 17407
17407: /usr/local/bin/rsync -rptgoD --partial --delete-after -vv
--delete -e
ff215d08 poll (ffbefad0, 0, 14)
ff1ce6d4 select (ffbefad0, ff2369f4, ff2369f4, ff2369f4, ff2369f4,
0) + 298
0001b568 msleep (14, ffbefc54, 40, 0, ff233968, 1b638) + 78
0001b620 wait_process (4400, ffbefc54, 1, 49800, 52d10, 1) + c
0001c694 client_run (7, 4, 4400, 1, ffbefdf0, ffbefcc8) + 114
0001cb0c start_client (1, ffbefdf0, 1, 4f400, ff233968, ff1a01e0) +
314
0001cd54 main (2, ffbefdf0, ffbefdfc, 4f338, 0, 0) + 1e8
000160c8 _start (0, 0, 0, 0, 0, 0) + 5c
# /usr/proc/bin/pfiles 17407
17407: /usr/local/bin/rsync -rptgoD --partial --delete-after -vv
--delete -e
Current rlimit: 64 file descriptors
0: S_IFCHR mode:0666 dev:85,0 ino:216127 uid:0 gid:3 rdev:13,2
O_RDONLY|O_NONBLOCK|O_LARGEFILE
1: S_IFREG mode:0644 dev:32,727 ino:970354 uid:0 gid:1 size:9990679
O_WRONLY|O_APPEND
2: S_IFREG mode:0644 dev:32,727 ino:970356 uid:0 gid:1 size:0
O_WRONLY|O_NONBLOCK|O_APPEND
3: S_IFDOOR mode:0444 dev:206,0 ino:53988 uid:0 gid:0 size:0
O_RDONLY|O_LARGEFILE FD_CLOEXEC door to nscd[299]
4: S_IFSOCK mode:0666 dev:201,0 ino:16261 uid:0 gid:0 size:0
O_RDWR|O_NONBLOCK
7: S_IFSOCK mode:0666 dev:201,0 ino:60422 uid:0 gid:0 size:0
O_RDWR|O_NONBLOCK
Neil Schellenberger wrote:
>
> Eric,
>
> Since the poll is nfds=0 and timeo=20 (i.e. almost certainly
> msleep(20)) and since waitpid is looking for 17408, this actually
> really has to be the call of wait_process() at main.c:532 where rsync
> is (apparently) waiting for ssh to die.
>
> The reason that timeout has no effect is that only io_flush() is being
> called in this loop and since there is (presumably) nothing more to be
> writen so the usual I/O loop stuff (including check_timeout()) is not
> being called.
>
> If you can manage it, both pstack and pfiles output would be useful to
> check if my guess of main.c:532 is right and to see if the pipe to ssh
> is still open. If the pipe is still open, we may have our culprit (it
> doesn't realise it should be exiting?). If it's closed, we'll need to
> know what ssh is up to....
>
> Regards,
> Neil
>
> --
> Neil Schellenberger | Voice : (613) 599-2300 ext. 8445
> CrossKeys Systems Corporation | Fax : (613) 599-2330
> 350 Terry Fox Drive | E-Mail: [EMAIL PROTECTED]
> Kanata, Ont., Canada, K2K 2W5 | URL : http://www.crosskeys.com/
> + Greg Moore (1975-1999), Gentleman racer and great Canadian +
--
__________________________________________________________________
Eric T. Whiting AMI Semiconductors
(208) 234-6717 2300 Buckskin Road
(208) 234-6659 (fax) Pocatello,ID 83201
[EMAIL PROTECTED]