On Tue, Jan 27, 2009 at 10:12:03AM +0000, Allan Black wrote: > Jason Dixon wrote: > > a "Packet size too big" error. The Director resides on a global zone in > > Solaris x86. I've managed to capture a truss during one of the > > failures: > > http://mirrors.omniti.com/bacula/bacula.truss > ... > [This is where it goes wrong] > > Just after half way through the above, this happens: > > 14106/68: pollsys(0xFE55FE10, 1, 0xFE55FEC8, 0x00000000) = 1 > 14106/68: fd=6 ev=POLLRDNORM rev=POLLRDNORM > 14106/68: timeout: 5.000000000 sec > > which indicates that a "normal" incoming event has occurred on file > descriptor 6, > which is the connection to the SD. 3 lines later, > > 14106/68: read(6, 0xFE55FF80, 4) Err#131 > ECONNRESET > > The FD attempts to read from the SD, and gets "Connection reset by peer". From > the job report you posted, it doesn't look like the SD is crashing/restarting, > nor is the machine rebooting. > > Something, somewhere though, is interfering with the connection between the FD > and the SD. Sorry to say this, but you may have to truss the SD!
I've enabled the truss to run on bacula-sd each night. I'll report back my findings. Thanks, -- Jason Dixon OmniTI Computer Consulting, Inc. jdi...@omniti.com 443.325.1357 x.241 ------------------------------------------------------------------------------ This SF.net email is sponsored by: SourcForge Community SourceForge wants to tell your story. http://p.sf.net/sfu/sf-spreadtheword _______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users