> -----Original Message----- > From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of frank > Sent: Monday, October 17, 2005 11:14 PM > To: bacula-users@lists.sourceforge.net > Subject: RE: [Bacula-users] Connection to remote Storage Daemon hangs > > > -----Original Message----- > > From: [EMAIL PROTECTED] > > [mailto:[EMAIL PROTECTED] On Behalf Of Florian > > Schnabel > > Sent: Monday, October 17, 2005 10:47 PM > > To: bacula-users@lists.sourceforge.net > > Subject: Re: [Bacula-users] Connection to remote Storage Daemon hangs > > > > frank wrote: > > > Hi there, > > > > > > I try to setup Bacula to backup via an OpenVPN tunnel connection, so far > > > without luck. If I backup locally to a mounted file it works without > > > problems however whenever I try to run that same job to store its data to > > > a remote SD, the connection to the SD somehow stops responding. When I > > > retrieve a status from the client it hangs always at the same point > > > “SDReadSeqNo=5 fd=7”: > > > > > > Running Jobs: > > > Director connected at: 17-Oct-05 22:55 JobId 29 Job > > > Client1.2005-10-17_22.49.05 is running. > > > Backup Job started: 17-Oct-05 22:49 > > > Files=340 Bytes=289,365 Bytes/sec=817 > > > Files Examined=360 > > > Processing file: /var/www/html/images/favicon.ico > > > SDReadSeqNo=5 fd=7 > > > > > > When I try to ask for the status of the SD, the connection hangs, however > > > when no jobs are running I can successfully query the status of the SD. > > > > > > After some time the job terminates with the error message “Broken Pipe”. > > > As indicated in the documentation, I played around with the heartbeat > > > parameter however without success. > > > > > > Full log result of a job failure: > > > > > > 17-Oct 21:19 host1-dir: No prior Full backup Job record found. > > > 17-Oct 21:19 host1-dir: No prior or suitable Full backup found. Doing > > > FULL backup. > > > 17-Oct 21:19 host1-dir: Start Backup JobId 21, > > > Job=Client1.2005-10-17_21.19.54 17-Oct 21:20 host2-sd: Volume > > > "Subversion" previously written, moving to end of data. > > > 17-Oct 21:38 host1-fd: Client1.2005-10-17_21.19.54 Fatal error: > > > backup.c:477 Network send error 4298 to SD. ERR=Broken pipe 17-Oct 21:39 > > > host1-dir: Client1.2005-10-17_21.19.54 Error: Bacula 1.36.3 (22Apr05): > > > 17-Oct-2005 21:39:02 > > > JobId: 21 > > > Job: Client1.2005-10-17_21.19.54 > > > Backup Level: Full (upgraded from Incremental) > > > Client: host1-fd > > > FileSet: "Full Set" 2005-10-15 20:27:09 > > > Pool: "Default" > > > Storage: "File" > > > Start time: 17-Oct-2005 21:19:56 > > > End time: 17-Oct-2005 21:39:02 > > > FD Files Written: 359 > > > SD Files Written: 0 > > > FD Bytes Written: 358,575 > > > SD Bytes Written: 0 > > > Rate: 0.3 KB/s > > > Software Compression: 1.2 % > > > Volume name(s): > > > Volume Session Id: 1 > > > Volume Session Time: 1129555063 > > > Last Volume Bytes: 1 > > > Non-fatal FD errors: 0 > > > SD Errors: 0 > > > FD termination status: Error > > > SD termination status: Running > > > Termination: *** Backup Error *** > > > > > > I run bacula version 1.36.3 on Fedora Core 4 installed from the RPM from > > > Sourceforge. The /lib/tls is disabled using LD_ASSUME_KERNEL=2.4.19 from > > > the boot scripts. > > > > > > I’m running out of options, any pointers are appreciated. > > > > > > regards > > > Frank > > > > > > > the "broken pipe" means only your client closed the connection > > ungracefully .. i.e. you jsut closed it without using quit > > > > try to wait, it may jsut take a while > > > > Florian > > The job terminates by itself after about 20 mins. During those 20 minutes I > can no longer query the status of the SD, it does not respond to status > command. From the logfile I see that the SD reports that it did not write any > data however the file on disk grows slightly on every job. > > regards > Frank
I upgraded Bacula to 1.37.40, same problem remains: the SD stops responding and after about 20 mins the job terminates. I do get some additional error message from the storage daemon: "host2-sd: askdir.c:319 Didn't get vol info vol=TestVolume000: ERR=Network error on bnet_recv in req_vol_info." Any idea what this means and how I can solve this? The same job runs without problems if I send it to the local SD. regards Frank ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users