> -----Original Message-----
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of frank
> Sent: Monday, October 17, 2005 11:14 PM
> To: bacula-users@lists.sourceforge.net
> Subject: RE: [Bacula-users] Connection to remote Storage Daemon hangs
> 
> > -----Original Message-----
> > From: [EMAIL PROTECTED] 
> > [mailto:[EMAIL PROTECTED] On Behalf Of Florian 
> > Schnabel
> > Sent: Monday, October 17, 2005 10:47 PM
> > To: bacula-users@lists.sourceforge.net
> > Subject: Re: [Bacula-users] Connection to remote Storage Daemon hangs
> > 
> > frank wrote:
> > > Hi there,
> > > 
> > > I try to setup Bacula to backup via an OpenVPN tunnel connection, so far 
> > > without luck. If I backup locally to a mounted file it works without 
> > > problems however whenever I try to run that same job to store its data to 
> > > a remote SD, the connection to the SD somehow stops responding. When I 
> > > retrieve a status from the client it hangs always at the same point 
> > > “SDReadSeqNo=5 fd=7”:
> > > 
> > > Running Jobs:
> > > Director connected at: 17-Oct-05 22:55 JobId 29 Job
> > > Client1.2005-10-17_22.49.05 is running.
> > >     Backup Job started: 17-Oct-05 22:49
> > >     Files=340 Bytes=289,365 Bytes/sec=817
> > >     Files Examined=360
> > >     Processing file: /var/www/html/images/favicon.ico
> > >     SDReadSeqNo=5 fd=7
> > > 
> > > When I try to ask for the status of the SD, the connection hangs, however 
> > > when no jobs are running I can successfully query the status of the SD.
> > > 
> > > After some time the job terminates with the error message “Broken Pipe”. 
> > > As indicated in the documentation, I played around with the heartbeat 
> > > parameter however without success.
> > > 
> > > Full log result of a job failure:
> > > 
> > > 17-Oct 21:19 host1-dir: No prior Full backup Job record found.
> > > 17-Oct 21:19 host1-dir: No prior or suitable Full backup found. Doing 
> > > FULL backup.
> > > 17-Oct 21:19 host1-dir: Start Backup JobId 21, 
> > > Job=Client1.2005-10-17_21.19.54 17-Oct 21:20 host2-sd: Volume 
> > > "Subversion" previously written, moving to end of data.
> > > 17-Oct 21:38 host1-fd: Client1.2005-10-17_21.19.54 Fatal error: 
> > > backup.c:477 Network send error 4298 to SD. ERR=Broken pipe 17-Oct 21:39 
> > > host1-dir: Client1.2005-10-17_21.19.54 Error: Bacula 1.36.3 (22Apr05): 
> > > 17-Oct-2005 21:39:02
> > >   JobId:                  21
> > >   Job:                    Client1.2005-10-17_21.19.54
> > >   Backup Level:           Full (upgraded from Incremental)
> > >   Client:                 host1-fd
> > >   FileSet:                "Full Set" 2005-10-15 20:27:09
> > >   Pool:                   "Default"
> > >   Storage:                "File"
> > >   Start time:             17-Oct-2005 21:19:56
> > >   End time:               17-Oct-2005 21:39:02
> > >   FD Files Written:       359
> > >   SD Files Written:       0
> > >   FD Bytes Written:       358,575
> > >   SD Bytes Written:       0
> > >   Rate:                   0.3 KB/s
> > >   Software Compression:   1.2 %
> > >   Volume name(s):         
> > >   Volume Session Id:      1
> > >   Volume Session Time:    1129555063
> > >   Last Volume Bytes:      1
> > >   Non-fatal FD errors:    0
> > >   SD Errors:              0
> > >   FD termination status:  Error
> > >   SD termination status:  Running
> > >   Termination:            *** Backup Error ***
> > > 
> > > I run bacula version 1.36.3 on Fedora Core 4 installed from the RPM from 
> > > Sourceforge. The /lib/tls is disabled using LD_ASSUME_KERNEL=2.4.19 from 
> > > the boot scripts.
> > > 
> > > I’m running out of options, any pointers are appreciated.
> > > 
> > > regards
> > > Frank
> > > 
> > 
> > the "broken pipe" means only your client closed the connection 
> > ungracefully .. i.e. you jsut closed it without using quit
> > 
> > try to wait, it may jsut take a while
> > 
> > Florian
> 
> The job terminates by itself after about 20 mins. During those 20 minutes I 
> can no longer query the status of the SD, it does not respond to status 
> command. From the logfile I see that the SD reports that it did not write any 
> data however the file on disk grows slightly on every job.
> 
> regards
> Frank

I upgraded Bacula to 1.37.40, same problem remains: the SD stops responding and 
after about 20 mins the job terminates.

I do get some additional error message from the storage daemon: 
"host2-sd: askdir.c:319 Didn't get vol info vol=TestVolume000: ERR=Network 
error on bnet_recv in req_vol_info."

Any idea what this means and how I can solve this? The same job runs without 
problems if I send it to the local SD.

regards
Frank



-------------------------------------------------------
This SF.Net email is sponsored by:
Power Architecture Resource Center: Free content, downloads, discussions,
and more. http://solutions.newsforge.com/ibmarch.tmpl
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to