Dan Langille wrote: > On 24 Apr 2006 at 12:05, Martin Horcicka wrote: > >> I run a lot of jobs in parallel (with spooling) and a few of them (different >> ones every day) usually fail with this message from the client daemons: >> >> Fatal error: Bad response from stored to open command > > Is everthing on the same bacula version? Are the clients all 1.38.8? > Are the storage daemons all on 1.38.8?
The director and the storage daemon are both 1.38.8 (it's the same server). The clients are on versions from 1.38.2 to 1.38.8. >> It seems to happen after elapsing of the job's MaxRunTime or MaxWaitTime and >> the jobs don't have any destination volume assigned by the storage daemon - >> which is strange. When I run the job manually immediately after the failure, >> it works well. >> >> Does anyone know what the message really means and where should I look for a >> cause of the problem? Right now I'm running another test backup - from 120 jobs run in parallel 118 jobs finished OK and 2 jobs are in a strange state that will likely result in the problem described above: >From "status dir": Director Version: 1.38.8 (14 April 2006) i386-portbld-freebsd5.4 freebsd 5.4-RELEASE-p14 ... Running Jobs: JobId Level Name Status ====================================================================== 2389 Differe b2--backup.2006-04-24_13.08.59 is running 2404 Differe b4--backup.2006-04-24_13.09.14 is running ==== >From "status storage": Storage Version: 1.38.8 (14 April 2006) i386-portbld-freebsd5.4 freebsd 5.4-RELEASE-p14 ... Running Jobs: Writing: Differential Backup job b2--backup JobId=2389 Volume="" pool="Daily" device=""Tape-Library-1-Drive-0" (/dev/nsa0)" Files=0 Bytes=0 Bytes/sec=0 FDReadSeqNo=4 in_msg=4 out_msg=3 fd=130 Writing: Differential Backup job b4--backup JobId=2404 Volume="" pool="Daily" device=""Tape-Library-1-Drive-0" (/dev/nsa0)" Files=0 Bytes=0 Bytes/sec=0 FDReadSeqNo=4 in_msg=4 out_msg=3 fd=161 ==== Notice the strange Volume="" above. >From "status client" (machine b2): Client Version: 1.38.6 (28 March 2006) i386-portbld-freebsd4.11 freebsd 4.11-RELEASE-p11 ... Running Jobs: JobId 2389 Job b2--backup.2006-04-24_13.08.59 is running. Backup Job started: 24-dub-06 13:09 Files=0 Bytes=0 Bytes/sec=0 Files Examined=0 SDReadSeqNo=4 fd=7 Director connected at: 24-dub-06 15:50 ==== >From "status client" (machine b4): Client Version: 1.38.6 (28 March 2006) i386-portbld-freebsd4.11 freebsd 4.11-RELEASE-p11 ... Running Jobs: JobId 2404 Job b4--backup.2006-04-24_13.09.14 is running. Backup Job started: 24-dub-06 13:09 Files=0 Bytes=0 Bytes/sec=0 Files Examined=0 SDReadSeqNo=4 fd=7 Director connected at: 24-dub-06 15:52 ==== Unfortunately, I don't know how to find out what the system is doing right now in more detail but it seems it does not do anything. Martin ------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users