Dan Langille wrote:
> On 24 Apr 2006 at 12:05, Martin Horcicka wrote:
> 
>> I run a lot of jobs in parallel (with spooling) and a few of them (different
>> ones every day) usually fail with this message from the client daemons:
>>
>>   Fatal error: Bad response from stored to open command
> 
> Is everthing on the same bacula version?  Are the clients all 1.38.8? 
>  Are the storage daemons all on 1.38.8?

The director and the storage daemon are both 1.38.8 (it's the same server).
The clients are on versions from 1.38.2 to 1.38.8.

>> It seems to happen after elapsing of the job's MaxRunTime or MaxWaitTime and
>> the jobs don't have any destination volume assigned by the storage daemon -
>> which is strange. When I run the job manually immediately after the failure,
>> it works well.
>>
>> Does anyone know what the message really means and where should I look for a
>> cause of the problem?

Right now I'm running another test backup - from 120 jobs run in parallel 118
jobs finished OK and 2 jobs are in a strange state that will likely result in
the problem described above:

>From "status dir":
Director Version: 1.38.8 (14 April 2006) i386-portbld-freebsd5.4 freebsd
5.4-RELEASE-p14
...
Running Jobs:
 JobId Level   Name                       Status
======================================================================
  2389 Differe  b2--backup.2006-04-24_13.08.59 is running
  2404 Differe  b4--backup.2006-04-24_13.09.14 is running
====

>From "status storage":
Storage Version: 1.38.8 (14 April 2006) i386-portbld-freebsd5.4 freebsd
5.4-RELEASE-p14
...
Running Jobs:
Writing: Differential Backup job b2--backup JobId=2389 Volume=""
    pool="Daily" device=""Tape-Library-1-Drive-0" (/dev/nsa0)"
    Files=0 Bytes=0 Bytes/sec=0
    FDReadSeqNo=4 in_msg=4 out_msg=3 fd=130
Writing: Differential Backup job b4--backup JobId=2404 Volume=""
    pool="Daily" device=""Tape-Library-1-Drive-0" (/dev/nsa0)"
    Files=0 Bytes=0 Bytes/sec=0
    FDReadSeqNo=4 in_msg=4 out_msg=3 fd=161
====

Notice the strange Volume="" above.

>From "status client" (machine b2):
Client Version: 1.38.6 (28 March 2006)  i386-portbld-freebsd4.11 freebsd
4.11-RELEASE-p11
...
Running Jobs:
JobId 2389 Job b2--backup.2006-04-24_13.08.59 is running.
    Backup Job started: 24-dub-06 13:09
    Files=0 Bytes=0 Bytes/sec=0
    Files Examined=0
    SDReadSeqNo=4 fd=7
Director connected at: 24-dub-06 15:50
====

>From "status client" (machine b4):
Client Version: 1.38.6 (28 March 2006)  i386-portbld-freebsd4.11 freebsd
4.11-RELEASE-p11
...
Running Jobs:
JobId 2404 Job b4--backup.2006-04-24_13.09.14 is running.
    Backup Job started: 24-dub-06 13:09
    Files=0 Bytes=0 Bytes/sec=0
    Files Examined=0
    SDReadSeqNo=4 fd=7
Director connected at: 24-dub-06 15:52
====

Unfortunately, I don't know how to find out what the system is doing right now
 in more detail but it seems it does not do anything.

Martin



-------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to