On Wednesday 14 December 2005 22:13, Jonas Mixter wrote:
> On 2005-12-14 17:55, Attila Fülöp wrote:
> > Jonas Mixter wrote:
> >> On 2005-12-14 10:29, Jonas Mixter wrote:
> >>> Hi!
> >>> Yesterday I upgraded my director, fd and sd from bacula 1.36.2 (that
> >>> comes with Debian sarge) to 1.38.2.
> >>> I have one machine running the director (with a connected tape-station)
> >>> and one machine with a lot of disks and a storage daemon (this should
> >>> later on be placed elsewhere, hence the separation of the daemons). The
> >>> director is named merry-dir and the storage daemon is named pippin-sd.
> >>> Before the upgrade I could backup for example the catalog to a file
> >>> storage on the machine with just the sd. I could also restore files
> >>> from
> >>> the backup.
> >>>
> >>> After the upgrade, I could backup just as before. The job exits OK
> >>> and I
> >>> could see that the files that hold the backup are growing. But I cannot
> >>> restore...
> >>> When running a restore I could mark the files I want and the jobs is
> >>> visible underneath "Running Jobs" in bconsole. The job is "waiting for
> >>> Client merry-fd to connect to Storage File" and after a while the job
> >>> times out.
> >>>
> >>> Here's the output when trying to restore a file to merry-fd (running on
> >>> the same server as the director):
> >>>
> >>> 14-Dec 09:03 merry-dir: Start Restore Job
> >>> RestoreFiles.2005-12-14_09.03.22
> >>> 14-Dec 09:13 merry-fd: RestoreFiles.2005-12-14_09.03.22 Fatal error:
> >>> Authorization key rejected by Storage daemon.
> >>> Please see
> >>> http://www.bacula.org/rel-manual/faq.html#AuthorizationErrors
> >>> for help.
> >>> 14-Dec 09:13 merry-fd: RestoreFiles.2005-12-14_09.03.22 Fatal error:
> >>> Failed to authenticate Storage daemon.
> >>> 14-Dec 09:13 merry-dir: RestoreFiles.2005-12-14_09.03.22 Fatal error:
> >>> Socket error on Storage command: ERR=No data available
> >>> 14-Dec 09:13 merry-dir: RestoreFiles.2005-12-14_09.03.22 Error: Bacula
> >>> 1.38.2 (20Nov05): 14-Dec-2005 09:13:30
> >>> JobId:                  512
> >>> Job:                    RestoreFiles.2005-12-14_09.03.22
> >>> Client:                 merry-fd
> >>> Start time:             14-Dec-2005 09:03:25
> >>> End time:               14-Dec-2005 09:13:30
> >>> Files Expected:         1
> >>> Files Restored:         0
> >>> Bytes Restored:         0
> >>> Rate:                   0.0 KB/s
> >>> FD Errors:              0
> >>> FD termination status: SD termination status:  Waiting on FD
> >>> Termination:            *** Restore Error ***
> >>>
> >>> 14-Dec 09:13 merry-dir: RestoreFiles.2005-12-14_09.03.22 Error: Bacula
> >>> 1.38.2 (20Nov05): 14-Dec-2005 09:13:30
> >>> JobId:                  512
> >>> Job:                    RestoreFiles.2005-12-14_09.03.22
> >>> Client:                 merry-fd
> >>> Start time:             14-Dec-2005 09:03:25
> >>> End time:               14-Dec-2005 09:13:30
> >>> Files Expected:         1
> >>> Files Restored:         0
> >>> Bytes Restored:         0
> >>> Rate:                   0.0 KB/s
> >>> FD Errors:              1
> >>> FD termination status: SD termination status:  Waiting on FD
> >>> Termination:            *** Restore Error ***
> >>>
> >>>
> >>> Note the job fails twice... Is that really correct?
> >>> I could backup and restore without any problems to a local file on the
> >>> server running the director (tested only once though).
> >>> I could check the status of the storage daemon from the director, and
> >>> also do backups so the passwords should be OK, right?
> >>>
> >>> I've also seen this error sometimes "14-Dec 09:43 merry-sd: Job
> >>> RestoreFiles.2005-12-14_09.43.50 waiting to reserve a device." but
> >>> don't
> >>> really know the meaning. In a previous post i saw that Kern asked about
> >>> the /lib/tls-directory when presented with this error. I do have the
> >>> /lib/tls, but start all my daemons with the startscript that bacula
> >>> provides. In that script the "LD_ASSUME_KERNEL=2.4.19"-variable is
> >>> exported. (If running kernel 2.4! is the /lib/tls a problem with kernel
> >>> 2.6 too?)
> >>> I use kernel 2.4.27 on the server with the director, and 2.6.8 on
> >>> the sd
> >>> server. The /lib/tls is there on both the servers which are running
> >>> Debian sarge.
> >>> Should really "merry-sd" be involved? I've selected the storage on
> >>> pippin-sd when running "restore" in bconsole. Could it be that the
> >>> director/fd is trying to restore from the wrong sd? Shouldn't the job
> >>> then fail with a "volume not found" error, ask me to mount a volume, or
> >>> similiar?
> >>>
> >>> If I try to cancel a job, the director from time to time complains it
> >>> cannot the job even though it's listed under running jobs.
> >>> *cancel
> >>> Automatically selected Job: JobId=515
> >>> Job=RestoreFiles.2005-12-14_10.18.20
> >>> Confirm cancel (yes/no): yes
> >>> 3902 Job RestoreFiles.2005-12-14_10.18.20 not found.
> >>>
> >>> The job is marked "has been canceled" in bconsole anyway. I don't
> >>> remember if I got errors like this before the upgrade.
> >>>
> >>> What could be wrong in my setup? I've been banging my head to the wall
> >>> for quite a few hours now and I'm out of ideas.
> >>>
> >>> Best regards, Jonas Mixter
> >>
> >> Hi!
> >> I've doing some more tests today.
> >> There seems to be no problems at all restoring from backups stored on
> >> tape.
> >> If I copy the entire backupfile from the server with running only the
> >> sd, to the server running the bacula-dir I could do a restore files
> >> without any problems.
> >> Does anyone have any suggestions where continue trouble shooting?
> >>
> >> Best regards, Jonas Mixter
> >
> > Authorization key rejected by Storage daemon.
> > Please see http://www.bacula.org/rel-manual/faq.html#AuthorizationErrors
> > for help.
> > 14-Dec 09:13 merry-fd: RestoreFiles.2005-12-14_09.03.22 Fatal error:
> > Failed to authenticate Storage daemon.
> >
> > Either you have a password mismatch or some firewall in between.
>
> Hi!
> Thank you for your answer. I'm afraid this is not the problem though,
> even if it seems obvious.
> There is no firewall between (or on) the machines. They are connected to
> the very same network switch and I got no iptables or similar activated.
> The passwords seem to match when I review the config files and I have no
> problem using "status storage" or running backups _to_ this storage
> daemon. I shouldn't be able to do that if the passwords weren't
> matching, right? I only get this problem when trying to restore _from_
> the storage daemon.
> It also seems more like a timeout (10 minutes between the start of the
> job and the error) than an actual authorization error too me. Am I wrong?

Most likely the error message is correct, but for a slightly more subtle 
reason -- I suspect that either your Director and the Client don't resolve 
the SD address to the same IP or more likely the SD has crashed.

>
-- 
Best regards,

Kern

  (">
  /\
  V_V


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_idv37&alloc_id865&op=click
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to