Hello,

Stephan Heine - [Genetic Interactive] wrote:

Hi,

First off, thank you for a fantastic backup solution.
Secondly, sorry for the long post.

We've seen longer ones...

The Bacula configuration runs perfectly at this client's site,
backing up about 15 clients. Flawlessly.

Yesterday I added a new client to the backup: an
AMD system with an NVidia-based gigabit NIC.
This system has no trouble receiving DHCP, drops no packets and communicates
to the rest of the network perfectly.

Here is my problem.

The backup fails every time (mostly early in the backup) with the errors
below.
I suspect the onboard NIC, because if I install a
Realtek-based NIC it works perfectly.

The board was swopped but the problem persists.

I do remember some nasty comments about nVidia network hardware on other lists, although I never experienced problems with the few I managed myself.

So, it might simply be a hardware fault. Of course, drivers are important, too, and possibly firewall settings or timeouts. I don't know about your RedHat version, but perhaps these are managed by the network interface's Ethernet ID?

Error status is received:

*messages
27-Jun-2005 10:04 backer-dir: No prior Full backup Job record found.
27-Jun-2005 10:04 backer-dir: No prior or suitable Full backup found. Doing
FULL backup.
27-Jun-2005 10:04 backer-dir: Start Backup JobId 4932,
Job=OctaneDocs.2005-06-27_10.04.47
27-Jun-2005 10:04 backer-sd: Volume "octanedocsfull_0001" previously
written, moving to end of data.
27-Jun-2005 10:05 octane-fd: OctaneDocs.2005-06-27_10.04.47 Fatal error:
..\filed\../../filed/backup.c:498 Network send error to SD. ERR=An existing
connection was forcibly closed by the remote host.
27-Jun-2005 10:05 backer-dir: OctaneDocs.2005-06-27_10.04.47 Error: Bacula
1.36.0 (20Oct04): 27-Jun-2005 10:05
<SNIP>

And another time:

*messages
27-Jun-2005 11:11 backer-dir: Start Backup JobId 4933,
Job=OctaneDocs.2005-06-27_11.11.41
27-Jun-2005 11:12 octane-fd: OctaneDocs.2005-06-27_11.11.41 Fatal error:
..\filed\../../filed/backup.c:472 Network send error 77 to SD. ERR=An
existing connection was forcibly closed by the remote host.
27-Jun-2005 11:12 backer-dir: OctaneDocs.2005-06-27_11.11.41 Error: Bacula
1.36.0 (20Oct04): 27-Jun-2005 11:12
<SNIP>

What I have also noticed is that the SD status reports the failed job as
still running, and it keeps the SD occupied.
This makes subsequent backups to this device fail. If the SD is restarted
and a different job is run, it works perfectly.

That's normal, because the Bacula components use some long timeouts. After some hours the SD should notice the dropped connection.

backer-sd Version: 1.36.0 (20 October 2004) i686-redhat-linux-gnu redhat
(Tettnang)
Daemon started 27-Jun-05 11:13, 0 Jobs run since started.

Running Jobs:
Full Backup job OctaneDocs JobId=4934 Volume="octanedocsfull_0001"
device="/arch/backup/full/docs"
    Files=4 Bytes=835 Bytes/sec=15
    FDReadSeqNo=31 in_msg=22 out_msg=5 fd=6
====

Device status:
Device "/arch/backup/full/docs" is mounted with Volume "octanedocsfull_0001"
    Total Bytes=1 Blocks=0 Bytes/block=1
    Positioned at File=0 Block=0
Device "/arch/backup/full/mail" is not open.
====

Please point me in the right direction.

Running the SD with debug output and analyzing that might give you some information, as might using strace to see how it works with the network interface. If you suspect a network problem, you could watch the traffic using tcpdump or Ethereal somewhere between the involved computers.
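
One rough way to check the network path with Bacula out of the picture is to push a sustained TCP stream from the client to the backup server and see whether the connection gets reset. The following is only a minimal Python sketch of that idea. The function names (run_sink, send_stream, loopback_selftest) are made up for illustration, the script does not speak the Bacula protocol, and it assumes Python 3 is available on both machines.

```python
import socket
import threading


def run_sink(server_sock, result):
    """Accept one connection and count the bytes received until EOF."""
    conn, _ = server_sock.accept()
    total = 0
    with conn:
        while True:
            chunk = conn.recv(65536)
            if not chunk:
                break
            total += len(chunk)
    result["received"] = total


def send_stream(host, port, total_bytes, chunk_size=65536):
    """Send total_bytes of dummy data; return (bytes_sent, error_or_None)."""
    payload = b"\0" * chunk_size
    sent = 0
    try:
        with socket.create_connection((host, port), timeout=30) as sock:
            while sent < total_bytes:
                n = min(chunk_size, total_bytes - sent)
                sock.sendall(payload[:n])
                sent += n
    except OSError as exc:
        # Covers connection resets and timeouts alike.
        return sent, exc
    return sent, None


def loopback_selftest(total_bytes=1_000_000):
    """Run sink and sender on 127.0.0.1 to check the script itself."""
    server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    server.bind(("127.0.0.1", 0))  # port 0: let the OS pick a free port
    server.listen(1)
    port = server.getsockname()[1]
    result = {}
    worker = threading.Thread(target=run_sink, args=(server, result))
    worker.start()
    sent, err = send_stream("127.0.0.1", port, total_bytes)
    worker.join(timeout=10)
    server.close()
    return sent, result.get("received"), err
```

To use it across the network, you would run the sink part on the backup server (bound to a reachable address and a free port of your choice) and call send_stream from the client with the onboard NIC. If a large transfer also dies with a reset error here, that points towards the NIC, driver or network rather than Bacula. If it completes, the problem may lie elsewhere.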

Arno

Yours sincerely

Stephan Heine
Support Engineer
Genetic Interactive
Tel: +27 861 99 88 99
Fax: +27 861 99 77 99
Cell: +27 82 467 1164
EMail: [EMAIL PROTECTED]




-------------------------------------------------------
SF.Net email is sponsored by: Discover Easy Linux Migration Strategies
from IBM. Find simple to follow Roadmaps, straightforward articles,
informative Webcasts and more! Get everything you need to get up to
speed, fast. http://ads.osdn.com/?ad_id=7477&alloc_id=16492&op=click
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

--
IT-Service Lehmann                    [EMAIL PROTECTED]
Arno Lehmann                  http://www.its-lehmann.de

