Hello,

On 1/4/2006 12:12 AM, Joe Dollard wrote:

I've run into a performance problem when doing backups to tape that I need some help in resolving. According to the output from btape test, my tape drive can be written to at around 9,700 KB/s. I've also run a test with the Windows file daemon and can backup to disk on my bacula server at around 9,000 KB/s. Based upon these two figures, I would assume that I should be able to do a backup from the Windows file daemon to tape at 9,000 KB/s - which over my 100 megabit network I'd be very happy with.

The basic assumption sounds reasonable - the Windows client can deliver the data, and the tape could write it without holding up the client. BUT the figures you give are about the maximum throughput you can get over a 100 Mbit Ethernet link.

However with spooling disabled my backup to tape runs at about 6700 KB/s (using the same job which gave me 9000 KB/s before). With spooling enabled my backup runs at approx 4700 KB/s.

Unless I'm mistaken, the throughput reported with spooling enabled is not the figure you're interested in, because it measures the overall data rate: first, data is spooled from the client to disk, then despooled from disk to tape. In other words, the actual speed of each process might be much higher - in your case, I'd assume the figures you give above are a good estimate. 4700 KB/s, with every byte moved twice, works out to roughly 9400 KB/s for each subprocess.
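To make the arithmetic explicit, here is a back-of-the-envelope sketch (the numbers are the ones from the job above; the assumption is that both phases run sequentially and move the same data):

```python
# Back-of-the-envelope estimate: with spooling, every byte is moved
# twice (client -> spool disk, then spool disk -> tape), so the
# reported overall job rate is roughly half the rate of each phase.

def per_phase_rate(overall_kb_s, passes=2):
    """Estimate each phase's throughput from the overall job rate,
    assuming the phases run one after the other on the same data."""
    return overall_kb_s * passes

print(per_phase_rate(4700))  # the spooled job: 9400 KB/s per phase
```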

To solve the problem with direct client-to-tape data, you need to make sure there's no bottleneck anywhere in your setup. First, even if the data stream stalls for a short time, the tape drive will stop and has to reposition, which can take quite long. During that phase, the network buffers fill up, which, depending on your network and client setup, can even slow down the client system.

In other words, writing the data to tape has to be the speed-limiting part of a network backup without spooling.

You can try to tune your network buffer setup - search the archives for some more information - and you might even try to install a faster network link between your backup server and the one delivering the data. A dedicated network link can help a lot, especially if your network is heavily used by other applications as well when the backup jobs run.
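As a starting point for the buffer tuning - check the directive names and defaults against your Bacula version's documentation, and treat the values below as illustrative only - the File Daemon resource accepts a Maximum Network Buffer Size setting, for example:

```
# bacula-fd.conf - illustrative sketch, not tuned values
FileDaemon {
  Name = winclient-fd                   # hypothetical client name
  Maximum Network Buffer Size = 65536   # bytes; try larger buffers
                                        # on a fast, clean link
}
```

The same directive can be set in the Storage Daemon's configuration; both ends should agree, since the smaller buffer effectively wins.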

One of my servers has about 240GB of data that I need to run a full backup on weekly, however my bacula server only has about 100GB of available disk space. As I don't have enough disk space to spool the entire job to disk first, the FD is going to be sitting idle while the SD writes the first 100GB to tape, and then the process will be repeated again, and again for the final 40GB.

Right, but some time in the future, that might change. Don't hold your breath, though.

Is there anything I can do in bacula to allow the FD to keep spooling data to the SD while the SD is writing data to tape?

There have been several proposals for handling this problem - multiple smaller spool files per job are one solution. This might not be the easiest way, because it requires a big change in the way the SD currently works, and it would still be limited by hard disk throughput.
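What you can already do today is cap the spool size per device, so the job spools and despools in cycles instead of failing for lack of space - the FD still pauses during each despool, which is exactly the limitation discussed above. A sketch (paths and sizes are examples, adjust for your setup):

```
# bacula-sd.conf - sketch of a spool-limited tape device
Device {
  Name = LTO-Drive                      # hypothetical device name
  Archive Device = /dev/nst0
  Spool Directory = /var/bacula/spool
  Maximum Spool Size = 90gb             # leave headroom on the 100GB disk
}
```

The job itself must also have SpoolData = yes (or the schedule's SpoolData override) for the spool area to be used at all.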

The - theoretically - best solution I know of is, first, a BIG memory buffer which holds data for the tape and is only written once it's nearly full, and which would be refilled whenever possible. Behind that, you'd need a fast disk setup with several dedicated disks, each holding one spool file, and of course each on its own controller. Each controller, in turn, would need its own bus or dedicated link to the system.
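To illustrate the memory-buffer idea - this is a toy sketch, not how Bacula's SD is implemented, and the capacity and watermark are made-up values:

```python
# Toy sketch of the "big memory buffer" idea: small incoming chunks are
# accumulated in RAM and only handed to the (slow) tape writer once the
# buffer is nearly full, so the drive can stream without stopping.

class TapeBuffer:
    def __init__(self, capacity, flush_callback, watermark=0.9):
        self.watermark = int(capacity * watermark)
        self.flush_callback = flush_callback  # writes one big chunk to "tape"
        self.chunks = []
        self.size = 0

    def write(self, data):
        # Accumulate; flush only when the high watermark is reached.
        self.chunks.append(data)
        self.size += len(data)
        if self.size >= self.watermark:
            self.flush()

    def flush(self):
        if self.chunks:
            self.flush_callback(b"".join(self.chunks))
            self.chunks, self.size = [], 0

# Usage: many small network reads become a few large tape writes.
tape_writes = []
buf = TapeBuffer(capacity=1000, flush_callback=tape_writes.append)
for _ in range(25):
    buf.write(b"x" * 100)   # 25 small writes of 100 bytes each
buf.flush()                 # drain the remainder at end of job
print(len(tape_writes))     # far fewer tape writes than network reads
```

A real implementation would refill the buffer concurrently with the tape write (double buffering); this sketch only shows the batching effect.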

In other words, a solution to really achieve maximum throughput requires not only a major modification of Bacula, but also a really optimized system it runs on.

Are there any other workarounds I could use, or am I going to have to buy a bigger hard drive for my backup server?

My experience tells me that installing a bigger hard disk spool area is the best workaround in terms of cost and resulting speed improvement, yes.

Also, assuming that you have enough disk space, the planned (and already started) development of job migration should allow a "real" D2D2T backup setup with Bacula, which would give you not only higher data throughput but also more flexibility. Admittedly, that's something for the future, but if you have the disk space now, it should be a small modification of your setup to use that space not as dedicated spool space, but as hard disk volumes in a migration scheme.

I hope this explains your experiences, and perhaps helps a little when deciding how to solve the current problem. And, of course, if you can support development of the features I mentioned, I'm quite sure that many Bacula users would be much impressed :-)

Arno

Thanks,
Joe


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


--
IT-Service Lehmann                    [EMAIL PROTECTED]
Arno Lehmann                  http://www.its-lehmann.de
