On Friday, 21 March 2003 at 12:57:27 +0100, Alexander Haderer wrote:
> At 10:26 21.03.2003 +1030, Greg 'groggy' Lehey wrote:
>> On Thursday, 20 March 2003 at 13:13:18 +0100, Alexander Haderer wrote:
>>> At 12:53 20.03.2003 +0100, Maarten de Vries wrote:
>>>> This would be for backup. Data on about 50 webservers would be backed up
>>>> to it on a nightly basis. So performance wouldn't be important.
>>>
>>> Sure? Consider this:
>>>
>>> a.
>>> Filling 3TB with 1 Mbyte/s lasts more than 800 hours or 33 days.
>>
>> I do a nightly backup to disk.  It's compressed (gzip), which is the
>> bottleneck.  I get this sort of performance:
>>
>> dump -2uf - /home | gzip > /dump/wantadilla/2/home.gz
>>  ...
>>  DUMP: DUMP: 1254971 tape blocks
>>  DUMP: finished in 217 seconds, throughput 5783 KBytes/sec
>>  DUMP: level 2 dump on Thu Mar 20 21:01:31 2003
>>
>> You don't normally fill up a backup disk at once, so this would be
>> perfectly adequate.  I'd expect a system of the kind that Maarten's
>> talking about to be able to transfer at least 40 MB/s sequential at
>> the disk.  That would mean he could backup over 1 TB in an 8 hour
>> period.
>
> Of course you are right. My note a. was meant as a more general hint to
> think about transfer rates when dealing with large files/filesystem.
> Maarten gave no details about how the webservers are connected with the
> backup server. I should have give more details of what I mean: When backing
> up 50 Webservers over network to one backup server the network may become a
> bottleneck. If you have to use encrypted connections (ssh) because the
> webservers are located elsewhere you need CPU power at server side for each
> connection.

Correct.

>>> b.
>>> Using ssh + dump/cpio/tar needs CPU power for encryption, especially when
>>> multiple clients safe their data at the same time.
>>
>> You can share the compression across multiple machines.  That's what
>> was happening in the example above.
>
> It is a good idea to do compression at the client side.
>
> As I understand your example /dump/wantadilla/2 is either a local
> dir or connected via NFS. The latter requires a local network if you
> don't want to do NFS mounts across the Internet. Is this right?

Yes.  This is just a local network.  There's no absolute necessity for
NFS, and I certainly wouldn't do it across the Internet.

Greg
--
See complete headers for address and phone numbers

Attachment: pgp00000.pgp
Description: PGP signature

Reply via email to