Andrew Gideon wrote:
I currently do incremental backups using --link-dest. Unchanged files are hard links to the previous snapshot; changed files are new copies.

Where this "fails" is for large files that have received small changes. The directory containing my main IMAP account, for example, typically generates between 1 and 2 G of daily backup data as I file messages in my inbox. Yesterday, though, I filed some old messages in my inbox. This touched files I don't typically touch. They were minor changes in the scheme of things, but the result was an 11G backup of that volume last night.

If you resist using a fileformat which has some kind of implicit "chunking" capability (eg maildir isn't perfect, but kind of goes in the opposite direction and chunks your mail into lots of smaller files - perhaps too many in the opinion of some...), then you really need a backup system which can "chunk" your backup files and only store the differences

There are a few like rsnapshot which might suit your needs. Or if you want something different then perhaps try brackup, which is kind of rsync like, but has a "chunking" phase built into it which tries to be rather smart about dividing files up into the hopefully static bit and the variable bit. eg in the case of MP3s it would create one chunk for the data and one chunk for the tagging, that way you can retag your mp3 collection and it won't re-upload the whole lot

Ideally I would like to see a kind of half and half maildir/mbox format emerge for email (perhaps dbox from dovecot will get there?). The idea is that it would use mbox for storage, but break the files up into chunks of about a couple of MB for each mbox file. This way you would have much smaller files to defrag in the case of deletes, but you would gain the packing efficiency of mbox (especially if you use compressed mailboxes, eg dovecot)

Good luck

Ed W
--
Please use reply-all for most replies to avoid omitting the mailing list.
To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync
Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html

Reply via email to