On Mon, Dec 05, 2011 at 01:51:05AM +0200, Timo Sirainen wrote: > I'm mainly wondering if it's common for backup programs to > support using a separate program to generate the backups. For > example if there was a "dovecot-backup" binary that just > dumps all (or new-since-last-backup) of the users' mails into > stdout, which the backup program can use. Or perhaps in that > case there wouldn't really be much of anything for the backup > to do except to write it to tape..
For databases, most commercial programs use some kind of application-specific plugins. Bakula has "Client Run Before Job" which permits indicating a command to be run before proceeding with the backup, maybe there is more. However I have not heard of any standardized way of doing it so that the application provider could provide a way to interact with multiple backup programs. It's more like each backup vendor boasting that his backup software can backup application XXX without downtime. > SIS was designed to work with hard links. They couldn't be > replaced with symlinks without a redesign (which would be less > efficient in normal operation). Right, but if the backup program recognized this then maybe the replacement by a symlink could be done only in the backup. rsync has to keep in memory all the inodes and check the list every time a hard link is found. If it knew that "a hard link in attachments should link to an identical file name in the common attachment SiS store. > [Zimbra] > > You mean you first create uncompressed zip files (why not > just tar?) of all the mails to the filesystem and the backup > software then backups those zip files? Well, not I, this is Zimbra's backup system :-) The backups are the destination zips. Maybe zip is used because the extension and extraction method is the same whether compressed or not; that way compression is just an option to the backup program to be turned on or off. > Dovecot's mdbox files already contain multiple messages in > each file, so it should be a lot more efficient to do backups > on those. And each message in an mdbox file can be compressed > if zlib plugin is enabled. So I think that sounds quite a lot > like what you propose. Is that combined or combinable with SiS? If attachments are in separate files, that means they are aligned on block boundaries, which make block-level SiS (like NetApp's) much more efficient. Think of an attachment sent to all department heads, all of whom forward the attachment to all their subordinates.