On Mar 20, 2010, at 00:57, Edward Ned Harvey wrote:

I used NDMP up till November, when we replaced our NetApp with a Solaris Sun box. In NDMP, to choose the source files, we had the ability to browse the fileserver, select files, and specify file matching patterns. My point is: NDMP is file based. It doesn't allow you to spawn a process and backup a data stream.
Not quite.
It can reference files, but only by specifying where they are in an
opaque "data stream" (see §2.3.5.2 of the NDMPv4 spec [1]):
The file locator data in the file history record is in a data
service (OS) specific format. To the DMA this information is an
opaque string. This means that the DMA will not attempt to interpret
it. In order to determine the location of a file in the backup data
stream, the DMA will send the complete file history record for the
corresponding file history record to the data service, the data
service will calculate the starting location and the length of the
byte string to be read from the original backup data stream. The DMA
will use this data to manipulate the tape service to retrieve the
selected data.
So the backup software (DMA) knows only which tape the file is on and its starting byte on that tape, but if you want to restore a file from (say) a NetApp share or export, you have to send the bytes to another NetApp, which can interpret the stream. It's not as if the byte stream were in a known format (tar, cpio, or zip) that anyone can interpret. (Unless you reverse-engineer the format, of course.)
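The division of labor described above can be sketched in a few lines. This is a hypothetical model, not a real NDMP library: the function names and record layout are mine, and only the split of responsibilities (DMA holds opaque records, the data service decodes them, the tape service serves raw bytes) follows the spec text.

```python
# Hypothetical sketch of the NDMPv4 restore lookup: the DMA cannot
# parse fh_info itself, so it hands the opaque record back to the data
# service, which returns a byte range to read from the backup stream.

def locate_file(dma_records, data_service, path):
    record = dma_records[path]             # file history record kept by the DMA
    offset, length = data_service(record)  # only the filer's OS can decode it
    return offset, length

def restore(tape_read, dma_records, data_service, path):
    """The DMA uses the (offset, length) answer to drive the tape service."""
    offset, length = locate_file(dma_records, data_service, path)
    return tape_read(offset, length)
```

With a fake data service that understands its own records, the DMA can restore a file it could never locate on its own.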
After a filer ("NDMP Data Service") is told to start backing up, it
can tell the backup software ("NDMP Data Management Application"--DMA)
about files via the NDMP_FH_ADD_FILE command (see §4.3.1 [1]).
[1] http://www.ndmp.org/download/sdk_v4/draft-skardal-ndmp4-04.txt
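To make the NDMP_FH_ADD_FILE notification concrete, here is a rough Python model of the information it carries. The real message is XDR-encoded and the field names below are paraphrased, not the spec's exact identifiers; the point is that everything except fh_info is readable metadata, while fh_info stays opaque to the DMA.

```python
# Rough model of an NDMP_FH_ADD_FILE entry (cf. section 4.3.1 of the
# NDMPv4 spec). Field names are illustrative, not the spec's own.
from dataclasses import dataclass

@dataclass
class FileHistoryEntry:
    path: str        # file name as seen on the filer
    owner: int       # uid
    group: int       # gid
    size: int        # file size in bytes
    fh_info: bytes   # opaque locator into the backup stream (OS-specific)

def add_file(history, entry):
    """What the data service effectively does for each file it backs up:
    notify the DMA, which records the entry but never decodes fh_info."""
    history[entry.path] = entry
```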
So technically Oracle could implement an NDMP data service on (Open)Solaris, and backup vendors could interface with it and send the raw ZFS data stream to tape. As the Solaris kernel traverses the file system and comes across directories and files, it would tell the backup software about each file (path, owner, group, etc.) and where it sits in the stream sent to "tape" (LTO, VTL, etc.). On restoration, the backup software would then have to send the (opaque-to-it) data stream from tape to another Solaris box that could interpret it.
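The traversal half of that hypothetical service can be sketched as follows. This is a toy, not a Solaris or ZFS interface: os.walk and plain file reads stand in for kernel-side traversal, and notify_dma stands in for the NDMP_FH_ADD_FILE notification.

```python
# Hedged sketch of the hypothetical (Open)Solaris NDMP data service:
# walk the filesystem, append each file's bytes to the backup stream,
# and report (path, offset, length) to the DMA as we go.
import os

def backup(root, tape_write, notify_dma):
    offset = 0
    for dirpath, _dirs, files in os.walk(root):
        for name in sorted(files):
            path = os.path.join(dirpath, name)
            with open(path, "rb") as f:
                data = f.read()
            tape_write(data)                     # opaque stream to tape/VTL
            notify_dma(path, offset, len(data))  # like NDMP_FH_ADD_FILE
            offset += len(data)
```

The DMA ends up with a file history index into a stream it cannot parse, which is exactly the situation the restore path above has to cope with.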
This of course applies only to a CIFS share or NFS export, where the filer (NetApp, Sun 7000 series, Celerra) has some knowledge of the file names; it wouldn't work on a raw LUN--unless the filer starts parsing the LUN for known on-disk formats, as NetBackup does with VMware's VMDK format, where it can figure out the individual files.
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss