Frank,
On Fri, 2 Feb 2007, Torrey McMahon wrote:
Jason J. W. Williams wrote:
Hi Jim,

Thank you very much for the heads up. Unfortunately, we need the
write-cache enabled for the application I was thinking of combining
this with. Sounds like SNDR and ZFS need some more soak time
before you can use both to their full potential together?
Well... there is the fact that SNDR works with filesystems other than 
ZFS. (Yes, I know this is the ZFS list.) Working around architectural 
issues for ZFS and ZFS alone might cause issues for others.
SNDR has some issues with logging UFS as well. If you start an SNDR 
live copy on an active logging UFS (not _writelocked_), the UFS log 
state may not be copied consistently.
Treading "very" carefully, UFS logging may have issues with being 
replicated, not the other way around. SNDR replication (after 
synchronizing) maintains a write-order consistent volume, thus if there 
is an issue with UFS logging being able to access an SNDR secondary, 
then UFS logging will also have issues with accessing a volume after 
Solaris crashes. The end result of Solaris crashing, or SNDR replication 
stopping, is a write-ordered, crash-consistent volume.
Given that both UFS logging and SNDR are (near) perfect (or there would 
be a flood of escalations), this issue, in all cases I've seen to date, 
is that the SNDR primary volume being replicated is mounted with UFS 
logging enabled, but the SNDR secondary is not mounted with UFS logging 
enabled. Once this condition happens, the problem can be resolved by 
fixing /etc/vfstab to correct the inconsistent mount options, and then 
performing an SNDR update sync.
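For illustration only, a minimal sketch of that fix (the device names and 
mount point below are made up, and the sndradm syntax is quoted from memory 
of the AVS tools; only the matching "logging" option and the update sync 
are the point):

    # /etc/vfstab on the SNDR secondary -- make the mount options field
    # match the primary (here, "logging"):
    #device to mount    device to fsck      mount point  FS   pass  boot  options
    /dev/dsk/c1t0d0s6   /dev/rdsk/c1t0d0s6  /export      ufs  2     yes   logging

    # Then refresh the secondary with an SNDR update sync (run on the primary,
    # -n skips the confirmation prompt):
    sndradm -n -u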
If you want a live remote replication facility, it _NEEDS_ to talk to 
the filesystem somehow. There must be a callback mechanism that the 
filesystem could use to tell the replicator "and from exactly now on 
you start replicating". The only entity which can truly give this 
signal is the filesystem itself.
There is an RFE against SNDR for something called "in-line PIT". I hope 
that this work will get done soon.
And no, that's _not_ when the filesystem does a "flush write cache" 
ioctl. Or when the user has just issued a "sync" command or similar.
For ZFS, it'd be when a ZIL transaction is closed (as I understand 
it), for UFS it'd be when the UFS log is fully rolled. There's no 
notification to external entities when these two events happen.
Because ZFS is always on-disk consistent, this is not an issue. So far 
in ALL my testing with replicating ZFS with SNDR, I have not seen ZFS fail!
Of course, be careful not to confuse my stated position with another 
closely related scenario: accessing ZFS on the remote node via a forced 
import ("zpool import -f <name>") while SNDR replication is still active, 
as ZFS is sure to panic the system. ZFS, unlike other filesystems, has 
zero tolerance for corrupted metadata.
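As a rough sketch of the safe way to look at the secondary (the pool name 
is hypothetical and the sndradm syntax is from memory): stop replication to 
the volume before importing the pool on the remote node.

    # On the secondary: drop the SNDR set(s) into logging mode so the
    # volume stops changing underneath ZFS, then import the pool normally:
    sndradm -n -l
    zpool import tank

    # When finished, export the pool and resume replication from the
    # primary with an update sync:
    zpool export tank
    sndradm -n -u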
Jim


SNDR tries its best to achieve this detection, but without actually _stopping_ all I/O (on UFS: writelocking), there's a window of vulnerability still open. And SNDR/II don't stop filesystem I/O - by basic principle. That's how they're sold/advertised/intended to be used.
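By way of illustration, the only way today to fully close that window on 
UFS is to write-lock the filesystem by hand around the copy (the mount 
point below is hypothetical, and the copy step is only a placeholder):

    # Flush and write-lock the filesystem so no new writes land
    lockfs -w /export

    # ... take the point-in-time copy / let replication catch up here ...

    # Release the write lock
    lockfs -u /export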
I'm all for seeing SNDR/II go open - we could finally work these 
issues!
FrankH.

I think the best-of-both-worlds approach would be to let SNDR plug in 
to ZFS along the same lines the crypto stuff will be able to plug in, 
different compression types, etc. There once was a slide that showed 
how that worked... or I'm hallucinating again.
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
