Ralf, > Ralf Ramge wrote: >> Questions: >> >> a) I don't understand why the kernel panics at the moment. the zpool >> isn't mounted on both systems, the zpool itself seems to be fine >> after a >> reboot ... and switching the primary and secondary hosts just for >> resyncing seems to force a full sync, which isn't an option. >> >> b) I'll try a "sndradm -m -r" the next time ... but I'm not sure if I >> like that thought. I would accept this if I replaced the primary host >> with another server, but having to do a 24 TB full sync just >> because the >> replication itself had been disabled for a few minutes would be >> hard to >> swallow. Or did I do something wrong? >> >> > I've answered these questions myself at the meantime (with a nice > employee fo Sun Hamburg giving me the hint). For Google: during a > reverse sync, neither side of the replication is allowed to have the > zpool imported, because after the reverse sync finishes, SNDR enters > replication mode. This renders reverse syncs useless for HA scenarios, > switch primary & secondary instead.
This is close, but not the actual scenario, and actual answer is much better then one would expect. Just prior to issuing a reverse sync, neither side of the replication is allowed to have the, zpool imported. This step is VERY IMPORTANT, since ZFS will detect SNDR replicated writes, and since these writes were not issued by the local ZFS, ZFS will assume these writes are some form of data corruption, since the checksums won't match, and panic the system. Instantly after issuing a reverse sync, zpool(s) on the SNDR primary node can now be imported, without waiting. Although there may be minutes, hours or days of change that need to be replicated from the SNDR secondary volumes to the SNDR primary volumes, SNDR supports on- demand pull of unreplicated changes. This improves one's MTTR (Mean Time To Recover), at the cost of performance for the duration of reverse sync, where the duration is a function of the amount of change that happened while running on the secondary node's volumes. Also any changes made to the primary volume, will be replicated to the secondary volumes during this period, so that the very instant the reverse synchronization operation is complete, both sides of the replica will be identical. > -- > > Ralf Ramge > Senior Solaris Administrator, SCNA, SCSA > > Tel. +49-721-91374-3963 > [EMAIL PROTECTED] - http://web.de/ > > 1&1 Internet AG > Brauerstraße 48 > 76135 Karlsruhe > > Amtsgericht Montabaur HRB 6484 > > Vorstand: Henning Ahlert, Ralph Dommermuth, Matthias Ehrlich, > Andreas Gauger, Matthias Greve, Robert Hoffmann, Norbert Lang, > Achim Weiss > Aufsichtsratsvorsitzender: Michael Scheeren > > _______________________________________________ > zfs-discuss mailing list > zfs-discuss@opensolaris.org > http://mail.opensolaris.org/mailman/listinfo/zfs-discuss Jim Dunham Solaris, Storage Software Group Sun Microsystems, Inc. 1617 Southwood Drive Nashua, NH 03063 Phone x24042 / 781-442-4042 Email: [EMAIL PROTECTED] http://blogs.sun.com/avs _______________________________________________ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss