Hi all,
Ok, this might be to stir some things up again but I would like to
make this more clear.
I have been reading this and other threads regarding ZFS on SAN and
how well ZFS can recover from a serious error such as a cached disk
array goes down or the connection to the SAN is lost. What I am
hearing (Miles, ZFS-8000-72) is that sometimes you can end up in an
unrecoverable state that forces you to restore the whole pool. I have
been operating quite large deployments of SVM/UFS VxFS/VxVM for some
years and while you sometimes are forced to do a filesystem check and
some files might end up in lost+found I have never lost a whole
filesystem. This is despite whole arrays crashing, split-brain
scenarios etc. In the previous discussion a lot of fingers was pointed
at hardware and USB connections, but then some people mentioned
loosing pools located SAN in this thread.
We are currently evaluating if we should begin to implement ZFS in our
SAN. I can see great opportunities with ZFS but if we have a higher
risk of loosing entire pools that is a serious issue. I am aware that
the other filesystems might not be in a correct state after a serious
failure, but as stated before that can be much better than restoring
a multi terabyte filesystem from yesterdays backup.
So, what is the opinion, is this an existing problem even when using
enterprise arrays? If I understand this correctly there should be no
risk of loosing an entire pool if DKIOCFLUSHWRITECACHE is honored by
the array?
If it is a problem, will the worst case scenario be at least on pair
with UFS/VxFS when 6667683 is fixed?
Grateful for any additional information.
Regards
Henrik Johansson
http://sparcv9.blogspot.com
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss