Re: [zfs-discuss] ZFS and Storage

Gregory Shaw Tue, 27 Jun 2006 08:41:14 -0700

This is getting pretty picky. You're saying that ZFS will detect any errors introduced after ZFS has gotten the data. However, as stated in a previous post, that doesn't guarantee that the data given to ZFS wasn't already corrupted.
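
A minimal sketch of that distinction, assuming SHA-256 as a stand-in checksum and hypothetical helper names (`store`, `verify`) that do not correspond to any ZFS interface. A checksum computed when the data arrives can only vouch for what arrived:

```python
import hashlib

def store(data: bytes):
    """Checksum the data at the moment the filesystem receives it."""
    return data, hashlib.sha256(data).digest()

def verify(data: bytes, digest: bytes) -> bool:
    """Check data read back against the checksum taken at write time."""
    return hashlib.sha256(data).digest() == digest

original = b"payroll record 42"

# Corruption upstream: the bytes were already wrong when they were handed over.
received = b"payroll record 43"
block, digest = store(received)
upstream_ok = verify(block, digest)      # passes: the checksum covers the bad bytes

# Corruption downstream: the bytes changed after the checksum was taken.
damaged = b"payroll record 44"
downstream_ok = verify(damaged, digest)  # fails: the mismatch is detected
```

Here `upstream_ok` is true even though `block` differs from `original`, while `downstream_ok` is false.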

If you don't trust your storage subsystem, you're going to encounter issues regardless of the software used to store data. We'll have to see if ZFS can 'save' customers in this situation. I've found that regardless of the storage solution in question you can't anticipate all issues, and when a brownout or other ugly loss-of-service occurs, you may or may not be intact, ZFS or no.


I've never seen a product that can deal with all possible situations.

On Jun 27, 2006, at 9:01 AM, Jeff Victor wrote:

Unfortunately, a storage-based RAID controller cannot detect errors which occurred between the filesystem layer and the RAID controller, in either direction - in or out. ZFS will detect them through its use of checksums.
But ZFS can only fix them if it can access redundant bits. It can't tell a storage device to provide the redundant bits, so it must use its own data protection system (RAIDZ or RAID1) in order to correct errors it detects.
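
The detect-versus-fix distinction above can be sketched as a toy mirror. This is only an illustration under stated assumptions: the `Mirror` class and its methods are hypothetical, SHA-256 stands in for whatever checksum the pool uses, and the checksum is kept apart from the data it covers, in the spirit of ZFS storing checksums in the parent block pointer.

```python
import hashlib

def checksum(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

class Mirror:
    """Toy N-way mirror; each side maps block number -> bytes."""

    def __init__(self, copies: int = 2):
        self.sides = [dict() for _ in range(copies)]
        self.sums = {}  # checksums live apart from the data blocks

    def write(self, blkno: int, data: bytes) -> None:
        self.sums[blkno] = checksum(data)
        for side in self.sides:
            side[blkno] = data

    def read(self, blkno: int) -> bytes:
        expected = self.sums[blkno]
        good = None
        bad_sides = []
        for side in self.sides:
            if checksum(side[blkno]) == expected:
                if good is None:
                    good = side[blkno]
            else:
                bad_sides.append(side)
        if good is None:
            # Error detected, but no redundant copy to repair from.
            raise OSError(f"block {blkno}: checksum mismatch on every copy")
        for side in bad_sides:
            side[blkno] = good  # rewrite the damaged copy from a good one
        return good
```

With `copies=2`, a read of a block damaged on one side returns the good data and rewrites the bad copy. With `copies=1`, which is roughly the position of a filesystem sitting on a single LUN from a RAID controller, the same damage is detected but `read` can only raise.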
Gregory Shaw wrote:
Most controllers support a background-scrub that will read a volume and repair any bad stripes. This addresses the bad block issue in most cases. It still doesn't help when a double-failure occurs. Luckily, that's very rare. Usually, in that case, you need to evacuate the volume and try to restore what was damaged.
On Jun 26, 2006, at 6:40 PM, Eric Schrock wrote:
On Mon, Jun 26, 2006 at 05:26:24PM -0600, Gregory Shaw wrote:
You're using hardware raid. The hardware raid controller will rebuild
the volume in the event of a single drive failure. You'd need to keep
on top of it, but that's a given in the case of either hardware or
software raid.
True for total drive failure, but there are more failure modes
than that. With hardware RAID, there is no way for the RAID controller
to know which block was bad, and therefore it cannot repair the block.
With RAID-Z, we have the integrated checksum and can do combinatorial
analysis to know not only which drive was bad, but what the data
_should_ be, and can repair it to prevent more corruption in the future.
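
A rough sketch of the combinatorial idea, assuming a single XOR parity column and SHA-256 as a stand-in checksum. The function names are hypothetical and this is not the RAID-Z implementation; it only shows how a block checksum lets the search identify the bad column.

```python
import hashlib
from functools import reduce

def xor_bytes(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def parity(columns):
    """XOR parity over equal-length data columns."""
    return reduce(xor_bytes, columns)

def checksum(columns) -> bytes:
    return hashlib.sha256(b"".join(columns)).digest()

def combinatorial_repair(columns, par, expected):
    """Return (bad_index, repaired_columns); bad_index is None if the data is intact.

    Assume each column in turn is the bad one, rebuild it from the
    other columns plus parity, and keep the candidate whose checksum
    matches the one recorded at write time.
    """
    if checksum(columns) == expected:
        return None, list(columns)
    for i in range(len(columns)):
        others = [c for j, c in enumerate(columns) if j != i]
        candidate = reduce(xor_bytes, others, par)
        trial = list(columns)
        trial[i] = candidate
        if checksum(trial) == expected:
            return i, trial
    raise ValueError("more damage than single parity can repair")

data = [b"AAAA", b"BBBB", b"CCCC"]
p = parity(data)
s = checksum(data)

damaged = list(data)
damaged[1] = b"BxBB"  # silent corruption on one "drive"
bad_index, repaired = combinatorial_repair(damaged, p, s)
```

Without the checksum, recomputing parity over `damaged` shows only that the stripe is inconsistent, not which column is wrong. With it, `bad_index` identifies the column and `repaired` holds the original data.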
- Eric

--
Eric Schrock, Solaris Kernel Development http://blogs.sun.com/eschrock
-----
Gregory Shaw, IT Architect
Phone: (303) 673-8273        Fax: (303) 673-8273
ITCTO Group, Sun Microsystems Inc.
1 StorageTek Drive MS 4382              [EMAIL PROTECTED] (work)
Louisville, CO 80028-4382                 [EMAIL PROTECTED] (home)
"When Microsoft writes an application for Linux, I've Won." -Linus Torvalds
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
--
--------------------------------------------------------------------------
Jeff VICTOR              Sun Microsystems            jeff.victor @sun.com
OS Ambassador            Sr. Technical Specialist
Solaris 10 Zones FAQ:    http://www.opensolaris.org/os/community/zones/faq
--------------------------------------------------------------------------


