On 2012-06-29 12:07 PM, Ed W <li...@wildgooses.com> wrote:
On 29/06/2012 12:15, Charles Marcus wrote:
Depends on what you mean exactly by 'incorrect'...

I'm sorry, this wasn't meant to be an attack on you,

No worries - it wasn't taken that way - I simply disagreed with the main point you were making, and still do. While I agree there is some truth to the issue you have raised, I just don't see it as quite the disaster-in-waiting that you do. I have been running small RAID setups for quite a while. Many years ago I inherited an older RAID5 (with NO hot spare) that gave me fits for about a month: drives would randomly 'fail', a rebuild - which took a few HOURS, and that was with drives that are small by today's standards (120GB) - would fix it, then another drive would drop out 2 or 3 days later, and so on. I finally found an identical replacement controller on ebay (an old 3ware card), and once it was swapped in, the problem went away. I also had one instance in a RAID10 setup I configured myself a few years ago where one of the pairs had some errors after an unclean shutdown (this was after about 3 years of 24/7 operation on a mail server) and went into an automatic rebuild, which went smoothly (and was much faster than the RAID5 rebuilds, even though the drives were much bigger).

So, yes, while I acknowledge the risk, it is the risk we all run storing data on hard drives.

I thought I was pointing out what is now fairly obvious stuff, but
it's only recently that the maths has been popularised by the common
blogs on the interwebs. Whilst I guess not everyone read the flurry
of blog articles about this last year, I think it's due to be
repeated with increasing frequency as we go forward:

The most recent article which prompted all of the above is, I think, this one:
http://queue.acm.org/detail.cfm?id=1670144
More here (BAARF = Battle Against Raid 5/4):
http://www.miracleas.com/BAARF/

I'll find time to read these over the next week or two, thanks...

Intel have a whitepaper which says:

Intelligent RAID 6 Theory Overview And Implementation

RAID 5 systems are commonly deployed for data protection in most
business environments.

While that may have been true many years ago, I don't think it is today. I wouldn't touch RAID5 with a ten-foot pole - but yes, maybe there are still people who use it for some reason, and maybe there are even some corner cases where it is desirable?

However, RAID 5 systems only tolerate a single drive failure, and the
probability of encountering latent defects [i.e. UREs, among other
problems] of drives approaches 100 percent as disk capacity and array
width increase.

Well, this is definitely true, but I wouldn't touch RAID5 today.
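
For anyone who wants to see where "approaches 100 percent" comes from, here is the back-of-envelope version (my own rough sketch - the URE rate of 1 per 10^14 bits and the 2 TB drive sizes are assumptions for illustration, not measurements):

import math

# Rough odds of hitting at least one unrecoverable read error (URE)
# while reading back N terabytes during a rebuild. Assumes a
# consumer-class URE rate of 1 per 1e14 bits and independent errors.
def p_ure_during_rebuild(data_read_tb, ure_rate_per_bit=1e-14):
    bits_read = data_read_tb * 1e12 * 8               # decimal TB -> bits
    # P(at least one URE) = 1 - (1 - p)^bits, via log1p/expm1
    return -math.expm1(bits_read * math.log1p(-ure_rate_per_bit))

# A 4 x 2TB RAID5 rebuild has to read the 3 surviving drives (6 TB):
print(round(p_ure_during_rebuild(6), 2))    # ~0.38
# A 7 x 2TB RAID5 has to read 12 TB:
print(round(p_ure_during_rebuild(12), 2))   # ~0.62

Bigger drives, or more of them, push that number towards 1, which is all the whitepaper is really saying.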

And to be clear - RAID5/RAID1 has a very significant probability that
once your first disk has failed, in the process of replacing that disk
you will discover an unrecoverable error on your remaining drive and
hence you have lost some data...

Well, this is true, but the part of your comment that I was responding to and challenging was the claim that the entire RAID just 'died' and you lost ALL of your data.

That is simply not true on modern systems.

So the vulnerability is not the first failed disk, but discovering
subsequent problems during the rebuild.

True, but this applies to every RAID mode (RAID6 included).

No - RAID6 has a dramatically lower chance of this happening than
RAID1/5. This is the real insight, and I think it's important that
this (fairly obvious in retrospect) idea becomes widely known and
understood by those who manage arrays.

RAID6 needs a failed drive and *two* subsequent errors *in the same stripe*
to lose data. RAID5/1 simply need one subsequent error *anywhere in the
array* to lose data. Quite a large difference!

Interesting... I'll look at this more closely then, thanks.
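
Since I was curious, I roughed the comparison out myself. Everything below is assumed purely for illustration: a 7 x 2 TB array with one dead drive, a URE rate of 1 per 1e14 bits, 256 KiB chunks, independent errors, and a crude Poisson shortcut:

import math

URE = 1e-14                     # assumed URE rate: 1 error per 1e14 bits read
DRIVE_BITS = 2e12 * 8           # assumed 2 TB drives (decimal), in bits
CHUNK_BITS = 256 * 1024 * 8     # assumed 256 KiB chunk per drive per stripe

# RAID5 after one failure: a single URE anywhere on the surviving
# drives during the rebuild read means data is lost somewhere.
def p_raid5_loss(surviving_drives):
    bits = surviving_drives * DRIVE_BITS
    return -math.expm1(bits * math.log1p(-URE))

# RAID6 after one failure: one level of redundancy is still left, so
# data is only lost if *two* UREs land in the same stripe.
def p_raid6_loss(surviving_drives):
    stripes = DRIVE_BITS / CHUNK_BITS
    lam = surviving_drives * CHUNK_BITS * URE   # expected UREs per stripe
    p_two_in_one_stripe = 0.5 * lam * lam       # Poisson approx, lam << 1
    return -math.expm1(stripes * math.log1p(-p_two_in_one_stripe))

print(p_raid5_loss(6))   # ~0.62
print(p_raid6_loss(6))   # ~6e-08

Those numbers are only as good as the assumptions, and whole-drive failures, controller faults and correlated errors muddy the picture, but the per-stripe vs per-array distinction clearly matters.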

Also, one big disadvantage of RAID5/6 is the rebuild times

Hmm, at least theoretically both need a full linear read of the other
disks, so the time for an idle array should be similar in both cases. I agree
though that for an active array, RAID5/6 generally causes more drives
to read/write, hence yes, the impact is probably greater.

No 'probably' about it - it is definitely greater, even comparing the smallest possible setups (four drives for RAID6 and RAID10, three for RAID5). And as the number of disks in the array increases, the difference grows dramatically. With RAID10, when a drive fails and a rebuild occurs, only ONE drive must be read (the failed drive's mirror partner, which is simply remirrored) - in a RAID5/6, most if not *all* of the surviving drives must be read (depending on how it is configured, I guess).
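
A trivial way to see the difference in how much data has to be read back (my own sketch, assuming 2 TB drives and that RAID5/6 read every surviving member to rebuild):

# Rough volume of data that must be READ to rebuild one failed drive.
# Assumption: RAID5/6 read every surviving member; RAID10 only reads
# the failed drive's mirror partner.
def rebuild_read_tb(level, n_drives, drive_tb=2):
    if level == "raid10":
        return drive_tb
    if level in ("raid5", "raid6"):
        return (n_drives - 1) * drive_tb
    raise ValueError(level)

for n in (4, 8, 16):
    print(n, rebuild_read_tb("raid10", n), rebuild_read_tb("raid5", n))
# 4 drives:  2 TB vs  6 TB
# 8 drives:  2 TB vs 14 TB
# 16 drives: 2 TB vs 30 TB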

However, don't miss the big picture: with RAID1/5 your risk is a second
error occurring anywhere on the array, but with RAID6 your risk is *two*
errors in the same stripe, i.e. you can lose a whole second drive and
still continue rebuilding with RAID6.

And the same is true of RAID10, as long as the second drive to fail isn't the surviving half of the pair currently being remirrored.
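
To put a rough number on that caveat (assuming every surviving drive is equally likely to be the next to fail - a simplification, since the remirror itself stresses the partner):

# After one RAID10 drive has failed, the array only dies if the NEXT
# drive to fail is that drive's mirror partner.
def p_raid10_dies_on_second_failure(n_drives):
    return 1 / (n_drives - 1)

for n in (4, 8, 16):
    print(n, round(p_raid10_dies_on_second_failure(n), 3))
# 4 drives: 0.333, 8 drives: 0.143, 16 drives: 0.067

RAID6, by contrast, survives any second whole-drive failure.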

I think you have proven your case that a RAID6 is statistically a little less likely to suffer a catastrophic cascading disk failure scenario than RAID10.

I personally feel that RAID arrays *are* very fragile. Backups are often
the only option when you get multi-drive failures (even if the array is
theoretically repairable). However, it's about the best option we have right
now, so all we can do is be aware of the limitations...

And since backups are stored on drives (well, mine are, I stopped using tape long ago), they have the same associated risks... but of course I agree with you that they are absolutely essential.

Additionally, I have personally suffered this exact situation: a failing RAID5
which was somehow hanging together with just the odd uncorrectable read
error reported here and there (say once a month). I copied off all the
data and then, as an experiment, replaced one disk in this otherwise
working array, which triggered a cascade of discovered errors all
over the disk, and rebuilding was basically impossible.

Sounds like you had a bad controller to me... and yes, when a controller goes bad, lots of weirdness and 'very bad things' can occur.

Roll on btrfs I say...

+1000 ;)

--

Best regards,

Charles
