Re: mismatch_cnt questions

2007-03-07 Thread H. Peter Anvin
H. Peter Anvin wrote: Eyal Lebedinsky wrote: Neil Brown wrote: [trim Q re how resync fixes data] For raid1 we 'fix' an inconsistency by arbitrarily choosing one copy and writing it over all other copies. For raid5 we assume the data is correct and update the parity. Can raid6 identify the ba

Re: mismatch_cnt questions

2007-03-07 Thread H. Peter Anvin
Neil Brown wrote: On Monday March 5, [EMAIL PROTECTED] wrote: Neil Brown wrote: [trim Q re how resync fixes data] For raid1 we 'fix' an inconsistency by arbitrarily choosing one copy and writing it over all other copies. For raid5 we assume the data is correct and update the parity. Can raid6

Re: mismatch_cnt questions

2007-03-07 Thread H. Peter Anvin
Eyal Lebedinsky wrote: Neil Brown wrote: [trim Q re how resync fixes data] For raid1 we 'fix' an inconsistency by arbitrarily choosing one copy and writing it over all other copies. For raid5 we assume the data is correct and update the parity. Can raid6 identify the bad block (two parity blo
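
The two repair policies quoted above can be illustrated with a small sketch. This is not md's actual code: the function names and the in-memory byte blocks are hypothetical, chosen only to show "copy one mirror over the others" for raid1 and "trust the data, recompute parity" for raid5.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR a list of equal-length byte blocks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def repair_raid1(copies):
    """raid1 policy as described: arbitrarily pick one copy (here the first,
    an assumption) and write it over all the other copies."""
    chosen = copies[0]
    return [chosen for _ in copies]


def repair_raid5(data_blocks, parity):
    """raid5 policy as described: assume the data is correct and update the
    parity. Returns (mismatch_found, recomputed_parity)."""
    expected = xor_blocks(data_blocks)
    return expected != parity, expected
```

Note that neither policy can tell which block was actually wrong; it only makes the stripe self-consistent again, which is why the thread goes on to ask what raid6 could do with its second parity block.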

Re: Raid 10 Problems?

2007-03-07 Thread Jan Engelhardt
On Mar 7 2007 10:20, dean gaudet wrote: >>> http://gentoo-wiki.com/HOWTO_Install_on_Software_RAID#Write-intent_bitmap >> >> That information has been extremely useful. Thanks a >> lot. I found a command to do the bitmap internal after >> the array was made so I added that. Seems like some of >> th

Re: RAID1, hot-swap and boot integrity

2007-03-07 Thread H. Peter Anvin
Mike Accetta wrote: I gathered the impression somewhere, perhaps incorrectly, that the active flag was a function of the boot block, not the BIOS. We use Grub in the MBR and don't even have an active flag set in the partition table. The system still boots. The active flag is indeed an MBR i

[patch 021/101] md: Avoid possible BUG_ON in md bitmap handling.

2007-03-07 Thread Greg KH
From: Neil Brown <[EMAIL PROTECTED]> md/bitmap tracks how many active write requests are pending on blocks associated with each bit in the bitmap, so that it knows when it can clear the bit (when count hits zero). The counter has 14 bits of space, so if there are ever more than 16383, we cannot c
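
The figure 16383 follows directly from the 14-bit width: it is 2^14 - 1. A minimal sketch of the arithmetic, with a hypothetical guard function that is only an illustration and not the code from the patch:

```python
COUNTER_BITS = 14                        # width stated in the patch description
COUNTER_MAX = (1 << COUNTER_BITS) - 1    # largest value a 14-bit field can hold


def can_count_another_write(pending):
    """Hypothetical check: True if one more pending write still fits in the
    14-bit counter without overflowing it."""
    return pending < COUNTER_MAX
```

How the real fix behaves once the counter is full is not visible in the truncated preview, so the sketch stops at the overflow condition itself.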

[patch 045/101] md: Fix raid10 recovery problem.

2007-03-07 Thread Greg KH
From: NeilBrown <[EMAIL PROTECTED]> There are two errors that can lead to recovery problems with raid10 when used in 'far' mode (not the default). Due to a '>' instead of '>=' the wrong block is located which would result in garbage being written to some random location, quite possibly outside th

Re: no journaling and loops on softraid?

2007-03-07 Thread Bill Davidsen
Dexter Filmore wrote: http://gentoo-wiki.com/HOWTO_Gentoo_Install_on_Software_RAID#Data_Scrubbing "Warning: Be aware that the combination of RAID5 and loop-devices will most likely cause severe filesystem damage, especially when using ext3 and ReiserFS. Some users suggest that XFS is not affec

Re: Replace drive in RAID5 without losing redundancy?

2007-03-07 Thread Bill Davidsen
Neil Brown wrote: On Monday March 5, [EMAIL PROTECTED] wrote: Is it possible to mark a disk as "to be replaced by an existing spare", then migrate to the spare disk and kick the old disk _after_ migration has been done? Or not even kick - but mark as new spare. No, this is not possible

Re: detecting/correcting _slightly_ flaky disks

2007-03-07 Thread Bill Davidsen
Michael Stumpf wrote: Bill Davidsen wrote: Michael Stumpf wrote: This is the drive I think is most suspect. What isn't obvious, because it isn't listed in the self test log, is between #1 and #2 there was an aborted, hung test. The #4 short test that was aborted was also a hung test that I

Re: detecting/correcting _slightly_ flaky disks

2007-03-07 Thread berk walker
Michael Stumpf wrote: Bill Davidsen wrote: Michael Stumpf wrote: This is the drive I think is most suspect. What isn't obvious, because it isn't listed in the self test log, is between #1 and #2 there was an aborted, hung test. The #4 short test that was aborted was also a hung test that

Re: high mismatch count after scrub

2007-03-07 Thread Dexter Filmore
> Neil, > > So in his case, is there any way to determine which files are potentially > corrupted? I'd like to know that, too... -- -BEGIN GEEK CODE BLOCK- Version: 3.12 GCS d--(+)@ s-:+ a- C UL++ P+>++ L+++> E-- W++ N o? K- w--(---) !O M+ V- PS+ PE Y++ PGP t++(---)@ 5 X+(++) R+(+

Re: Help with chunksize on raid10 -p o3 array

2007-03-07 Thread Peter Rabbitson
Bill Davidsen wrote: Peter Rabbitson wrote: Hi, I have been trying to figure out the best chunk size for raid10 before By any chance did you remember to increase stripe_cache_size to match the chunk size? If not, there you go. At the end of /usr/src/linux/Documentation/md.txt it specifical