Re: Odd data corruption problem with LVM/ReiserFS

2005-02-22 Thread pcg
On Tue, Feb 22, 2005 at 02:46:44PM -0600, Alex Adriaanse <[EMAIL PROTECTED]> wrote: > On Tue, 22 Feb 2005 20:49:00 +0100, Marc A. Lehmann <[EMAIL PROTECTED]> wrote: > > Well, I do use reiserfs->aes-loop->lvm/dm->md5/raid5, and it never failed > > for me, except once, and the error is likely to be

Re: Odd data corruption problem with LVM/ReiserFS

2005-02-22 Thread Alex Adriaanse
On Tue, 22 Feb 2005 20:49:00 +0100, Marc A. Lehmann <[EMAIL PROTECTED]> wrote: > > >A reboot fixes this for both ext3 and reiserfs (i.e. the error is gone). > > > > > > > Well, it didn't fix it for me. The fs was trashed for good. The major > > question for me is now usability of md/dm for any purp

Re: Odd data corruption problem with LVM/ReiserFS

2005-02-22 Thread pcg
On Tue, Feb 22, 2005 at 08:39:21PM +0100, Andreas Steinmetz <[EMAIL PROTECTED]> wrote: > To clarify: there were no disk I/O errors, only I/O errors were reported > by find during operation so it is definitely filesystem corruption > that is going on here. > Though find performs heavy read acti

Re: Odd data corruption problem with LVM/ReiserFS

2005-02-22 Thread Andreas Steinmetz
pcg( Marc)@goof(A.).(Lehmann )com wrote: I use both reiserfs and ext3 on lvm/dm on raid. Both filesystems have issues when restoring from backup (i.e. very heavy write activity). I did report this to the linux kernel, and got as reply that there are indeed races *somewhere*, but as of yet there is

Re: Odd data corruption problem with LVM/ReiserFS

2005-02-22 Thread pcg
On Mon, Feb 21, 2005 at 12:37:53PM +0100, Andreas Steinmetz <[EMAIL PROTECTED]> wrote: > >Anyway, what do you guys think could be the problem? Could it be that > >the LVM / Device Mapper snapshot feature is solely responsible for > >this corruption? (I'm sure there's a reason it's marked > >Expe

Re: Odd data corruption problem with LVM/ReiserFS

2005-02-21 Thread Alex Adriaanse
I found out some interesting things tonight. I removed my /var and /home snapshots, and all the corruption, with the exception of files I had changed while /var and /home were in their corrupted state, had disappeared! I overwrote several files on /var that were corrupt with clean copies from my

Re: Odd data corruption problem with LVM/ReiserFS

2005-02-21 Thread Alex Adriaanse
Alasdair, Thanks for the tips. Do you think it's possible DM's snapshots could've caused this corruption, or do you think the problem lies elsewhere? Alex On Mon, 21 Feb 2005 15:18:52 +, Alasdair G Kergon <[EMAIL PROTECTED]> wrote: > On Sun, Feb 20, 2005 at 11:25:37PM -0600, Alex Adriaanse

Re: Odd data corruption problem with LVM/ReiserFS

2005-02-21 Thread Andreas Steinmetz
Alex Adriaanse wrote: The weird thing is I did not see any I/O errors in my logs, and running find on /var worked without a problem. By the way, did you take any DM snapshots when you experienced that corruption? No, no snapshots. Just working find on a large dataset (source tree, about 16GB). Th

Re: Odd data corruption problem with LVM/ReiserFS

2005-02-21 Thread Alex Adriaanse
On Mon, 21 Feb 2005 12:37:53 +0100, Andreas Steinmetz <[EMAIL PROTECTED]> wrote: > Alex Adriaanse wrote: > > As far as I can tell all the directories are still intact, but there > > was a good number of files that had been corrupted. Those files > > looked like they had some chunks removed, and so

Re: Odd data corruption problem with LVM/ReiserFS

2005-02-21 Thread Alasdair G Kergon
On Sun, Feb 20, 2005 at 11:25:37PM -0600, Alex Adriaanse wrote: > This morning was the first time my backup script took > a snapshot since upgrading to 2.6.10-ac12 (yesterday I had taken a few > snapshots myself for testing purposes, this seemed to work fine). a) Activating a snapshot requires a

Re: Odd data corruption problem with LVM/ReiserFS

2005-02-21 Thread Andreas Steinmetz
Alex Adriaanse wrote: As far as I can tell all the directories are still intact, but there was a good number of files that had been corrupted. Those files looked like they had some chunks removed, and some had a bunch of NUL characters (in blocks of 4096 characters). Some files even had chunks of

Re: Odd data corruption problem with LVM/ReiserFS

2005-02-21 Thread Vladimir Saveliev
Hello On Mon, 2005-02-21 at 08:25, Alex Adriaanse wrote: > As of this morning I've experienced some very odd data corruption > problem on my server. Let me post some background information first. > > For the past few years I've been running this server under Linux 2.4.x > and Debian Woody. It h