Re: SU+J systems do not fsck themselves

2011-12-29 Thread Scott Long
On Dec 29, 2011, at 4:02 PM, David Thiel wrote: > On Wed, Dec 28, 2011 at 12:57:31AM -0700, Scott Long wrote: >> So, there's an assumption with SUJ+fsck that SU is keeping the filesystem >> consistent. Maybe that's a bad assumption, and I'm not trying to discredit >> your report. But the inte

Re: SU+J systems do not fsck themselves

2011-12-29 Thread David Thiel
On Thu, Dec 29, 2011 at 03:02:14PM -0800, David Thiel wrote: > = > Machine 1, with journal: > = > > Script started on Thu Dec 29 11:26:29 2011 > fsck / > ** /dev/ada0.eli Correction - machine 1 had an unclean shutdown. Will get additional logs soon. __

Re: SU+J systems do not fsck themselves

2011-12-29 Thread David Thiel
On Wed, Dec 28, 2011 at 12:57:31AM -0700, Scott Long wrote: > So, there's an assumption with SUJ+fsck that SU is keeping the filesystem > consistent. Maybe that's a bad assumption, and I'm not trying to discredit > your report. But the intention with SUJ is to eliminate the need for > anything

Re: SU+J systems do not fsck themselves

2011-12-28 Thread Lev Serebryakov
Hello, Mdf. You wrote 28 декабря 2011 г., 23:14:19: > Not required by SU as they use an explicit BIO_FLUSH which should be > handled by the driver. No, they don't. It was discussed here about month ago. -- // Black Lion AKA Lev Serebryakov ___ fre

Re: SU+J systems do not fsck themselves

2011-12-28 Thread Kostik Belousov
On Wed, Dec 28, 2011 at 11:14:19AM -0800, m...@freebsd.org wrote: > SU doesn't care about write ordering, as long as everything before a > BIO_FLUSH is really flushed by the time the BIO_FLUSH is acknowledged. No. SU and SU+J only require that write completed notification is issued when geom/drive

Re: SU+J systems do not fsck themselves

2011-12-28 Thread mdf
On Wed, Dec 28, 2011 at 8:54 AM, Maxim Khitrov wrote: > On Wed, Dec 28, 2011 at 11:42 AM, Matthias Andree > wrote: >> Am 27.12.2011 22:53, schrieb David Thiel: >>> I've had multiple machines now (9.0-RC3, amd64, i386 and earlier >>> 9-CURRENT on ppc) running SU+J that have had unexplained panics

Re: SU+J systems do not fsck themselves

2011-12-28 Thread Maxim Khitrov
On Wed, Dec 28, 2011 at 11:42 AM, Matthias Andree wrote: > Am 27.12.2011 22:53, schrieb David Thiel: >> I've had multiple machines now (9.0-RC3, amd64, i386 and earlier >> 9-CURRENT on ppc) running SU+J that have had unexplained panics and >> crashes start happening relating to disk I/O. When I en

Re: SU+J systems do not fsck themselves

2011-12-28 Thread Matthias Andree
Am 27.12.2011 22:53, schrieb David Thiel: > I've had multiple machines now (9.0-RC3, amd64, i386 and earlier > 9-CURRENT on ppc) running SU+J that have had unexplained panics and > crashes start happening relating to disk I/O. When I end up running a > full fsck, it keeps turning out that the di

Re: SU+J systems do not fsck themselves

2011-12-27 Thread Scott Long
On Dec 28, 2011, at 12:34 AM, David Thiel wrote: > On Tue, Dec 27, 2011 at 11:54:20PM -0700, Scott Long wrote: >> The first run of fsck, using the journal, gives results that I would >> expect. The second run seems to imply that the fixes made on the >> first run didn't actually get written to

Re: SU+J systems do not fsck themselves

2011-12-27 Thread David Thiel
On Tue, Dec 27, 2011 at 11:54:20PM -0700, Scott Long wrote: > The first run of fsck, using the journal, gives results that I would > expect. The second run seems to imply that the fixes made on the > first run didn't actually get written to disk. This is definitely an > oddity. I see that you

Re: SU+J systems do not fsck themselves

2011-12-27 Thread Scott Long
On Dec 27, 2011, at 10:14 PM, David Thiel wrote: > On Tue, Dec 27, 2011 at 02:48:22PM -0800, Xin Li wrote: - use journalled fsck; - use normal fsck to check if the journalled fsck did the right thing. > > Ok, here is the log of fsck with and without journal. > > http://redundancy.redu

Re: SU+J systems do not fsck themselves

2011-12-27 Thread David Thiel
On Tue, Dec 27, 2011 at 02:48:22PM -0800, Xin Li wrote: > >> - use journalled fsck; - use normal fsck to check if the > >> journalled fsck did the right thing. Ok, here is the log of fsck with and without journal. http://redundancy.redundancy.org/fscklog3 That was done the very next boot, after

Re: SU+J systems do not fsck themselves

2011-12-27 Thread Xin Li
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 On 12/27/11 14:36, David Thiel wrote: > On Tue, Dec 27, 2011 at 02:29:03PM -0800, Xin LI wrote: >> I'm not sure if your experiments are right here, the second log >> shows you're running it read-only, which is likely caused by >> running it on live fil

Re: SU+J systems do not fsck themselves

2011-12-27 Thread David Thiel
On Tue, Dec 27, 2011 at 02:29:03PM -0800, Xin LI wrote: > I'm not sure if your experiments are right here, the second log shows > you're running it read-only, which is likely caused by running it on > live file system. Yes, this most recent instance is me running it on a live FS, because I'm us

Re: SU+J systems do not fsck themselves

2011-12-27 Thread Xin LI
On Tue, Dec 27, 2011 at 1:53 PM, David Thiel wrote: > I've had multiple machines now (9.0-RC3, amd64, i386 and earlier > 9-CURRENT on ppc) running SU+J that have had unexplained panics and > crashes start happening relating to disk I/O. When I end up running a > full fsck, it keeps turning out tha

SU+J systems do not fsck themselves

2011-12-27 Thread David Thiel
I've had multiple machines now (9.0-RC3, amd64, i386 and earlier 9-CURRENT on ppc) running SU+J that have had unexplained panics and crashes start happening relating to disk I/O. When I end up running a full fsck, it keeps turning out that the disk is dirty and corrupted, but no mechanism is in