Marc Shapiro <marcns...@gmail.com> writes:

> BTW, what is your third partition, and why would you not separate it
> now if starting from scratch?
My third partition is for backups which I make to protect against
software or operator error.  At one point it was on a separate disk
since disks were small and without LVM had to be a different
partition/file system.
>
>
> One other question.  If using raid, how do you know when a disk is
> starting to have trouble, as mine did?  Since the whole purpose of
...
> Marc

Ok - I'm pretty paranoid about that.  smart is checking.
mdadm will notice if a disk is bad and turn
it off, so to speak.  Again in the logs.
I run a cron job to check form smart errors based on:

    smartctl -l error -q errorsonly "device"
    smartctl -H -q errorsonly "device"

But I've always checked all my disks once a week.  A root cron job
reads the whole disk with dd into /dev/null.  Any error get logged, of
course.  Separately, a cron job scans syslog and syslog.1 grepping for
"IO Error" and informs me by email if any new errors are found.  This
catches error in the dd check but also actual errors in operation.

Reply via email to