Marc Shapiro <marcns...@gmail.com> writes: > BTW, what is your third partition, and why would you not separate it > now if starting from scratch? My third partition is for backups which I make to protect against software or operator error. At one point it was on a separate disk since disks were small and without LVM had to be a different partition/file system. > > > One other question. If using raid, how do you know when a disk is > starting to have trouble, as mine did? Since the whole purpose of ... > Marc
Ok - I'm pretty paranoid about that. smart is checking. mdadm will notice if a disk is bad and turn it off, so to speak. Again in the logs. I run a cron job to check form smart errors based on: smartctl -l error -q errorsonly "device" smartctl -H -q errorsonly "device" But I've always checked all my disks once a week. A root cron job reads the whole disk with dd into /dev/null. Any error get logged, of course. Separately, a cron job scans syslog and syslog.1 grepping for "IO Error" and informs me by email if any new errors are found. This catches error in the dd check but also actual errors in operation.