Also, consider combining SMART hard drive monitoring (most drives support SMART at this point) with Nagios.

It often lets us know when a hard drive is about to fail *and* when the drive is under-performing.
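
As a rough illustration of how SMART and Nagios might be wired together, here is a minimal sketch of a Nagios-style check. It assumes smartmontools is installed and that `smartctl -H` prints either an "overall-health self-assessment" line (ATA drives) or a "SMART Health Status" line (SCSI drives). The function names and the default device `/dev/sda` are made up for the example, so treat it as a starting point rather than a finished plugin.

```python
import subprocess

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def classify_health(smartctl_output):
    """Map the text printed by `smartctl -H` to a (nagios_code, message) pair."""
    for line in smartctl_output.splitlines():
        # ATA drives report "overall-health self-assessment test result: ...",
        # SCSI drives report "SMART Health Status: ...".
        if "overall-health self-assessment" in line or "SMART Health Status" in line:
            verdict = line.split(":", 1)[1].strip()
            if verdict in ("PASSED", "OK"):
                return OK, "SMART OK - " + verdict
            return CRITICAL, "SMART CRITICAL - " + verdict
    return UNKNOWN, "SMART UNKNOWN - no health line in smartctl output"


def check_device(device="/dev/sda"):
    """Run `smartctl -H` on one device (hypothetical default) and classify it."""
    try:
        proc = subprocess.run(
            ["smartctl", "-H", device], capture_output=True, text=True
        )
    except OSError as exc:
        return UNKNOWN, "SMART UNKNOWN - cannot run smartctl: %s" % exc
    return classify_health(proc.stdout)


def main(argv):
    """Print the status line and return the exit code Nagios expects."""
    device = argv[1] if len(argv) > 1 else "/dev/sda"
    code, message = check_device(device)
    print(message)
    return code
```

To use it as a plugin, a wrapper script would call `sys.exit(main(sys.argv))`. smartctl usually needs root, so the Nagios user would typically run it through sudo or a similar mechanism. This only covers the "about to fail" side; spotting an under-performing drive would need additional checks on specific SMART attributes.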

Brian

On Feb 3, 2009, at 6:18 PM, Aaron Kimball wrote:

Dmitry,

Look into cluster/system monitoring tools: nagios and ganglia are two to
start with.
- Aaron

On Tue, Feb 3, 2009 at 9:53 AM, Dmitry Pushkarev <[email protected]> wrote:

Dear hadoop users,



Recently I have had a number of drive failures that slowed down processes a lot until they were discovered. Is there any easy way or tool to check
HDD performance and see if there are any IO errors?

Currently I use a simple script I wrote that looks at /var/log/messages and greps everything abnormal for /dev/sdaX. But if you have a better solution, I'd
appreciate it if you shared it.
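
For reference, a log-scanning script of the kind described above could look roughly like the sketch below. It is not the original script: the error patterns are assumptions (kernel message wording differs between versions), and the function names are invented for the example.

```python
import re

# Assumed error phrases; adjust to whatever your kernel actually logs.
ERROR_PATTERNS = re.compile(
    r"I/O error|medium error|unrecovered read error", re.IGNORECASE
)


def find_disk_errors(lines, device="sda"):
    """Return log lines that mention the device (or a partition) and an error."""
    # Matches "sda" as well as partitions such as "sda1", but not "sdaa".
    dev = re.compile(r"\b%s\d*\b" % re.escape(device))
    return [
        line.rstrip("\n")
        for line in lines
        if dev.search(line) and ERROR_PATTERNS.search(line)
    ]


def scan_log(path="/var/log/messages", device="sda"):
    """Scan a syslog file for errors on one device."""
    with open(path, errors="replace") as handle:
        return find_disk_errors(handle, device)
```

Calling `scan_log(device="sdb")` from cron and mailing any non-empty result would reproduce the grep approach. It only catches errors the kernel has already logged, so it will not flag a drive that is merely slow.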



---

Dmitry Pushkarev

+1-650-644-8988




