Also, you want to look at combining SMART hard drive monitoring (most
drives support SMART at this point) and combine it with Nagios.
It often lets us known when a hard drive is about to fail *and* when
the drive is under-performing.
Brian
On Feb 3, 2009, at 6:18 PM, Aaron Kimball wrote:
Dmitry,
Look into cluster/system monitoring tools: nagios and ganglia are
two to
start with.
- Aaron
On Tue, Feb 3, 2009 at 9:53 AM, Dmitry Pushkarev <[email protected]>
wrote:
Dear hadoop users,
Recently I have had a number of drive failures that slowed down
processes a
lot until they were discovered. It is there any easy way or tool,
to check
HDD performance and see if there any IO errors?
Currently I wrote a simple script that looks at /var/log/messages
and greps
everything abnormal for /dev/sdaX. But if you have better solution
I'd
appreciate if you share it.
---
Dmitry Pushkarev
+1-650-644-8988