Re: how to write hadoop client to test HDFS with fault injection

2011-11-29 Thread Hao Yang
Thanks very much for the above. In 0.21 I found TestDFSIO in the mapred-test.jar, but I didn't find Teragen and Terasort in the three test jars (I found Teragen and Terasort in 0.20.04 test jars, but they cannot be executed in 0.21). When I run TestDFSIO benchmark, I sometimes encounter the situati

Re: how to write hadoop client to test HDFS with fault injection

2011-11-29 Thread Konstantin Boudnik
Once you have an instrumented cluster you can run some end-to-end tests (TeraSort has been suggested and it would do fine). The tricky part is to assemble all moving parts of the Hadoop cluster together from three different projects (if you are on .21+ branch) and will be a way easier for 0.20.+ b

Re: how to write hadoop client to test HDFS with fault injection

2011-11-29 Thread Steve Loughran
I'd add that one thing I'd like is better low level hardware fault simulation, something that doesn't inject faults into the software, so much as stamp on the underlying data or kill bits of the stack entirely That is -deliberately corrupting blocks or checksums -deleting blocks -dropping

RE: how to write hadoop client to test HDFS with fault injection

2011-11-29 Thread Brad Sarsfield
Hi Hao, You may want to setup a full Hadoop 'one box' install and start running the TestDFSIO, Teragen and Terasort (with a small amount of data) scenarios using the injected jar to run hdfs. That may be the fastest and best way to get up and running. When it comes to fault injection the valu