Dear Friends, I am Anandkumar, working as a test engineer in eBay and we use Hadoop extensively to store our log. I am in situation to validate or test our Data perfectly reaching the Hadoop infrastructure or not. could anyone of you recommend me the best testing methodologies and if there is any existing framework for testing Hadoop, please recommend to me.
My scenario is simple of Client will dump millions of data to Hadoop, I need to validate that the data has reached Hadoop perfectly and also there is not Data loss and also other testing like scalability and reliability. Anticipating your support Thanks, Anandkumar