Hello David, I am following your tip! Thanks.
Also, I configured a small cluster with three datanodes and on my MR program I printed every single key that the reducers received. I set three reducers(setNumReduceTasks). Analyzing the reducer outputs I could see that the keys were distributed as my partitioner ordered. Of course, I had to make things much much smaller than real. I prepared an input, built a small cluster and so on .... to assure a minimal control. Not that I doubt hadoop, I doubt my code, always! :-) Br, Fabio Almeida -----Original Message----- From: ext David Rosenstrauch [mailto:dar...@darose.net] Sent: Friday, February 17, 2012 12:16 AM To: mapreduce-u...@hadoop.apache.org Subject: Re: Partitioners - How to know if they are working On 02/16/2012 12:49 PM, ext-fabio.alme...@nokia.com wrote: > Hello All, > I wrote my own partitioner and I would like to see if it's working. > By printing the return of method getPartition I could see that the partitions > were different, but were they really working? To answer that I got the keys > that every reducer task processed and that was what I expected. It seems my > partitioner is working properly. But not easy to discover though. > Does anyone know if there is an easier way to see if your customized > partitioner is working? For instance, a counter that shows how many > partitioners a map generated or a reducer received? > Thanks in advance, > Fabio Almeida At my last job we wrote a custom partitioner, and we tested it out completely outside of Hadoop using standard JUnit unit tests. HTH, DR