Build failed in Jenkins: Hadoop-Common-0.23-Build #77

2011-11-26 Thread Apache Jenkins Server
See Changes: [szetszwo] svn merge -c 1206178 from trunk for HDFS-2590. -- [...truncated 7743 lines...] [INFO] --- maven-jar-plugin:2.3.1:jar (default-jar) @ hadoop-dist --- [WARNING] JAR w

Hadoop - Distributed sorting

2011-11-26 Thread madhu_sushmi
Hi, I need to implement distributed sorting using Hadoop. I am quite new to Hadoop and I am getting confused. If I want to implement Merge sort, what my Map and reduce should be doing. ? Should all the sorting happen at reduce side? Please help. This is an urgent requirement. Please guide me. T

Hadoop - Distributed sorting

2011-11-26 Thread madhu_sushmi
Hi, I need to implement distributed sorting using Hadoop. I am quite new to Hadoop and I am getting confused. If I want to implement Merge sort, what my Map and reduce should be doing. ? Should all the sorting happen at reduce side? Please help. This is an urgent requirement. Please guide me. T

Re: Hadoop - Distributed sorting

2011-11-26 Thread Prashant Sharma
Madhu, You can check out sorting code in examples. Actually you dont need to do anything for sorting. Map-reduce framework does the (merge-sort)sorting(which happens during shuffle phase before reducer even starts.) for you, all you need to do is make column you want to sort on as your key in ma

Re: Copying data between Hadoop

2011-11-26 Thread Linden Hillenbrand
I agree, distcp is the way to go here, you can specify down to the individual directory on HDFS. Also, you can add some flags to overwrite/not overwrite. It's pretty flexible IMO. On Thu, Nov 24, 2011 at 6:15 AM, Uma Maheswara Rao G wrote: > Why con't you use distcp command here? > > Regards, > U

[jira] [Created] (HADOOP-7863) Apply HADOOP-7424 "Log an error if the topology script doesn't handle multiple args" to 0.23 branch

2011-11-26 Thread Steve Loughran (Created) (JIRA)
Apply HADOOP-7424 "Log an error if the topology script doesn't handle multiple args" to 0.23 branch Key: HADOOP-7863 URL: https://issues.apache.org/jira/browse/H