Re: Optimized Hadoop

2012-02-20 Thread Schubert Zhang
We just update the slides of this improvements: http://www.slideshare.net/hanborq/hanborq-optimizations-on-hadoop-mapreduce-20120216a Updates: (1) modified some describes to make things more clear and accuracy. (2) add some benchmarks to make sense. On Sat, Feb 18, 2012 at 11:12 PM, Anty wrote:

Re: Optimized Hadoop

2012-02-18 Thread Anty
On Fri, Feb 17, 2012 at 3:27 AM, Todd Lipcon wrote: > Hey Schubert, > > Looking at the code on github, it looks like your rewritten shuffle is > in fact just a backport of the shuffle from MR2. I didn't look closely > additionally, the rewritten shuffle in MR2 has some bugs, which harm the overa

Re: Optimized Hadoop

2012-02-17 Thread Todd Lipcon
On Thu, Feb 16, 2012 at 8:25 PM, Schubert Zhang wrote: > 1) it should be sort-avoidance. right - that's a nice improvement, looking forward to getting that in trunk at some point. > 2) work pool (like Tenzing) > Looking at the code, it seems you only support the default task executor. Do you ha

Re: Optimized Hadoop

2012-02-16 Thread Schubert Zhang
1) it should be sort-avoidance. 2) work pool (like Tenzing) Sorry ,the adaptive heartbeat code is not in this github code, we are discussing it. On Fri, Feb 17, 2012 at 11:00 AM, Anty wrote: > Hi: Todd > > yes, the rewritten shuffle in actual a backport of the shuffle from MR2 . > We mainly ad

Re: Optimized Hadoop

2012-02-16 Thread Anty
Hi: Todd yes, the rewritten shuffle in actual a backport of the shuffle from MR2 . We mainly add the following two features: 1) shuffle avoidance 2) work pool On Fri, Feb 17, 2012 at 3:27 AM, Todd Lipcon wrote: > Hey Schubert, > > Looking at the code on github, it looks like your rewritten shuf

Re: Optimized Hadoop

2012-02-16 Thread Todd Lipcon
Hey Schubert, Looking at the code on github, it looks like your rewritten shuffle is in fact just a backport of the shuffle from MR2. I didn't look closely - are there any distinguishing factors? Also, the OOB heartbeat and adaptive heartbeat code seems to be the same as what's in 1.0? -Todd On

Re: Optimized Hadoop

2012-02-16 Thread Schubert Zhang
Here is the presentation to describe our job, http://www.slideshare.net/hanborq/hanborq-optimizations-on-hadoop-mapreduce-20120216aWellcome to give your advises. It's just a little step, and we are continue to do more improvements, thanks for your help. On Thu, Feb 16, 2012 at 11:01 PM, Anty wr