Hi All , Thanks for input I think I got enough information and also https://groups.google.com/forum/#!topic/camus_etl/1FcpqCnC5M4 gave me more info about the this.
Thank you all for entertaining my question. I am in luck on both form :) Thanks, Bhavesh On Tue, Feb 3, 2015 at 12:56 PM, Joel Koshy <jjkosh...@gmail.com> wrote: > There was some confusion here - turns out that they do turn it on. I added > Tu > to this thread and his response: > > <quote> > We have speculative set to true by default. With these settings, we are > seeing about 5-7% of the tasks have speculative tasks launched, other 90% > finished within the standard deviations difference and thus speculation > tasks were never launched. This will ensure if we have a slow datanode, > our job would not be impacted. > > Camus is setup to consume 10 minutes worth of offset/topic/run. If a topic > has more than 10 minutes of offset to be consumed, speculative will also > be active for that topic. We haven't play much with this setting. > However, if we ever get into a situation where we have to do catchup, it's > good to have this setting disabled. > > mapreduce.job.speculative.slownodethreshold 1.0 > mapreduce.job.speculative.speculativecap 0.1 > > mapreduce.map.speculative true > </quote> > > On Tue, Feb 03, 2015 at 05:14:02PM +0000, Aditya Auradkar wrote: > > Hi Bhavesh, > > > > I just checked with one of the devs on the Camus team. We run the Camus > job with speculative execution disabled. > > > > Aditya > > > > ________________________________________ > > From: Pradeep Gollakota [pradeep...@gmail.com] > > Sent: Monday, February 02, 2015 11:15 PM > > To: users@kafka.apache.org > > Subject: Re: Kafka ETL Camus Question > > > > Hi Bhavesh, > > > > At Lithium, we don't run Camus in our pipelines yet, though we plan to. > But > > I just wanted to comment regarding speculative execution. We have it > > disabled at the cluster level and typically don't need it for most of our > > jobs. Especially with something like Camus, I don't see any need to run > > parallel copies of the same task. > > > > On Mon, Feb 2, 2015 at 10:36 PM, Bhavesh Mistry < > mistry.p.bhav...@gmail.com> > > wrote: > > > > > Hi Jun, > > > > > > Thanks for info. I did not get answer to my question there so I > thought I > > > try my luck here :) > > > > > > Thanks, > > > > > > Bhavesh > > > > > > On Mon, Feb 2, 2015 at 9:46 PM, Jun Rao <j...@confluent.io> wrote: > > > > > > > You can probably ask the Camus mailing list. > > > > > > > > Thanks, > > > > > > > > Jun > > > > > > > > On Thu, Jan 29, 2015 at 1:59 PM, Bhavesh Mistry < > > > > mistry.p.bhav...@gmail.com> > > > > wrote: > > > > > > > > > Hi Kafka Team or Linked-In Team, > > > > > > > > > > I would like to know if you guys run Camus ETL job with speculative > > > > > execution true or false. Does it make sense to set this to false ? > > > > Having > > > > > true, it creates additional load on brokers for each map task > (create a > > > > map > > > > > task to pull same partition twice). Is there any advantage to this > > > > having > > > > > it on vs off ? > > > > > > > > > > mapred.map.tasks.speculative.execution > > > > > > > > > > Thanks, > > > > > > > > > > Bhavesh > > > > > > > > > > > > > >