Re: How to timeout a task?

2015-07-03 Thread William Ferrell
Ted, Thanks very much for your reply. It took me almost a week but I have finally had a chance to implement what you noted and it appears to be working locally. However, when I launch this onto a cluster on EC2 -- this doesn't work reliably. To expand, I think the issue is that some of the code w

Re: Killing Long running tasks (stragglers)

2015-06-25 Thread William Ferrell
ata. It will be helpful > if you can paste your code so that we will understand it better. > > Thanks > Best Regards > > On Wed, Jun 24, 2015 at 2:32 PM, William Ferrell > wrote: > >> Hello - >> >> I am using Apache Spark 1.2.1 via pyspark. Thanks to

Killing Long running tasks (stragglers)

2015-06-24 Thread William Ferrell
Hello - I am using Apache Spark 1.2.1 via pyspark. Thanks to any developers here for the great product! In my use case, I am running spark jobs to extract data from some raw data. Generally this works quite well. However, I am noticing that for certain data sets there are certain tasks that are