Ted,
Thanks very much for your reply. It took me almost a week, but I have
finally had a chance to implement what you noted, and it appears to be
working locally. However, when I launch it on an EC2 cluster, it
doesn't work reliably.
To expand, I think the issue is that some of the code w
ata. It will be helpful
> if you can paste your code so that we will understand it better.
>
> Thanks
> Best Regards
>
> On Wed, Jun 24, 2015 at 2:32 PM, William Ferrell
> wrote:
>
>> Hello -
>>
>> I am using Apache Spark 1.2.1 via pyspark. Thanks to
Hello -
I am using Apache Spark 1.2.1 via pyspark. Thanks to any developers here
for the great product!
In my use case, I am running Spark jobs to extract data from some raw data.
Generally this works quite well.
However, I am noticing that for certain data sets there are certain tasks
that are