Re: Largest input data set observed for Spark.

Henry Saputra Thu, 20 Mar 2014 11:30:34 -0700

Reynold, just curious did you guys ran it in AWS?

- Henry


On Thu, Mar 20, 2014 at 11:08 AM, Reynold Xin <[email protected]> wrote:
> Actually we just ran a job with 70TB+ compressed data on 28 worker nodes -
> I didn't count the size of the uncompressed data, but I am guessing it is
> somewhere between 200TB to 700TB.
>
>
>
> On Thu, Mar 20, 2014 at 12:23 AM, Usman Ghani <[email protected]> wrote:
>
>> All,
>> What is the largest input data set y'all have come across that has been
>> successfully processed in production using spark. Ball park?
>>

Re: Largest input data set observed for Spark.

Reply via email to