Re: DynamoDB data source questions

2016-04-14 Thread Travis Crawford
right direction to finish this up. Thanks, Travis On Wed, Apr 13, 2016 at 10:40 AM Reynold Xin wrote: > Responses inline > > On Wed, Apr 13, 2016 at 7:45 AM, Travis Crawford > wrote: > >> Hi Spark gurus, >> >> At Medium we're using Spark for an ETL job t

DynamoDB data source questions

2016-04-13 Thread Travis Crawford
Hi Spark gurus, At Medium we're using Spark for an ETL job that scans DynamoDB tables and loads into Redshift. Currently I use a parallel scanner implementation that writes files to local disk, then have Spark read them as a DataFrame. Ideally we could read the DynamoDB table directly as a DataFr