from:"Amol Patil"

Re: Spark SQL (Pyspark) - Parallel processing of multiple datasets

2017-04-17 Thread Amol Patil

ad but maybe the code related > with reading/writing temp table isn't thread safe. > > On Mon, Apr 17, 2017 at 9:45 PM, Amol Patil wrote: > >> Thanks Ryan, >> >> Each dataset has separate hive table. All hive tables belongs to same >> hive database. >>

Re: Spark SQL (Pyspark) - Parallel processing of multiple datasets

2017-04-17 Thread Amol Patil

> > Other than that, it should work. > > On Mon, Apr 17, 2017 at 6:52 AM, Amol Patil > wrote: > >> Hi All, >> >> I'm writing generic pyspark program to process multiple datasets using >> Spark SQL. For example Traffic Data, Crime Data, Weather Data. Da

Spark SQL (Pyspark) - Parallel processing of multiple datasets

2017-04-16 Thread Amol Patil

good idea to process those big datasets in parallel in one job? 3. Any other solution to process multiple datasets in parallel? Thank you, Amol Patil

Re: Spark SQL (Pyspark) - Parallel processing of multiple datasets

Re: Spark SQL (Pyspark) - Parallel processing of multiple datasets

Spark SQL (Pyspark) - Parallel processing of multiple datasets

3 matches

Site Navigation

Mail list logo

Footer information