Re: Spark SQL (Pyspark) - Parallel processing of multiple datasets

2017-04-17 Thread Amol Patil
ad but maybe the code related > with reading/writing temp table isn't thread safe. > > On Mon, Apr 17, 2017 at 9:45 PM, Amol Patil wrote: > >> Thanks Ryan, >> >> Each dataset has separate hive table. All hive tables belongs to same >> hive database. >>

Re: Spark SQL (Pyspark) - Parallel processing of multiple datasets

2017-04-17 Thread Amol Patil
> > Other than that, it should work. > > On Mon, Apr 17, 2017 at 6:52 AM, Amol Patil > wrote: > >> Hi All, >> >> I'm writing generic pyspark program to process multiple datasets using >> Spark SQL. For example Traffic Data, Crime Data, Weather Data. Da

Spark SQL (Pyspark) - Parallel processing of multiple datasets

2017-04-16 Thread Amol Patil
good idea to process those big datasets in parallel in one job? 3. Any other solution to process multiple datasets in parallel? Thank you, Amol Patil