Re: Sample sql query using pyspark

2016-03-01 Thread Maurin Lenglart
ot;user@spark.apache.org<mailto:user@spark.apache.org>" mailto:user@spark.apache.org>> Subject: Re: Sample sql query using pyspark Maurin, I don't know the technical reason why but: try removing the 'limit 100' part of your query. I was trying to do something simi

Re: Sample sql query using pyspark

2016-03-01 Thread James Barney
Maurin, I don't know the technical reason why but: try removing the 'limit 100' part of your query. I was trying to do something similar the other week and what I found is that each executor doesn't necessarily get the same 100 rows. Joins would fail or result with a bunch of nulls when keys weren