Re: LIMIT statement on SparkSQL

2016-10-26 Thread Liz Bai
Sorry for the typo in last mail. Compared with the Query-2, we have questions in Query-1 and Query-3. Also, may I know the difference between CollectLimit and BaseLimit? Thanks so much. Best, Liz > On 26 Oct 2016, at 7:25 PM, Liz Bai wrote: > > Hi all, > > We used Parquet and Spark 2.0 to do t

LIMIT statement on SparkSQL

2016-10-26 Thread Liz Bai
Hi all, We used Parquet and Spark 2.0 to do the testing. The table below is the summary of what we have found about `Limit` keyword. Query-2 reveals that SparkSQL does early stop upon getting adequate results. But we are curious of Query-1 and Query-2. It seems that, either writing result RDD a