Sorry for the typo in my last mail.
Query-2 behaves as expected; our questions are about Query-1 and Query-3.
Also, may I know the difference between CollectLimit and BaseLimit?
Thanks so much.
Best,
Liz
> On 26 Oct 2016, at 7:25 PM, Liz Bai wrote:
Hi all,
We used Parquet and Spark 2.0 for the testing. The table below summarizes what
we have found about the `Limit` keyword. Query-2 shows that SparkSQL stops
early once it has enough results. But we are curious about Query-1 and
Query-2. It seems that, either writing result RDD a