I like the approach of applying an arbitrary limit. Hive's q files tend to add an ordering to everything. Would it make sense to simply order by multiple columns in the result set and conduct a large diff on them?
On Wednesday, June 26, 2019, Sungwoo Park <glap...@gmail.com> wrote: > I have published a new article on the correctness of Hive on MR3, Presto, > and Impala: > > https://mr3.postech.ac.kr/blog/2019/06/26/correctness- > hivemr3-presto-impala/ > > Hope you enjoy reading the article. > > --- Sungwoo > > -- Sorry this was sent from mobile. Will do less grammar and spell check than usual.