It will be helpful if you paste some sample data to repro. I have used collect_set and it works as documented for me.
Thanks, Viral On Sun, Sep 7, 2014 at 10:39 AM, Shushant Arora <shushantaror...@gmail.com> wrote: > While group by, if I do collect_set on some other column , documentation > says it will return Array of that column after removing duplicates, but its > not doing dedup?Is it expected? >