Thanks for the suggestion. I don't think sampling helps here, as I need to get the top 5% of rows ordered by a particular column (not a random sampling)
On Wed, Mar 28, 2012 at 5:03 PM, Gabi D <gabi...@gmail.com> wrote: > James, > See if sampling > <https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Sampling>is > what you need > > > > > On Wed, Mar 28, 2012 at 5:53 PM, James Newhaven > <james.newha...@gmail.com>wrote: > >> I am trying to write a query that will return the first 5% of rows in a >> table. >> >> I've struggled with this for quite a while and can't figure out a command >> that works in Hive. >> >> Has anyone done this? >> >> Thanks, >> James >> > >