Thanks for the suggestion.

I don't think sampling helps here, as I need to get the top 5% of rows
ordered by a particular column (not a random sampling)



On Wed, Mar 28, 2012 at 5:03 PM, Gabi D <gabi...@gmail.com> wrote:

> James,
> See if sampling
> <https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Sampling>is
> what you need
>
>
>
>
> On Wed, Mar 28, 2012 at 5:53 PM, James Newhaven 
> <james.newha...@gmail.com>wrote:
>
>> I am trying to write a query that will return the first 5% of rows in a
>> table.
>>
>> I've struggled with this for quite a while and can't figure out a command
>> that works in Hive.
>>
>> Has anyone done this?
>>
>> Thanks,
>> James
>>
>
>

Reply via email to