> maintains the count, how can Hive be used to derive the percentile?
>
> Value Count
> 100 2
> 200 4
> 300 1
>
> Thanks,
> Seema
>
> From: Stephen Sprague
> Reply-To: "user@hive.apache.org"
> Date: Thursday
AM
To: "user@hive.apache.org"
Subject: Re: computing median and percentiles
Not a Hive question, is it? It's more like a math question.
On Wed, Mar 19, 2014 at 1:30 PM, Seema Datar wrote:
I understand the percentile function is supported in the latest versions of
Hive. However, how does one calculate percentiles when the data is spread
across two columns? So say -
Value Count
100 2 (so basically 100 occurred twice)
200 4
300 1
400 6
500 3
I want to find out the 0.25 percentile
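
For illustration only, here is a minimal Python sketch of the arithmetic behind the question, not a Hive query. It assumes the nearest-rank definition of a percentile (other definitions interpolate between values and can give different answers), and the function name weighted_percentile is hypothetical. The idea is to sort by value, keep a running total of the counts, and return the first value whose running total reaches the target rank.

```python
from math import ceil


def weighted_percentile(rows, p):
    """Nearest-rank percentile over (value, count) pairs, with 0 < p <= 1.

    Hypothetical helper: assumes the nearest-rank definition, so no
    interpolation between neighbouring values is performed.
    """
    if not 0 < p <= 1:
        raise ValueError("p must be in (0, 1]")
    total = sum(count for _, count in rows)
    if total <= 0:
        raise ValueError("total count must be positive")

    # 1-based rank of the observation we are looking for.
    rank = max(1, ceil(p * total))

    running = 0
    for value, count in sorted(rows):
        running += count  # cumulative count up to and including this value
        if running >= rank:
            return value
    raise AssertionError("unreachable: cumulative count always reaches total")


# The (value, count) pairs from the message above.
data = [(100, 2), (200, 4), (300, 1), (400, 6), (500, 3)]
print(weighted_percentile(data, 0.25))
```

With these 16 observations the target rank for 0.25 is 4, and the running totals are 2, 6, 7, 13, 16, so the sketch returns 200 under the nearest-rank assumption.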