Re: computing median and percentiles

2014-03-20 Thread Stephen Sprague
t; maintains the count, how can Hive be used to derive the percentile? > > Value Count > 100 2 > 200 4 > 300 1 > > Thanks, > Seema > > From: Stephen Sprague > Reply-To: "user@hive.apache.org" > Date: Thursday

Re: computing median and percentiles

2014-03-19 Thread Seema Datar
AM To: "user@hive.apache.org<mailto:user@hive.apache.org>" mailto:user@hive.apache.org>> Subject: Re: computing median and percentiles not a hive question is it? its more like a math question. On Wed, Mar 19, 2014 at 1:30 PM, Seema Datar mailto:sda...@yahoo-inc.com&

Re: computing median and percentiles

2014-03-19 Thread Stephen Sprague
not a hive question is it? its more like a math question. On Wed, Mar 19, 2014 at 1:30 PM, Seema Datar wrote: > > > I understand the percentile function is supported in Hive in the latest > versions. However, how does once calculate percentiles when the data is > across two columns. So say

Re: computing median and percentiles

2014-03-19 Thread Seema Datar
I understand the percentile function is supported in Hive in the latest versions. However, how does once calculate percentiles when the data is across two columns. So say - Value Count 100 2 ( so basically 100 occurred twice) 200 4 300 1 400 6 500 3 I want to find out the 0.25 percentile