Owen,
it's for entire table. the sample TD query looks like below,
SELECT
columnA
,columnB
, columnC
, columnD
, columnX
,ROW_NUMBER() OVER (PARTITION BY columnA, columnB, columnC ORDER BY
columnX DESC, columnY DESC) AS rank
FROM table a
Regards,
Kumar
-----Original Message-----
From: Owen O'Malley <[email protected]>
To: user <[email protected]>
Sent: Thu, Feb 21, 2013 8:08 am
Subject: Re: ROW_NUMBER() equivalent in Hive
What are the semantics for ROW_NUMBER? Is it a global row number? Per a
partition? Per a bucket?
-- Owen
On Wed, Feb 20, 2013 at 11:33 PM, kumar mr <[email protected]> wrote:
Hi,
This is Kumar, and this is my first question in this group.
I have a requirement to implement ROW_NUMBER() from Teradata in Hive where
partitioning happens on multiple columns along with multiple column ordering.
It can be easily implemented in Hadoop MR, but I have to do in Hive. By doing
in UDF can assign same rank to grouping key considering dataset is small, but
ordering need to be done in prior step.
Can we do this in lot simpler way?
Thanks in advance.
Regards,
Kumar