RE: Graph/Plotting in Hive/Hadoop

2012-04-12 Thread karanveer.singh
Any usages of histogram UDF that you can share please? Regards, -Original Message- From: Edward Capriolo [mailto:edlinuxg...@gmail.com] Sent: 12 April 2012 01:33 To: user@hive.apache.org Subject: Re: Graph/Plotting in Hive/Hadoop The historgram UDF can be used to create data from GNU-

RE: Lag function in Hive

2012-04-11 Thread karanveer.singh
That's the whole problem rite, I am unable to create a unique column for my record rows within Hive. If that's there, I can get the lag functionality to work for me. I was hoping that ROWNUM will act like a pseudo column in Hive. Regards, From: Nitin Pawar [mai

RE: Lag function in Hive

2012-04-11 Thread karanveer.singh
Rob n all - I tried below and created the jar file. For adding jar to class path, I do following: hive> add jar /users/unix/singhka/Analytics.jar; The above seems to have worked fine as I see the resource added but when I go ahead and create a function, I get the following error. Any ideas wh

Graph/Plotting in Hive/Hadoop

2012-04-10 Thread karanveer.singh
Hi, Is there some way I can do graph, plotting, etc. within Hive? Are there any UDF's for same? Regards, Karan This e-mail and any attachments are confidential and intended solely for the addressee and may also be privileged or exempt from disclosure under applicable law. If you are not the

RE: Lag function in Hive

2012-04-10 Thread karanveer.singh
When I try using rownum in my Hive QL query, I get: "Invalid column reference rownum". Am I missing something here? Regards, Karan -Original Message- From: David Kulp [mailto:dk...@fiksu.com] Sent: 10 April 2012 20:15 To: user@hive.apache.org Subject: Re: Lag function in Hive New he

Web Log Analysis

2012-04-10 Thread karanveer.singh
Hi, Are there some UDF's in hive which will allow us to do quick analysis of web logs like click stream, unique visits, etc. Rgds, Karan This e-mail and any attachments are confidential and intended solely for the addressee and may also be privileged or exempt from disclosure under applicable la

Re: Lag function in Hive

2012-04-10 Thread karanveer.singh
Thanks - I will check this out. Meanwhile, would default clustering happen using rownum? How can I check on how is clustering happening in our environment? Rgds - Original Message - From: David Kulp To: user@hive.apache.org Sent: Tue Apr 10 15:45:25 2012 Subject: Re: Lag function in

RE: Lag function in Hive

2012-04-10 Thread karanveer.singh
Makes sense but is not the distribution across nodes for a chunk of records in that order. If Hive cannot help me do this, is there another way I can do this? I tried generating an identifier using the perl script invoked using Hive but it does not seem to work fine. While the stand alone scri

Lag function in Hive

2012-04-10 Thread karanveer.singh
Hi, Is there something like a 'lag' function in HIVE? The requirement is to calculate difference for the same column for every 2 subsequent records. For example. Row, Column A, Column B 1, 10, 100 2, 20, 200 3, 30, 300 The result that I need should be like: Row, Column A, Column B, Result 1