How to use the Hive Lineage Service

2015-09-11 Thread sumit ghosh
Hi, I am trying to use the lineage service built into Hive. I need the tables used, the columns at the source, and how they are related to the target. Hive has this lineage service: hive --service lineage `cat myQuery` However, it always errors out, failing to parse the Hive query. What am I doing wro
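
One possible cause, not confirmed by the truncated message above: an unquoted `cat myQuery` in backticks is split on whitespace by the shell, so the query arrives as many separate arguments. If the lineage tool reads the query from a single argument (an assumption), it would see only the first word. A minimal sketch of the difference, using a hypothetical query and a stand-in function instead of hive:

```shell
# Stand-in for the query file; the query text is a hypothetical example.
query_file=$(mktemp)
printf 'SELECT a, b FROM src\n' > "$query_file"

# Stand-in for the hive command: reports how many arguments it received.
count_args() { echo "$#"; }

# Unquoted backticks: the shell splits the query into separate words.
unquoted=$(count_args `cat "$query_file"`)

# Quoted substitution: the whole query stays one argument.
quoted=$(count_args "$(cat "$query_file")")

echo "unquoted: $unquoted argument(s), quoted: $quoted argument(s)"
rm -f "$query_file"
```

Under that assumption, the invocation would become hive --service lineage "$(cat myQuery)".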

Re: DECIMAL precision is too small

2014-06-28 Thread sumit ghosh
Did you try BigDecimal? It is the same datatype as Java BigDecimal. On Thursday, 26 June 2014 8:34 AM, Brian Jeltema wrote: Sorry, I meant 128 bit On Jun 26, 2014, at 11:31 AM, Brian Jeltema wrote: > I need to represent an unsigned 64-bit value as a Hive DECIMAL. The current > precisi

Re: Reg: Merging Rows

2014-06-24 Thread sumit ghosh
Did you try sum(col1), sum(col2) ... group by id? On Tuesday, 24 June 2014 1:23 PM, usha hive wrote: Hi, I am trying to merge a few rows into 1 row. I am stuck. Please help me. Example id  col1      col2      col3        col4  1   44       NULL    NULL    NULL 1  NULL     37       NULL

Skewed Join

2013-06-06 Thread sumit ghosh
Hi, I am hit by a skewed join: my last reducer is getting the same number of Reduce input groups and records. Reduce input groups  432,446,942 Reduce shuffle bytes  13,012,613,275 Reduce input records 432,446,942    Why is this happening? I have tur

Re: Variable resolution Fails

2013-05-01 Thread sumit ghosh
Thanks for the solution & the tip. :) From: Sanjay Subramanian To: "user@hive.apache.org" ; sumit ghosh Sent: Tuesday, 30 April 2013 7:44 PM Subject: Re: Variable resolution Fails +1  agreed Also as a general script programming practice

Variable resolution Fails

2013-04-30 Thread sumit ghosh
Hi,   The following variable freq fails to resolve:   bash-4.1$ export freq=MNTH bash-4.1$ echo $freq MNTH bash-4.1$ hive -e "select ${env:freq} as dr  from dual" Logging initialized using configuration in file:/etc/hive/conf.dist/hive-log4j.properties Hive history file=/hadoop1/hive_querylog/sum
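
A possible cause, not confirmed by the truncated output above: inside double quotes, bash expands `${env:freq}` itself before hive ever sees it. Bash reads the expression as a substring expansion of a shell variable named `env`, which is normally unset, so the reference vanishes from the query. Single quotes, or an escaped `\$`, pass the text through unchanged for Hive's own variable substitution. A minimal sketch of the string each quoting style would hand to `hive -e`:

```shell
export freq=MNTH

# Single quotes: the shell leaves ${env:freq} untouched.
single_quoted='select ${env:freq} as dr from dual'

# Double quotes with an escaped dollar sign: same result.
escaped="select \${env:freq} as dr from dual"

echo "$single_quoted"
echo "$escaped"

# Either string can then be passed on, for example:
#   hive -e "$single_quoted"
```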

Re: Set JobClient properties from hive CLI

2013-04-22 Thread sumit ghosh
Thanks Mike. From: Mike Liddell To: "user@hive.apache.org" ; sumit ghosh Sent: Monday, 22 April 2013 11:02 AM Subject: RE: Set JobClient properties from hive CLI Dropping u...@hadoop.apache.org   try “set hive.exec.counters.pull.interval = 5000;
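
The suggestion above could be applied to a one-off CLI run by placing the `set` statement before the query in the same script. A minimal sketch, where `with_pull_interval` and `my_table` are hypothetical names used only for illustration:

```shell
# Hypothetical helper: prefix a query with the setting suggested in the reply.
with_pull_interval() {
  printf 'set hive.exec.counters.pull.interval=%s; %s\n' "$1" "$2"
}

# my_table is a placeholder table name.
script=$(with_pull_interval 5000 'select count(*) from my_table')
echo "$script"

# The combined script could then be run in one CLI session:
#   hive -e "$script"
```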

Re: Set JobClient properties from hive CLI

2013-04-21 Thread sumit ghosh
Hi,   Sending to a wider audience - How do I set jobclient.progress.monitor.poll.interval in Hive?   Thanks, Sumit From: sumit ghosh To: "user@hive.apache.org" Sent: Saturday, 20 April 2013 1:28 AM Subject: Set JobClient properties from hive CLI

Set JobClient properties from hive CLI

2013-04-20 Thread sumit ghosh
Hi, I am trying to set the property to change the logging interval to more than 5 seconds. Hence I am using the following parameter:  jobclient.progress.monitor.poll.interval=5000   It works fine when I use it on a MapReduce job - $ hadoop jar /usr/lib/hadoop-0.20-mapreduce/hadoop-2.0.0-mr1-cdh4.