Re: [VOTE] Amend Hive Bylaws + Add HCatalog Submodule

2013-01-31 Thread Namit Jain
+1 and +1 On 1/30/13 6:53 AM, "Gunther Hagleitner" wrote: >+1 and +1 > >Thanks, >Gunther. > > >On Tue, Jan 29, 2013 at 5:18 PM, Edward Capriolo >wrote: > >> Measure 1: +1 >> Measure 2: +1 >> >> On Mon, Jan 28, 2013 at 2:47 PM, Carl Steinbach wrote: >> >> > I am calling a vote on the following t

Re: for each partition

2011-02-09 Thread Namit Jain
You can use dynamic partitioning: insert overwrite table item_view_aggregate partition (date_hour) select iv.sid, count(*), date_hour from item_view iv where (iv.date_hour='2011310116' or date_hour=''' or date_hour='.) group by iv.sid, date_hour; On 2/9/11 5:49 AM, "Cam Bazz" wrote: >We

Re: Query Optimization in Hive

2011-01-31 Thread Namit Jain
Bharath, This would be great. Why don¹t you write up something about how you are planning to proceed ? File a new jira and load some design notes/spec. there. We can definitely sync up. from there. This feature would be very useful to the community - We, at facebook, Would definitely like to us

Re: tons of bugs and problem found

2011-01-31 Thread Namit Jain
On 1/31/11 7:46 AM, "Laurent Laborde" wrote: >On Fri, Jan 28, 2011 at 8:05 AM, Laurent Laborde >wrote: >> On Fri, Jan 28, 2011 at 1:12 AM, Namit Jain wrote: >>> Hi Laurent, >>> >>> 1. Are you saying that _top.sql did not exist in the home d

Re: tons of bugs and problem found

2011-01-27 Thread Namit Jain
Hi Laurent, 1. Are you saying that _top.sql did not exist in the home directory. Or that, _top.sql existed, but hive was not able to read it after loading 2. I don¹t think reserved words are documented somewhere. Can you file a jira for this ? 3. The bad row is printed in the task log. 1. 2

Re: Hive Concurrency Model - does it work?

2011-01-26 Thread Namit Jain
The patch below has been committed. https://issues.apache.org/jira/browse/HIVE-1865 was a follow-up patch which should help concurrency. I have not tried backporting the patch on hive 0.5 or hive0.6, but I don’t think it will work, since the code has changed significantly, and a number of bug f

Re: Distinct in hive

2011-01-25 Thread Namit Jain
Is there skew in data ? You may want to set the parameter: hive.groupby.skewindata: to true. Thanks, -namit From: Guy Doulberg mailto:guy.doulb...@conduit.com>> Reply-To: mailto:user@hive.apache.org>> Date: Tue, 25 Jan 2011 08:25:36 -0800 To: "user@hive.apache.org"

Re: Is there a reason why this simple query would take a very long time?

2011-01-24 Thread Namit Jain
Although there is only 1 reducer, the amount of data to that reducer should be really small: it will have same number of rows as the number of mappers. Can you check how much data is your reducer getting ? Is it reading a long time to read the small data from each mapper ? Thanks, -namit On 1

Re: mergeJoinTree ??

2010-12-23 Thread Namit Jain
> Thanks > Bharath.V > 4th year undergraduate, > IIIT Hyderabad, > > On Fri, Dec 24, 2010 at 12:22 AM, Namit Jain wrote: >> The idea there is to merge the join trees. >> >> For eg. If the query is: >> >> Select * from >> T1 join T2 on T1.key = T2.key >&

Re: mergeJoinTree ??

2010-12-23 Thread Namit Jain
The idea there is to merge the join trees. For eg. If the query is: Select * from T1 join T2 on T1.key = T2.key join T3 on T1.key = T3.key Where ... One join can have 3 inputs: T1, T2 and T3. mergeJoinTree() merges the 2 join operators (T1,T2) and (result of (T1,T2), T3) into a new join operat

RE: Unions causing many scans of input - workaround?

2010-11-07 Thread Namit Jain
Other option would be to create a wrapper script (not use either UDF or UDTF) That script, in any language, can emit any number of output rows per input row. Look at: http://wiki.apache.org/hadoop/Hive/LanguageManual/Transform for details From: Sonal Goyal [so

RE: Does output directory remain in case of map/reduce task failures

2010-11-02 Thread Namit Jain
partitioned and non partitioned tables? From: Namit Jain [mailto:nj...@facebook.com] Sent: Tuesday, November 02, 2010 4:22 PM To: user@hive.apache.org; hive-u...@hadoop.apache.org Subject: RE: Does output directory remain in case of map/reduce task failures Hive

RE: Does output directory remain in case of map/reduce task failures

2010-11-02 Thread Namit Jain
Hive writes to a temporary directory first, and if the UDF fails, the temp. directory is removed. The expected final directory is not touched. -namit From: Pradeep Kamath [prade...@yahoo-inc.com] Sent: Tuesday, November 02, 2010 3:26 PM To: hive-u...@hadoop.apa

RE: [VOTE] Bylaws for Apache Hive Project

2010-10-22 Thread Namit Jain
+1 From: Ashish Thusoo [mailto:athu...@facebook.com] Sent: Friday, October 22, 2010 2:51 PM To: user@hive.apache.org Subject: [VOTE] Bylaws for Apache Hive Project Hi Folks, I propose that we adopt the following bylaws for the Apache Hive Project https://cwiki.apache.org/HIVE/bylaws.html Thes

RE: [VOTE] Bylaws for Apache Hive Project

2010-10-22 Thread Namit Jain
))) . All Hive users, contributors, committers are encouraged to vote, though only PMC votes will be binding From: Ashish Thusoo [mailto:athu...@facebook.com] Sent: Friday, October 22, 2010 2:51 PM To: user@hive.apache.org Subject: [VOTE] Bylaws for Apache Hive Project Hi Folks, I propose

RE: Hive 0.6.0 Release Update

2010-10-07 Thread Namit Jain
It is not a blocker for 0.6. -Original Message- From: Szymon Gwóźdź [mailto:sgwo...@gadu-gadu.pl] Sent: Thursday, October 07, 2010 2:08 AM To: hive-u...@hadoop.apache.org Subject: Re: Hive 0.6.0 Release Update Hi! What about https://issues.apache.org/jira/browse/HIVE-1508 ? It is mark