MapJoin error: .hashtable file not found

2012-11-23 Thread Vivek Mishra
Hi, I am trying to run a MapJoin query and am getting the error below. I can see that the file is not in the specified directory. I did some debugging but had no luck. Here is the error: nOperator:Load back 1 hashtable file from tmp file uri:c:/hadoop/hdfs/mapred/local/taskTracker/di

Re: hive query not running in cron job

2012-11-23 Thread wd
This problem was caused by some environment variables not being set when the script is run by crontab. It's a common error when writing shell scripts. :)

RE: Effecient partitions usage in join

2012-11-23 Thread Dima Datsenko
Thanks guys, it's just what I was looking for. From: Dean Wampler [mailto:dean.wamp...@thinkbiganalytics.com] Sent: Friday, 23 November 2012 15:41 To: user@hive.apache.org Cc: Dima Datsenko Subject: Re: Effecient partitions usage in join A quick suggestion. If the format never changes, you might as wel

Re: Effecient partitions usage in join

2012-11-23 Thread Dean Wampler
A quick suggestion. If the format never changes, you might as well make the string and SimpleDateFormat object static to eliminate the overhead of creating them in every call to evaluate. On Fri, Nov 23, 2012 at 4:50 AM, Bennie Schut wrote: > Well this is the udf: > > package com.e

RE: Effecient partitions usage in join

2012-11-23 Thread Bennie Schut
Well this is the udf: package com.ebuddy.dwhhive.udf; import org.apache.hadoop.hive.ql.exec.Description; import org.apache.hadoop.hive.ql.exec.UDF; import org.apache.hadoop.hive.ql.udf.UDFType; import org.apache.hadoop.io.Text; import java.text.SimpleDateFormat; import java.util.Calendar; @Desc

Re: hive query not running in cron job

2012-11-23 Thread Chunky Gupta
Thanks, it's working after adding this line :) Chunky. On Fri, Nov 23, 2012 at 11:24 AM, wd wrote: > Add the following line before your crontab config > > source ~/.bashrc > > > > On Thu, Nov 22, 2012 at 5:59 PM, Chunky Gupta wrote: > >> Hi, >> I have a python script :- >> >> ---c
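For anyone hitting the same cron issue: a minimal sketch of a guard a Python script could run before calling Hive, so that a missing environment fails loudly instead of silently. The function name `missing_env` and the variable names in the usage note are assumptions for illustration, not taken from the original script.

```python
import os


def missing_env(required, env=None):
    """Return the names in `required` that are unset or empty.

    `env` defaults to the current process environment. Under cron this
    is usually much smaller than an interactive login shell's.
    """
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name)]
```

A script might call `missing_env(["JAVA_HOME", "HADOOP_HOME"])` at startup and exit with an error message if the list is non-empty. Which variables are actually needed depends on the installation.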

RE: pblm in joining two tables in hive.....

2012-11-23 Thread Manjinder Singh01
Thank you. ☺ From: Bejoy KS [mailto:bejoy...@yahoo.com] Sent: Friday, November 23, 2012 3:20 PM To: user@hive.apache.org Subject: Re: pblm in joining two tables in hive. Hi Do a Left outer join A to B and do a null check on B's columns. SELECT A.* FROM A LEFT OUTER JOIN B ON (...) WHERE B.cl

Re: pblm in joining two tables in hive.....

2012-11-23 Thread Bejoy KS
Hi Do a Left outer join A to B and do a null check on B's columns. SELECT A.* FROM A LEFT OUTER JOIN B ON (...) WHERE B.clmn IS NULL. Regards Bejoy KS Sent from handheld, please excuse typos. -Original Message- From: Manjinder Singh01 Date: Fri, 23 Nov 2012 09:03:06 To: user@hive.ap
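The left outer join plus null check suggested above can be tried out on a small data set. This is a minimal sketch that runs the same query shape against an in-memory SQLite database rather than Hive. The table layout (a single `id` column used as the join key) and the function name `rows_only_in_a` are assumptions for illustration.

```python
import sqlite3


def rows_only_in_a(a_keys, b_keys):
    """Return the keys of table a that have no match in table b."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE a (id INTEGER)")
    conn.execute("CREATE TABLE b (id INTEGER)")
    conn.executemany("INSERT INTO a VALUES (?)", [(k,) for k in a_keys])
    conn.executemany("INSERT INTO b VALUES (?)", [(k,) for k in b_keys])
    # Rows of a without a partner in b come back with NULL in b's
    # columns, so the IS NULL filter keeps exactly those rows.
    rows = conn.execute(
        "SELECT a.id FROM a LEFT OUTER JOIN b ON (a.id = b.id) "
        "WHERE b.id IS NULL ORDER BY a.id"
    ).fetchall()
    conn.close()
    return [r[0] for r in rows]
```

The null check should be on a column of B that is never NULL in real rows (the join key is the usual choice). Otherwise genuine matches with a NULL value would be reported as missing.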

Re: Creating Indexes again

2012-11-23 Thread Nitin Pawar
The CDH forum might be a better place to ask this. I have never used CDH. On Fri, Nov 23, 2012 at 2:28 PM, Peter Marron <peter.mar...@trilliumsoftware.com> wrote: > Hi Nitin, > > Can I set these parameters through the CDH management interface? > > If not then what file do they need to

pblm in joining two tables in hive.....

2012-11-23 Thread Manjinder Singh01
Hi, In Hive I have two tables: the first is 'a' and the second is 'b'. Now I want to display the data of table 'a' which is not present in table 'b'. Thanks, Manjinder

RE: Creating Indexes again

2012-11-23 Thread Peter Marron
Hi Nitin, Can I set these parameters through the CDH management interface? If not then what file do they need to be set in to make sure that CDH picks them up? Peter Marron Trillium Software UK Limited Tel : +44 (0) 118 940 7609 Fax : +44 (0) 118 940 7699 E: peter.mar...@trilliumsoftware.com

Re: Creating Indexes again

2012-11-23 Thread Nitin Pawar
Try increasing ulimit on your Hadoop cluster, and also increase the memory for both map and reduce tasks by setting these in Hive: set mapred.job.map.memory.mb=6000; set mapred.job.reduce.memory.mb=4000; You can change the values based on the Hadoop cluster you have set up. On Fri, Nov 23, 2012 at 2:17

Creating Indexes again

2012-11-23 Thread Peter Marron
Hi, I'm trying to create indexes in Hive, and I've switched to using CDH-4. The creation of the index is failing and it's pretty obvious that the reducers are running out of heap space. When I use the web interface for the "Hadoop reduce task list" I can find this entry: Error: Java heap space Er