Re: Performance problems with Hive script

2012-01-23 Thread Igor Tatarinov
To compute moving averages, you should implement a custom reducer instead of doing a big join. That will work *much* faster. Also, Hive already has date_add(). Why did you have to implement your own? https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF#LanguageManualUDF-DateFunction

JobTracker url shwoing less no of nodes available

2012-01-23 Thread hadoop hive
HI Folks, i got a problem in my job tracker Url, its not Showing the actual no of DN present in Cluster. any suggestion wats wrong with this, regards Vikas Srivastava

Performance problems with Hive script

2012-01-23 Thread Benjamin Poserow
I wrote, separately, a Hadoop job to calculate running averages of about 2000 stock tickers over a 180 day period as well as a Hive script which performs equivalent functionality. I have been using Amazon Elastic MapReduce as my platform for running these jobs. I have been trying for a while to

HIVE org.apache.hadoop.mapred.Task: Failed to contact the tasktracker

2012-01-23 Thread Gaurav Khanna
Hi, Downloaded and installed Hadoop 0.20.2-cdh3u2 with Hive. Hive is running from outside the cluster. Am able to create a table and load the data and run: select * from DA where priority='ERROR'; with no issues. But whenever the following command is run: select count(*) from DA; // this requir

RCFile and LazyBinarySerDe

2012-01-23 Thread Philip Tromans
Hi all, I'm having a problem, where I'm trying to insert into a table which has ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.lazybinary.LazyBinarySerDe', and is STORED AS RCFILE. The exception: java.lang.UnsupportedOperationException: Currently the writer can only accept BytesRefArrayWritable

Re: oozie / hive question

2012-01-23 Thread Alejandro Abdelnur
Appan, If you have a single oozie-hive action running your hive queries script, oozie calls hive once to run that script. Oozie does not do any hive script partitioning. Thanks. Alejandro On Sun, Jan 22, 2012 at 4:23 PM, Appan Thirumaligai wrote: > I'm running a bunch of hive queries (20+ and