To compute moving averages, you should implement a custom reducer instead
of doing a big join. That will work *much* faster.
Also, Hive already has date_add(). Why did you have to implement your own?
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF#LanguageManualUDF-DateFunction
HI Folks,
i got a problem in my job tracker Url, its not Showing the actual no of DN
present in Cluster.
any suggestion wats wrong with this,
regards
Vikas Srivastava
I wrote, separately, a Hadoop job to calculate running averages of about
2000 stock tickers over a 180 day period as well as a Hive script which
performs equivalent functionality. I have been using Amazon Elastic
MapReduce as my platform for running these jobs. I have been trying for a
while to
Hi,
Downloaded and installed Hadoop 0.20.2-cdh3u2 with Hive. Hive is running from
outside the cluster. Am able to create a table and load the data and run:
select * from DA where priority='ERROR';
with no issues.
But whenever the following command is run:
select count(*) from DA; // this requir
Hi all,
I'm having a problem, where I'm trying to insert into a table which
has ROW FORMAT SERDE
'org.apache.hadoop.hive.serde2.lazybinary.LazyBinarySerDe', and is
STORED AS RCFILE. The exception:
java.lang.UnsupportedOperationException: Currently the writer can only
accept BytesRefArrayWritable
Appan,
If you have a single oozie-hive action running your hive queries script,
oozie calls hive once to run that script. Oozie does not do any hive script
partitioning.
Thanks.
Alejandro
On Sun, Jan 22, 2012 at 4:23 PM, Appan Thirumaligai
wrote:
> I'm running a bunch of hive queries (20+ and