Hello,
hive> FROM (
> FROM subset
> MAP subset.patient_mrn, subset.encounter_date
> USING 'q1.txt'
> AS mp1, mp2
> CLUSTER BY mp1) map_output
> INSERT OVERWRITE TABLE t3
> REDUCE map_output.mp1
> USING 'retrieve'
> AS reducef1;
Total MapReduce jobs = 1
Launching Job 1 out of 1
Number of reduce tasks not specified. Estimated from input data size: 1
In order to change the average load for a reducer (in bytes):
set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
set mapred.reduce.tasks=<number>
Starting Job = job_201112281627_0100, Tracking URL =
http://localhost:50030/jobdetails.jsp?jobid=job_201112281627_0100
Kill Command = /home/hadoop/hadoop-0.20.2-cdh3u2//bin/hadoop job
-Dmapred.job.tracker=localhost:54311 -kill job_201112281627_0100
2011-12-31 04:34:52,208 Stage-1 map = 0%, reduce = 0%
2011-12-31 04:35:52,939 Stage-1 map = 0%, reduce = 0%
2011-12-31 04:36:34,097 Stage-1 map = 100%, reduce = 100%
Ended Job = job_201112281627_0100 with errors
FAILED: Execution Error, return code 2 from
org.apache.hadoop.hive.ql.exec.MapRedTask
hive>
In 'q1.txt' I have written a Hive query that returns 2 columns,
and in 'retrieve' I have written Java code that takes 2 inputs and
displays 1 column.
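For context, here is a minimal sketch of the kind of reducer meant by 'retrieve'. It assumes the script reads tab-separated rows from standard input and writes rows to standard output, which is how Hive passes data to MAP/REDUCE scripts. The class name `Retrieve`, the helper `reduceLine`, and the choice to emit the first field are illustrative assumptions, not the actual code.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;

// Hypothetical stand-in for the 'retrieve' program: reads rows from stdin,
// one row per line with tab-separated fields, and prints one column per row.
class Retrieve {

    // Assumption: the single output column is simply the first input field.
    static String reduceLine(String line) {
        // Limit -1 keeps trailing empty fields instead of dropping them.
        String[] fields = line.split("\t", -1);
        return fields[0];
    }

    public static void main(String[] args) throws IOException {
        BufferedReader in = new BufferedReader(new InputStreamReader(System.in));
        String line;
        while ((line = in.readLine()) != null) {
            System.out.println(reduceLine(line));
        }
    }
}
```

A program like this would normally be invoked through a command that the task nodes can actually run (for example `java` with the class on the classpath), not by a bare name.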
Is there any mistake in the query?
Please suggest a solution.
--
Regards,
Bhavesh Shah