Hive insert overwrite strange behavior

2014-07-15 Thread Azuryy Yu
Hi, I think the following two sql have the same effect. 1) hive -e "insert overwrite local directory 'output' select * from test limit 10;" 2) hive -e "select * from test limit 10;" > output but the second one read HDFS directly only takes two seconds, but the first one submit a MR job, which h

Hive and MR2

2014-05-15 Thread Azuryy Yu
Hi, I am using hive-0.13.0 and hadoop-2.4.0, why I must set 'mapreduce.jobtracker.address' in yarn-site.xml? otherwise, there are exceptions and job failed. And, 'mapreduce.jobtracker.address' can be set to any value. The following messages are gened without set 'mapreduce.jobtracker.address'.

Distinct includes duplicate records

2013-07-19 Thread Azuryy Yu
Hi Dear All, I filed a JIRA here: https://issues.apache.org/jira/browse/HIVE-4891 There is description in detail. Can anybody take a look or reply this mail? Many thanks.