Unable to execute hive script on AWS with arguments from java program

2013-09-09 Thread Puneet Khatod
the error is: Unrecognised option -d S3_INPUT_BUCKET=s3://my-input -d S3_OUT_BUCKET=s3://my-output... Please help me get this right. How do I pass arguments to the Hive query using the Java API? Regards, Puneet Khatod | puneet.kha...@tavant.com Techni
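
The quoted error reports the whole string "-d S3_INPUT_BUCKET=... -d S3_OUT_BUCKET=..." as one unrecognised option. That suggests, though the preview does not confirm it, that the flags and values reached the argument parser as a single token instead of separate ones. A minimal sketch of building the arguments as separate tokens follows. `HiveArgs` and `toHiveArgs` are hypothetical names used only for illustration and are not part of any AWS or Hive API. The resulting list would then be handed to whatever step-building call the program already uses.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper (an assumption for illustration, not an AWS or Hive class).
final class HiveArgs {

    // Turns each variable into two separate tokens: "-d" and "NAME=value".
    // Passing "-d NAME=value" as one string is the suspected cause of the error.
    static List<String> toHiveArgs(Map<String, String> variables) {
        List<String> args = new ArrayList<>();
        for (Map.Entry<String, String> entry : variables.entrySet()) {
            args.add("-d");
            args.add(entry.getKey() + "=" + entry.getValue());
        }
        return args;
    }

    public static void main(String[] unused) {
        // LinkedHashMap keeps the insertion order of the variables.
        Map<String, String> vars = new LinkedHashMap<>();
        vars.put("S3_INPUT_BUCKET", "s3://my-input");
        vars.put("S3_OUT_BUCKET", "s3://my-output");

        // Prints: [-d, S3_INPUT_BUCKET=s3://my-input, -d, S3_OUT_BUCKET=s3://my-output]
        System.out.println(toHiveArgs(vars));
    }
}
```

If the step API in use accepts a varargs `String...` parameter, the list can be converted with `toArray(new String[0])` before the call.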

How to validate data type in Hive

2013-08-26 Thread Puneet Khatod
Hi, I have a requirement to validate the data types of the values in my flat file (which is the source for my Hive table). I am unable to find any Hive feature or function that does this. Is there any way to validate the data types of the values in the underlying file? Something like BCP (B

RE: New to hive.

2013-07-17 Thread Puneet Khatod
Hi, There are many online tutorials and blogs that provide quick, get-set-go information. To start with, you can learn Hadoop. For detailed knowledge you will have to go through the e-books mentioned by Lefty. These books are bulky but cover every bit of Hadoop. I recently came across

Need help with percentile calculation

2012-11-21 Thread Puneet Khatod
Hi, I am trying to use the percentile function of Hive but am getting an exception from the Amazon EMR service. I am using version 0.7. Please assist. It is very critical and urgent. Below is the code snippet: CREATE EXTERNAL TABLE IF NOT EXISTS server_d ( ag_date STRING, median_time BIGINT, 95percen

RE: Continuous log analysis requires 'dynamic' partitions, is that possible?

2012-07-24 Thread Puneet Khatod
If you are using Amazon (AWS), you can use 'recover partitions' to enable all top level partitions. This will add required dynamicity. Regards, Puneet Khatod From: Bertrand Dechoux [mailto:decho...@gmail.com] Sent: 24 July 2012 21:15 To: user@hive.apache.org Subject: Continuous lo

Not in clause in hive query

2012-07-18 Thread Puneet Khatod
Hi, I am working on Hive 0.7. I am migrating SQL queries to Hive and facing issues with queries that use the 'not in' clause. Example: select * from customer where cust_id not in (12022,11783); I am getting: FAILED: Parse Error: line 1:38 cannot recognize input near ' cust_id ' 'not' '

Error in Hive execution on cluster : Wrong FS: hdfs

2012-05-31 Thread Puneet Khatod
Hi, I get the error below when I run any query in Hive. My Hive setup is on the master node of my cluster. Hadoop is configured using IP addresses in the configuration XMLs and in the master and slave files, and it is running fine. The error only arises when a Hive query is executed which had