RE: Regex and serde with hive

2011-12-23 Thread Raghunath, Ranjith
Thanks Mark, Loren, and Vijay for helping out with this. I did try the serde configuration and it worked well. I did come across the performance implications as I increased the volume so I went ahead and cleansed the data prior to inserting it. Thanks again. Thank you, Ranjith -Original

Exception related to adding a jar in Hive

2011-12-23 Thread Mark Grover
Hi all, I would really appreciate your thoughts and comments on this. I am using Hive 0.7.1 and Hadoop 0.20.203 on Amazon Elastic MapReduce. My .hiverc file looks like: add jar /home/hadoop/hive/lib/hive-contrib-0.7.1.jar add jar my_dir/my_jar.jar However, with the "add jar my_dir/my_jar.jar" in

Cannot recognize input near..

2011-12-23 Thread Periya.Data
Hi, I am trying to run a simple query in my JDBC-Client java code: = // create mon_tbl - Monthly aggregated data String mon_tblName = "month_tbl"; sql = "DROP TABLE IF EXISTS " + mon_tblName; res = stmt.executeQuery(sql);

Re: Efficient ways to parse xml from hive column(for selection/filters based on xml node values)

2011-12-23 Thread Mark Grover
You might want to take a look at this: https://cwiki.apache.org/Hive/languagemanual-xpathudf.html Mark Grover, Business Intelligence Analyst OANDA Corporation www: oanda.com www: fxtrade.com e: mgro...@oanda.com "Best Trading Platform" - World Finance's Forex Awards 2009. "The One to Watch"

Efficient ways to parse xml from hive column(for selection/filters based on xml node values)

2011-12-23 Thread ravikumar visweswara
Hello All, One of my hive columns has text data in xml format. What are all the efficient ways to parse the xml and query based on certain node values. Biz User select/filter Query requirements are based 6 or 7 nodes in xml. Is there any built-in support or supporting libraries for this in HIVE?

Re: Regex and serde with hive

2011-12-23 Thread Mark Grover
Hi Ranjith, Like Loren, I don't think the regex you are using is correct. If you use a create table command like the following, it should work: create external table my_table(a STRING, b STRING, c STRING) ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe' WITH SERDEPROPERTIES ( "