Thanks Mark, Loren, and Vijay for helping out with this. I did try the serde
configuration and it worked well. I did come across the performance
implications as I increased the volume so I went ahead and cleansed the data
prior to inserting it. Thanks again.
Thank you,
Ranjith
-Original
Hi all,
I would really appreciate your thoughts and comments on this.
I am using Hive 0.7.1 and Hadoop 0.20.203 on Amazon Elastic MapReduce.
My .hiverc file looks like:
add jar /home/hadoop/hive/lib/hive-contrib-0.7.1.jar
add jar my_dir/my_jar.jar
However, with the "add jar my_dir/my_jar.jar" in
Hi,
I am trying to run a simple query in my JDBC-Client java code:
=
// create mon_tbl - Monthly aggregated data
String mon_tblName = "month_tbl";
sql = "DROP TABLE IF EXISTS " + mon_tblName;
res = stmt.executeQuery(sql);
You might want to take a look at this:
https://cwiki.apache.org/Hive/languagemanual-xpathudf.html
Mark Grover, Business Intelligence Analyst
OANDA Corporation
www: oanda.com www: fxtrade.com
e: mgro...@oanda.com
"Best Trading Platform" - World Finance's Forex Awards 2009.
"The One to Watch"
Hello All,
One of my hive columns has text data in xml format. What are all the
efficient ways to parse the xml and query based on certain node values. Biz
User select/filter Query requirements are based 6 or 7 nodes in xml. Is
there any built-in support or supporting libraries for this in HIVE?
Hi Ranjith,
Like Loren, I don't think the regex you are using is correct.
If you use a create table command like the following, it should work:
create external table
my_table(a STRING, b STRING, c STRING)
ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
WITH SERDEPROPERTIES (
"