Re: Partitioning EXTERNAL TABLE without copying or moving files

2011-12-08 Thread Vince Hoang
-09-01’; Matt Tucker Associate eBusiness Analyst Walt Disney Parks and Resorts Online Ph: 407-566-2545 Tie: 8-296-2545 From: Vince Hoang [mailto:vho...@cafepress.com] Sent: Thursday, December 08, 2011 3:47 PM To: user@hive.apache.org<mailto:user@hive.apache.org> Subject: Partitioning EXTERNAL T

Partitioning EXTERNAL TABLE without copying or moving files

2011-12-08 Thread Vince Hoang
Hi, I am running Hive 0.7.0 with Hadoop 0.20.2. I have one HDFS folder full of web server logs dated back several months. Is possible to partition an EXTERNAL TABLE without copying/moving files or altering the layout of the directory? For example, in HDFS, I have: > /logs/log-2011-09-01 > /l

Re: How to load quote-separated fields?

2011-09-24 Thread Vince Hoang
Hi Mark, You could try RegexSerDe to deserialize using regular expression. Here is a good example: http://books.google.com/books?id=Nff49D7vnJcC&lpg=PA391&ots=IicwYn7zOq&dq=ROW%20FORMAT%20SERDE%20input.regex&pg=PA391#v=onepage&q=ROW%20FORMAT%20SERDE%20input.regex&f=false Good luck, Vince From

Selectively include EXTERNAL TABLE source files via regex or symlink

2011-08-12 Thread Vince Hoang
I have a folder full of logs aggregated by hour like so: logs/ > 20110812-07 > 20110812-08 > 20110812-09 > 20110812-10 > ... > 20110812-23 > 20110813-00 > 20110813-01 > 20110813-02 > 20110813-03 > ... > Is there a way to selective include files files into an