Re: Tez session after closing CLI

2014-12-08 Thread Gopal V
On 12/8/14, 10:09 PM, Fabio wrote: Hi everyone, when running Hive on Tez, a Tez session is alive within the Hive CLI until I leave the CLI. So if I run on the terminal something like "hive -f query.sql", once the query is completed the Tez session is closed. Is there a way to run a query in this

Specify encoding for columns in parquet

2014-12-08 Thread Liu, Jun A
Hi everyone I've been searching this for a while and haven't find an answer for it. Does anyone know that if I can explicitly specify the encoding algorithm to use for individual columns of a Hive table stored as parquet file format. For example, is it possible to choose RLE for column1, dictiona

Tez session after closing CLI

2014-12-08 Thread Fabio
Hi everyone, when running Hive on Tez, a Tez session is alive within the Hive CLI until I leave the CLI. So if I run on the terminal something like "hive -f query.sql", once the query is completed the Tez session is closed. Is there a way to run a query in this way (let's say from the linux te

Using xPATH and Hive SQL to access XML data, but xPath a problem

2014-12-08 Thread David Novogrodsky
I created a Hive table using one column. Each row contains one XML record. Here is the script I used to create this first table: CREATE EXTERNAL TABLE xml_event_table ( xmlevent string) STORED AS TEXTFILE LOCATION “/user/cloudera/vector/events”; Here is a sample XML in one row of

Hive returns different results with/without LZO index when hive.hadoop.supports.splittable.combineinputformat=true

2014-12-08 Thread Nathalie Blais
Hello, We are experiencing this old issue in our current installation: https://issues.apache.org/jira/browse/MAPREDUCE-5537 All our data is LZO compressed and indexed; the case is 100% reproducible on our CDH 5.2.0 cluster (using MR2 and Yarn). Do you know if we might be missing a patch or if