Unsubscribe

2012-10-02 Thread Prasanna Kumar Jalakam
Unsubscribe -- Thanks & Regards, Prasanna.J R & D Networks IMImobile Pvt. Ltd. Plot 770, Rd. 44 Jubilee Hills

Re: Hive and RESTFul with RESTEasy (jax-rs)

2012-10-02 Thread Russell Jurney
Check out HCatalog and Templeton. https://issues.apache.org/jira/browse/HCATALOG-182 It provides a REST interface to Hive job execution, as well as Hive's metadata. It is included in HCatalog 0.5, which is yet to be released. You might check on the HCatalog mailing lists about this. http://incubat

Re: Hive and RESTFul with RESTEasy (jax-rs)

2012-10-02 Thread MiaoMiao
Don't know any besides JDBC or THRIFT. On Tue, Oct 2, 2012 at 11:24 PM, Zebeljan, Nebojsa wrote: > Hi, > I'm very new to Hive and I need to approach how to fire Hive sql queries via > the RESTEasy framework and to stream back the query result as a JSON string > to the client. > > I wonder, if the

Re: best way to load millions of gzip files in hdfs to one table in hive?

2012-10-02 Thread Abhishek
Hi Edward, I am kind of interested in this, for crush to work do we need install any thing?? How can it be used in a cluster. Regards Abhi Sent from my iPhone On Oct 2, 2012, at 5:45 PM, Edward Capriolo wrote: > You may want to use: > > https://github.com/edwardcapriolo/filecrush > > We

Re: best way to load millions of gzip files in hdfs to one table in hive?

2012-10-02 Thread Edward Capriolo
You may want to use: https://github.com/edwardcapriolo/filecrush We use this to deal with pathological cases although the best idea is to avoid big files all together. Edward On Tue, Oct 2, 2012 at 4:16 PM, Alexander Pivovarov wrote: > Options > 1. create table and put files under the table di

unsubscribe

2012-10-02 Thread Nic Chidu

Re: Hive does not run - Typical NoSuchFieldError

2012-10-02 Thread Anthony Ikeda
Okay, thanks Edward! On Tue, Oct 2, 2012 at 12:38 PM, Edward Capriolo wrote: > If your working on datastax/riptano branch you probably should take > this up on one their forums. > > Edward > > On Tue, Oct 2, 2012 at 3:35 PM, Anthony Ikeda > wrote: > > So is the Hive with Casaandra Data Handler

Re: best way to load millions of gzip files in hdfs to one table in hive?

2012-10-02 Thread Alexander Pivovarov
Options 1. create table and put files under the table dir 2. create external table and point it to files dir 3. if files are small then I recomend to create new set of files using simple MR program and specifying number of reduce tasks. Goal is to make files size > hdfs block size (it safes NN me

Re: Hive does not run - Typical NoSuchFieldError

2012-10-02 Thread Edward Capriolo
If your working on datastax/riptano branch you probably should take this up on one their forums. Edward On Tue, Oct 2, 2012 at 3:35 PM, Anthony Ikeda wrote: > So is the Hive with Casaandra Data Handler officially not working? I.e the > riptano git repository branch cassandra-1.0 > > Sent from m

Re: Hive does not run - Typical NoSuchFieldError

2012-10-02 Thread Anthony Ikeda
So is the Hive with Casaandra Data Handler officially not working? I.e the riptano git repository branch cassandra-1.0 Sent from my [6th Gen] iPhone On 02/10/2012, at 12:08, Edward Capriolo wrote: > If you are trying to build the brisk versions of hive with cassandra > support chosing a VM and

Re: Hive does not run - Typical NoSuchFieldError

2012-10-02 Thread Edward Capriolo
If you are trying to build the brisk versions of hive with cassandra support chosing a VM and cdh are not going to help you with this issue. Edward On Tue, Oct 2, 2012 at 2:31 PM, Anthony Ikeda wrote: > Yeah I think the Vm is the next option. We run RedHat, I'll try that first. > > Running on Ma

Re: Hive does not run - Typical NoSuchFieldError

2012-10-02 Thread Anthony Ikeda
Yeah I think the Vm is the next option. We run RedHat, I'll try that first. Running on Mac was just to spike the tech, if running a VM means better compatibility then I guess I'll take that route instead. Thanks Chuck. On Tue, Oct 2, 2012 at 11:19 AM, Connell, Chuck wrote: > Seems easier to cr

RE: Hive does not run - Typical NoSuchFieldError

2012-10-02 Thread Connell, Chuck
Seems easier to create a new VM, install CentOS and CDH4 on it, and you are off and running. This setup runs pretty much perfectly on the first try. I have built 5-6 of them. Why do you have to do it on a Mac? Chuck From: Anthony Ikeda [mailto:anthony.ikeda@gmail.com] Sent: Tuesday, Octob

Re: Hive does not run - Typical NoSuchFieldError

2012-10-02 Thread Anthony Ikeda
Yeah I get this. I've tried the different branches in the GitHub repository (cassandra-0.7, cassandra-1.0, etc) but all seem to yield the same issue. Even the 0.9.0 tar.gz download is exhibiting these issues and I don't think that is actually using cassandra jar files. I might also look at the vers

Re: Hive does not run - Typical NoSuchFieldError

2012-10-02 Thread Edward Capriolo
You are in a heap of trouble. The problem is Cassandra and Hive use different versions of ANTLR and when you get two versions of antlr on a single java classpath, well you get your result. I have talked to a few people about this and the only way to handle this is tools like jarjar that edit class

Re: Hive does not run - Typical NoSuchFieldError

2012-10-02 Thread Anthony Ikeda
Unfortunately not an option. This is an internal application and if I can't get it running locally, then it's no longer a tech option. I know I've had this working in the past but it seems that when the HiveLexer is generated in the ql project, the "type" field definition is not created - each tim

RE: Hive does not run - Typical NoSuchFieldError

2012-10-02 Thread Connell, Chuck
Try the easy way... Cloudera CDH4 running on Centos 5.8. Can install everything on one machine. Chuck From: Anthony Ikeda [mailto:anthony.ikeda@gmail.com] Sent: Tuesday, October 02, 2012 1:23 PM To: user@hive.apache.org Subject: Hive does not run - Typical NoSuchFieldError I've tried diffe

Hive and RESTFul with RESTEasy (jax-rs)

2012-10-02 Thread Zebeljan, Nebojsa
Hi, I'm very new to Hive and I need to approach how to fire Hive sql queries via the RESTEasy framework and to stream back the query result as a JSON string to the client. I wonder, if there is any approach or best practice how I can achieve this with Hive and a RESTFul service. Thanks in adva

RE: File Path and Partition names

2012-10-02 Thread carla.staeben
Yep, dynamic. Let me know if you figure something out. I'd hate to have to go through all of the trouble to etl the data and then create tables on top with the alter table command. Such a waste of time and effort... Carla -Original Message- From: ext Doug Houck [mailto:doug.ho...@tru

Re: File Path and Partition names

2012-10-02 Thread Doug Houck
Hi Carla, I assume you are using dynamic partitioning for this, correct?? Assuming so, I have the same question and am trying to figure it out, and will let you know if I do. If you are using static partitions, you just need to specify the location on the 'alter table' command when the partiti

RE: File Path and Partition names

2012-10-02 Thread carla.staeben
Thanks Bejoy, I was kind of hoping to avoid all of the 'extra' work...it would be nice if hive didn't include the partition name in the path creation...I was hoping that there was a 'set' parameter/config I was missing. Thanks Carla From: ext Bejoy KS [mailto:bejoy...@yahoo.com] Sent: Tuesday,

Re: File Path and Partition names

2012-10-02 Thread Bejoy KS
Hi Carla If you like to have your custom directory structure for your partitions. You can create dirs in hdfs of your choice , load data into them (If from another hive table then you can use 'Insert Overwrite Directory..' To populate an hdfs dir). Now you need to register this dir as a new pa

RE: Percentile calculation

2012-10-02 Thread Mayank Bansal
I have a 11 node hadoop cluster, the map phase runs, the process fails at the reduce phase after 67% competition with the out of java heap space error. Could you please tell me, what further info do you want? -Original Message- From: MiaoMiao [mailto:liy...@gmail.com] Sent: Tuesday, Octo

File Path and Partition names

2012-10-02 Thread carla.staeben
Quick question about using hive to create new hdfs file paths. Generally speaking, we like to keep our data files with a path similar to Dataset/year/month/day/hour I need to create a new table in hive and populate it with data from a different dataset, using a HiveQL query. If I do this: CREA