RE: Setting | Verifying | Hive Query Parameters from Java

2014-03-06 Thread Garg, Rinku
Hi Gordon, Thanks a lot for your reply. The properties mentioned in mail trail are just an example. Actual properties that we want to set are as given below: set yarn.nodemanager.resource.memory-mb=16384; set mapreduce.map.memory.mb=2048; set mapreduce.reduce.memory.mb=2048; set mapreduce.map.

Re: Setting | Verifying | Hive Query Parameters from Java

2014-03-06 Thread Gordon Wang
The following two properties are for HiveServer2. I don't think you can change them in your JDBC session. I am wondering why you need to change them in your JDBC connection. props.setProperty("hive.server2.async.exec.threads", "50"); props.setProperty("hive.server2.thrift.max.worker.threads", "500"); You can s

Re: Partitions in Hive

2014-03-06 Thread Nitin Pawar
Partitioning in Hive is done on the column value, not on a sub-portion of the column value. If you want to separate data based on the first character, then create another column to store that value. On Thu, Mar 6, 2014 at 11:42 PM, nagarjuna kanamarlapudi < nagarjuna.kanamarlap...@gmail.com> wrote
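
A minimal sketch of the "another column" suggestion above, written as a Java helper that only builds HiveQL strings. The class name, the table names, the column names (col1, col2, col3, first_letter) and the STRING types are all hypothetical. It assumes dynamic partitioning is available and uses substr(col1, 1, 1) to derive the partition value.

```java
import java.util.Arrays;
import java.util.List;

class FirstLetterPartitioning {

    /**
     * Builds HiveQL that copies a three-column source table into a target
     * table partitioned by a derived column holding the first character of col1.
     * Table and column names here are illustrative assumptions.
     */
    static List<String> statements(String source, String target) {
        return Arrays.asList(
            // The derived value lives in its own partition column.
            "CREATE TABLE " + target
                + " (col1 STRING, col2 STRING, col3 STRING)"
                + " PARTITIONED BY (first_letter STRING)",
            // Let Hive pick the partition from the last SELECT expression.
            "SET hive.exec.dynamic.partition=true",
            "SET hive.exec.dynamic.partition.mode=nonstrict",
            "INSERT OVERWRITE TABLE " + target
                + " PARTITION (first_letter)"
                + " SELECT col1, col2, col3, substr(col1, 1, 1) FROM " + source);
    }
}
```

Each returned string would be run in order, for example through a JDBC Statement or pasted into a Hive session.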

Partitions in Hive

2014-03-06 Thread nagarjuna kanamarlapudi
Hi, I have a table with 3 columns in Hive. I want that table to be partitioned based on the first letter of column 1. How do we define such a partition condition in Hive? Regards, Nagarjuna K

RE: Setting | Verifying | Hive Query Parameters from Java

2014-03-06 Thread java8964
If you want to set some Hive properties, just run the command as-is over your JDBC connection. Any command issued through Hive JDBC is sent to the server exactly as if you had run "set hive.server2.async.exec.threads=50;" in a Hive session. Run the command "set hive.server2.async.exec.threads=50;" as a SQL
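
A minimal sketch of the suggestion above, assuming a java.sql.Connection that is already open against HiveServer2 (connection setup is not shown). The class and method names are hypothetical, and the property names in the usage note are the ones quoted in this thread. Two points are assumptions to check against your Hive version: that the driver wants the command without the trailing semicolon, and that a bare "set <key>" returns the current value as a one-row result. As noted elsewhere in the thread, server-level hive.server2.* properties probably cannot be changed from a session.

```java
import java.sql.Connection;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

class HiveSessionSettings {

    /** Builds the text of a session-level "set" command (no trailing semicolon). */
    static String toSetStatement(String key, String value) {
        return "set " + key + "=" + value;
    }

    /** One "set" command per entry, in the map's iteration order. */
    static List<String> toSetStatements(Map<String, String> settings) {
        List<String> out = new ArrayList<>();
        for (Map.Entry<String, String> e : settings.entrySet()) {
            out.add(toSetStatement(e.getKey(), e.getValue()));
        }
        return out;
    }

    /** Sends each "set" command over the given connection as an ordinary statement. */
    static void apply(Connection conn, Map<String, String> settings) throws SQLException {
        try (Statement stmt = conn.createStatement()) {
            for (String sql : toSetStatements(settings)) {
                stmt.execute(sql);
            }
        }
    }

    /**
     * Assumption: "set <key>" yields a result row describing the current value.
     * Returns the first column of that row, or null if nothing comes back.
     */
    static String readBack(Connection conn, String key) throws SQLException {
        try (Statement stmt = conn.createStatement()) {
            if (!stmt.execute("set " + key)) {
                return null;
            }
            try (ResultSet rs = stmt.getResultSet()) {
                return rs.next() ? rs.getString(1) : null;
            }
        }
    }
}
```

For example, a map with mapreduce.map.memory.mb and mapreduce.reduce.memory.mb both set to 2048 could be passed to apply, and readBack could then be called per key before running the query on the same connection.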

Re: Automatic replacement of partitions in hive

2014-03-06 Thread Nitin Pawar
Thanks for clarifying that Bryan On Thu, Mar 6, 2014 at 7:55 PM, Bryan Jeffrey wrote: > Nitin, > > #3 will not work. msck repair table does not remove partitions if the > files associated with the partition do not exist. We have successfully > applied #2 in our application. > > Regards, > > Br

Re: Automatic replacement of partitions in hive

2014-03-06 Thread Bryan Jeffrey
Nitin, #3 will not work. msck repair table does not remove partitions if the files associated with the partition do not exist. We have successfully applied #2 in our application. Regards, Bryan Jeffrey On Thu, Mar 6, 2014 at 5:37 AM, Nitin Pawar wrote: > There is no concept called automati
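
Since msck repair table does not remove partitions, one possible approach is to drop the expired partition explicitly. The following is only a sketch under assumptions and is not necessarily the option #2 referred to above: the class name, the table name, the partition column and the retention window are all hypothetical, and the partition value is assumed to be an ISO date string (yyyy-MM-dd).

```java
import java.time.LocalDate;
import java.util.Arrays;
import java.util.List;

class PartitionRotation {

    /**
     * Builds HiveQL that drops the partition falling out of the retention
     * window and registers the partition for the newly arrived day.
     * keepDays is the number of days between the new and the expired partition.
     */
    static List<String> rotate(String table, String partCol, LocalDate newDay, int keepDays) {
        LocalDate expired = newDay.minusDays(keepDays);
        return Arrays.asList(
            // LocalDate.toString() produces the ISO form, e.g. 2014-03-06.
            "ALTER TABLE " + table + " DROP IF EXISTS PARTITION ("
                + partCol + "='" + expired + "')",
            "ALTER TABLE " + table + " ADD IF NOT EXISTS PARTITION ("
                + partCol + "='" + newDay + "')");
    }
}
```

The new day's data still has to be loaded into the added partition separately; the sketch only covers the partition metadata.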

Hive unwanted location directory

2014-03-06 Thread Valluri, Sathish
We are creating an external table in Hive, and if the location path is not present in HDFS, say /testdata (as shown below), Hive creates a dummy '/testdata' folder. Is there any option in Hive, or any other way, to stop it from creating dummy directories when the location folder does not exist? Our use case nee

Re: Data Modeling Tool

2014-03-06 Thread Joseph D Antoni
A data architect friend said the latest release of CA Erwin can handle Hive, but it doesn't support Postgres directly. Thanks Joey D'Antoni On Wednesday, March 5, 2014 9:59 PM, Ronak Bhatt wrote: Hello Hive Experts Is there any data modeling tool that you can suggest that can work with Hi

RE: Setting | Verifying | Hive Query Parameters from Java

2014-03-06 Thread Garg, Rinku
Hi All, Can anybody help me with the mail trail below? Thanks Rinku Garg From: Garg, Rinku Sent: Tuesday, March 04, 2014 5:14 PM To: user@hive.apache.org Subject: Setting | Verifying | Hive Query Parameters from Java Hi All, We have installed CDH4.2.0 and hive-0.10.0-cdh4.2.0. Both are working as d

Re: Automatic replacement of partitions in hive

2014-03-06 Thread Nitin Pawar
There is no concept called automatic. Please wait for expert Hive gurus to reply before using any of my suggestions. A few options I can think of are: 1) Insert overwrite table with dynamic partitions enabled and restricting the partition column values for the date range you want. Cost of this

Automatic replacement of partitions in hive

2014-03-06 Thread Kasi Subrahmanyam
Hi, I have a table in Hive which holds three months of data. I have partitioned the data and I got 90 partitions. Now when I get the new data the next day, I want to replace the partition that is 1 week old with the new one automatically. Can this partitioning and replacement be done using Sqoop at the s

Re: Hive append

2014-03-06 Thread Nitin Pawar
You may want to look at partitioned tables and load data into partitions. To me that seems like the easiest way. If you do not have a defined partition column in your data, then another approach is to load the data into a temporary staging table and from there load it into the partitioned table. In this approa

Re: Hive append

2014-03-06 Thread Raj hadoop
Hi Nitin, existing records should remain same and the new records should get inserted into the table On Thu, Mar 6, 2014 at 2:11 PM, Nitin Pawar wrote: > are you talking about adding new records to tables or updating records in > already existing table? > > > On Thu, Mar 6, 2014 at 1:59 PM, Ra

Re: Hive append

2014-03-06 Thread Nitin Pawar
are you talking about adding new records to tables or updating records in already existing table? On Thu, Mar 6, 2014 at 1:59 PM, Raj hadoop wrote: > Query in HIVE > > > > I tried merge kind of operation in Hive to retain the existing records and > append the new records instead of dropping the

Hive append

2014-03-06 Thread Raj hadoop
Query in Hive: I tried a merge-style operation in Hive to retain the existing records and append the new records instead of dropping the table and populating it again. If anyone can help with another approach, or with how to perform the merge operation, it will be great he