Re: Building out Hive in EC2/S3 versus dedicated servers

2011-11-22 Thread Guy Bayes
Apache Whirr is also an option for building a Hadoop cluster on EC2. It allows a more cloud-neutral approach and eases the pain of bringing the cluster in-house later if you need to: http://whirr.apache.org/ Guy On Tue, Nov 22, 2011 at 12:47 PM, Mark Grover wrote: > Here is another article that might be

Re: Building out Hive in EC2/S3 versus dedicated servers

2011-11-22 Thread Mark Grover
Here is another article that might be insightful for you: http://www.cloudera.com/blog/2011/06/migrating-from-elastic-mapreduce-to-a-cloudera%E2%80%99s-distribution-including-apache-hadoop-cluster/?s-distribution-including-apache-hadoop-cluster/ Sam raised some valid points and going with Amazon d

Re: Building out Hive in EC2/S3 versus dedicated servers

2011-11-22 Thread Sam Wilson
We recently adopted Hadoop and Hive for doing some significant data processing. We went the Amazon route. My own $.02 is as follows: If you are already incredibly experienced with Hadoop and Hive and have someone on staff who has previously built a cluster at least as big as the one you are pr

Building out Hive in EC2/S3 versus dedicated servers

2011-11-22 Thread Loren Siebert
My colleague has a Heroku-based startup and they are just getting started with Hadoop and Hive. They’re evaluating running Hive in EC2/S3 versus buying a handful of boxes and installing CDH. One nice (albeit dated) analysis on this question is here, but I’m curious if anyone here has a differen

Re: Converting Array to a String

2011-11-22 Thread Mark Grover
Matt and Miguel, You may also be able to dynamically cast the column to string and then do your replace instead of altering the metadata associated with the table. Mark - Original Message - From: "Matt Tucker" To: user@hive.apache.org Sent: Tuesday, November 22, 2011 2:12:29 PM Subject:

RE: Converting Array to a String

2011-11-22 Thread Tucker, Matt
Thanks Miguel! That did the trick. Now I just need to sort the input to collect_set(), so I can 'GROUP BY' properly. Matt Tucker From: Miguel Cabero [mailto:miguel.cab...@gmail.com] Sent: Friday, November 18, 2011 7:38 PM To: user@hive.apache.org Subject: Re: Converting Array to a String Hi,

Re:

2011-11-22 Thread shashwat shriparv
I have these versions: Hive 0.7.1, HBase 0.90.4, and Hadoop 0.20.203.0rc1. I have configured Hive, Hadoop, and HBase separately, and I can create tables in HBase and Hive (HDFS). I need to integrate Hive with HBase; I have tried, but it is not working. Can you please provide me some reference o

[no subject]

2011-11-22 Thread Denis Kreis
Hi list, I am new to hive and have encountered a problem. Settings in my conf/hive-site.xml file seem to have no effect. I've tried to set the HIVE_CONF_DIR variable, but this did not help. Any Ideas? Denis

Re: Mysql metastore configuration error.

2011-11-22 Thread Sriram Krishnan
As far as I can tell, there are two things going on here: 1. You are trying to access Hive via JDBC – if so, you are not going to be able to connect to the Hive DB the way you are trying to. If you do want to access Hive via JDBC, follow instructions here: https://cwiki.apache.org/confluence

Re: Capacity Planning using Dfsadmin command

2011-11-22 Thread Mark Grover
Hi Abhishek, Although someone on this mailing list might know the answer to your question, I believe you'd be better off asking this question on the Hadoop mailing list. Check out: http://hadoop.apache.org/common/mailing_lists.html#Users Mark - Original Message - From: "Abhishek Pratap S

Capacity Planning using Dfsadmin command

2011-11-22 Thread Abhishek Pratap Singh
Hi, I have a question about Hadoop that is very important for capacity planning and requesting new hardware. The Hadoop command *hadoop dfsadmin -report* shows the DFS usage and DFS remaining. I have checked that the DFS usage does not match the size of *hadoop dfs -dus* on root or user dir

Re: FW: Mysql metastore configuration error.

2011-11-22 Thread Jasper Knulst
Hi Aditya, Do you have a mysql-connector jar in your classpath for the Hadoop jobtracker/tasktrackers? Gr Jasper 2011/11/22 Aditya Singh30 > Hi, > > I tried some more things : > > I created a new table with same structure as table abcd as follows: > > hi

FW: Mysql metastore configuration error.

2011-11-22 Thread Aditya Singh30
Hi, I tried some more things : I created a new table with same structure as table abcd as follows: hive> create table abcdfinal(ab int, cd string) row format delimited fields terminated by '#' stored as rcfile; and then loaded the data into this new table by selecting * from old

Re: Mysql metastore configuration error.

2011-11-22 Thread Alexander C.H. Lorenz
Check your /etc/my.cnf for "old-passwords = 1". If it is set to "1", change it to "0" and restart MySQL. Afterwards you have to set the password for your user again. best, Alex On Tue, Nov 22, 2011 at 9:32 AM, Aditya Singh30 wrote: > I checked the TBLS table. It contains an entry for abcd. > > About Hadoop only jo

RE: Mysql metastore configuration error.

2011-11-22 Thread Aditya Singh30
I checked the TBLS table. It contains an entry for abcd. About Hadoop only jobs. Yes I ran examples provided with Hadoop which don't use hive at all. They ran fine, I checked their output also. I tried executing "select ab from abcd" on hive CLI and checked the job tracker UI. There were 2 faile