Hi, I set up my clusters on EC2 with Cloudera Whirr. My hadoop.properties file looks like this:

whirr.hardware-id=m1.xlarge
# Ubuntu 10.04 LTS Lucid. See http://alestic.com/
whirr.image-id=eu-west-1/ami-1e34016a
whirr.location-id=eu-west-1
whirr.service-name=hadoop
whirr.cluster-name=myhadoopcluster
whirr.instance-templates=1 jt+nn,5 dn+tt
whirr.provider=ec2
whirr.identity=SECRET
whirr.credential=SECRET
whirr.private-key-file=${sys:user.home}/.ssh/id_rsa
whirr.public-key-file=${sys:user.home}/.ssh/id_rsa.pub
whirr.hadoop-install-runurl=cloudera/cdh/install
whirr.hadoop-configure-runurl=cloudera/cdh/post-configure

The Cloudera default settings are currently fine for me.

I recommend using MySQL as the Hive metastore instead of Derby. I use Amazon RDS to host it because it is simple to set up.

Regards,
Lemig

On 30 August 2011 at 08:02, MIS wrote:

> Hi,
>
> Can somebody point me to production level setup of Hive in EC2. The intent is
> to know the setup best practices being employed.
>
> Thanks.
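
For reference, pointing the Hive metastore at a MySQL database on RDS, as recommended in the reply above, might look roughly like the following hive-site.xml fragment. This is only a sketch, not the poster's actual configuration: the host name, database name, user name and password are hypothetical placeholders, and it assumes the MySQL JDBC driver jar is available on Hive's classpath.

```xml
<!-- hive-site.xml (sketch): MySQL-backed metastore instead of embedded Derby -->
<configuration>
  <property>
    <name>javax.jdo.option.ConnectionURL</name>
    <!-- Placeholder RDS endpoint and database name; replace with your own -->
    <value>jdbc:mysql://YOUR-RDS-ENDPOINT:3306/hive_metastore</value>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionDriverName</name>
    <value>com.mysql.jdbc.Driver</value>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionUserName</name>
    <!-- Placeholder credentials -->
    <value>hive_user</value>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionPassword</name>
    <value>SECRET</value>
  </property>
</configuration>
```

Keeping the metastore outside the cluster in this way means the table definitions survive when the Whirr-launched instances are destroyed and recreated.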