RE: Amazon EMR Best Practices for Hive metastore

2012-03-06 Thread Steven Wong
[mailto:swil...@monetate.com]
Sent: Tuesday, March 06, 2012 7:15 PM
To: user@hive.apache.org
Subject: Re: Amazon EMR Best Practices for Hive metastore

We also do #4. Initially we had lots of conversations about all the other options and we should do this or that... Ultimately we focused on just going live

Re: Amazon EMR Best Practices for Hive metastore

2012-03-06 Thread Sam Wilson
We also do #4. Initially we had lots of conversations about all the other options and we should do this or that... Ultimately we focused on just going live as quickly as possible and getting more involved in the setup later. Since then the only thing we've needed to do is hack a few of the basel

RE: Amazon EMR Best Practices for Hive metastore

2012-03-06 Thread Jeff Sternberg
Mark, We do 4), basically. We have a simple hive script that does all the "create external table" statements, and we run that script as step 1 of the EMR jobs we spin up. Then our "real" processing takes over in step 2 and beyond. We're only working with about 50 tables, so it's pretty manageab
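
For anyone wanting to try the approach Jeff describes (one script holding all the "create external table" statements, run as step 1 of the job flow), here is a rough sketch of how such a script might be generated. It is an illustration only, not Jeff's actual setup: the table names, columns, tab-delimited row format, and the s3://example-bucket locations are all made-up assumptions, and the helper names are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical catalogue of tables. Names, columns, and S3 paths are
# invented for illustration and do not come from the thread.
TABLES: Dict[str, dict] = {
    "page_views": {
        "columns": [("user_id", "STRING"), ("url", "STRING"), ("ts", "BIGINT")],
        "location": "s3://example-bucket/page_views/",
    },
    "purchases": {
        "columns": [("user_id", "STRING"), ("amount", "DOUBLE")],
        "location": "s3://example-bucket/purchases/",
    },
}


def create_external_table_ddl(
    name: str, columns: List[Tuple[str, str]], location: str
) -> str:
    """Build one CREATE EXTERNAL TABLE statement for a table stored in S3."""
    cols = ",\n".join(f"  {col} {typ}" for col, typ in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {name} (\n{cols}\n)\n"
        # Assumes tab-delimited text files; adjust to the real data format.
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY '\\t'\n"
        f"LOCATION '{location}';"
    )


def build_bootstrap_script(tables: Dict[str, dict]) -> str:
    """Concatenate the DDL for every table into one Hive script."""
    statements = [
        create_external_table_ddl(name, spec["columns"], spec["location"])
        for name, spec in sorted(tables.items())
    ]
    return "\n\n".join(statements) + "\n"


if __name__ == "__main__":
    # Write the output to a .q file and submit it as the first Hive step.
    print(build_bootstrap_script(TABLES))
```

Because every statement uses IF NOT EXISTS and the tables are external, the script only registers metadata over data already sitting in S3, so rerunning it on each fresh cluster should be harmless. The later steps can then query the tables as if a persistent metastore existed.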