Re: Using s3 as warehouse on emr

2016-01-22 Thread Elliot West
Related to this, might it be better to use the s3a protocol instead of s3n? https://wiki.apache.org/hadoop/AmazonS3 Additionally, can anyone advise when EMRFS is required when storing Hive tables in S3? http://docs.aws.amazon.com/ElasticMapReduce/latest/ManagementGuide/emr-overview-arch.html#emr-a

Re: Using s3 as warehouse on emr

2016-01-22 Thread 董亚军
Hi, by default a table created in Hive is located within the location of its database. For example: create database test location 's3://warehouse/hive/test.db'; use test; create table t1... The location of t1 will then be on S3. On Fri, Jan 22, 2016 at 8:52 PM, Zsolt Tóth wrote: > Hi,
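The statements in the example above can be written out in full as the HiveQL sketch below. The bucket and path are the ones from the example; the column definitions for t1 are hypothetical, since the original elides them, and the final DESCRIBE FORMATTED step is an added check rather than part of the original reply.

```sql
-- Sketch only: bucket/path come from the example above; t1's columns are hypothetical.
CREATE DATABASE IF NOT EXISTS test
LOCATION 's3://warehouse/hive/test.db';

USE test;

-- Managed table with no LOCATION clause, so it should be placed
-- in a directory under the database location.
CREATE TABLE t1 (id INT, name STRING);

-- Inspect the "Location:" field of the output to see where Hive put the table.
DESCRIBE FORMATTED t1;
```

If the database location is applied as described, the reported table location should fall under s3://warehouse/hive/test.db.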

Using s3 as warehouse on emr

2016-01-22 Thread Zsolt Tóth
Hi, I'd like to use S3 as the Hive warehouse on my EMR 4.x cluster. I've set hive.metastore.warehouse.dir=s3n://testbucket/hive_warehouse and fs.s3.impl=org.apache.hadoop.fs.s3native.NativeS3FileSystem (not sure if this is needed) in the hive-site.xml on the master node. Double-checked the "set -v
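A minimal way to confirm which values a Hive session actually sees is sketched below. The two property names are the ones mentioned in the message above; the DESCRIBE DATABASE step is an added assumption about what is useful to check, not something the original post describes.

```sql
-- Print the effective value of each property in the current session.
SET hive.metastore.warehouse.dir;
SET fs.s3.impl;

-- Show the location the metastore has recorded for the default database.
DESCRIBE DATABASE default;
```

Comparing the printed warehouse directory with the location reported for the database shows whether the two agree.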