Related to this, might it be better to use the s3a protocol instead of s3n?
https://wiki.apache.org/hadoop/AmazonS3
Additionally, can anyone advise when EMRFS is required when storing Hive
tables in S3?
http://docs.aws.amazon.com/ElasticMapReduce/latest/ManagementGuide/emr-overview-arch.html#emr-a
hi,
the location of table created in hive within the location of database by
default.
for example,
create database test location 's3://warehouse/hive/test.db';
use test;
create table t1...
and then the location of t1 will host on s3.
On Fri, Jan 22, 2016 at 8:52 PM, Zsolt Tóth
wrote:
> Hi,
Hi,
I'd like to use S3 as the hive warehouse on my emr 4.x cluster.
I've set hive.metastore.warehouse.dir=s3n://testbucket/hive_warehouse and
fs.s3.impl=org.apache.hadoop.fs.s3native.NativeS3FileSystem (not sure if
this is needed) in the hive-site.xml on the master node. Double checked the
"set -v