> On 23 Jun 2015, at 00:09, Danny <kont...@dannylinden.de> wrote:
>
> hi,
>
> have you tested
>
> "s3://ww-sandbox/name_of_path/" instead of "s3://ww-sandbox/name_of_path"
>

+ make sure the bucket is there already. Hadoop s3 clients don't currently handle that step

> or have you tested adding your file extension with a placeholder (*), like:
>
> "s3://ww-sandbox/name_of_path/*.gz"
>
> or
>
> "s3://ww-sandbox/name_of_path/*.csv"
>
> depending on your files. If that does not work, please test with the new "s3a"
> protocol of Spark/Hadoop:
>
> https://issues.apache.org/jira/browse/HADOOP-10400

...but don't expect performance or scalability from s3a in Hadoop 2.6; Hadoop 2.7 has the fixes needed for production use, as does CDH5.4
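
For what it's worth, here is a rough PySpark-flavoured sketch of the variants suggested in this thread (trailing slash, extension glob, then the same under s3a). The helper names `candidate_paths` and `first_readable` are made up for illustration, `sc` is assumed to be an existing SparkContext with S3 credentials already configured, and the bucket still has to exist beforehand. Treat it as a starting point rather than a tested recipe.

```python
def candidate_paths(base, extensions=("gz", "csv"), schemes=("s3", "s3a")):
    """Build the path variants to try, in order (hypothetical helper)."""
    # Drop the original scheme and force exactly one trailing slash.
    _, _, rest = base.partition("://")
    rest = rest.rstrip("/") + "/"
    candidates = []
    for scheme in schemes:
        root = f"{scheme}://{rest}"
        candidates.append(root)  # plain "directory" form
        # Glob forms, one per file extension.
        candidates.extend(f"{root}*.{ext}" for ext in extensions)
    return candidates


def first_readable(sc, paths):
    """Return the first path Spark can read from, or None (hypothetical helper).

    `sc` is assumed to be a live SparkContext; take(1) forces Spark to
    actually list and open the input instead of failing lazily later.
    """
    for path in paths:
        try:
            sc.textFile(path).take(1)
            return path
        except Exception as err:  # broad on purpose: this is only a probe
            print(f"{path}: {err}")
    return None


# Usage, assuming a running SparkContext named `sc`:
#   paths = candidate_paths("s3://ww-sandbox/name_of_path")
#   print(first_readable(sc, paths))
```

The s3a variants will only resolve if your Hadoop build includes the s3a client at all, and per the note above you probably want Hadoop 2.7 or CDH5.4 before relying on it.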