Good day! I think there is a problem with reading ORC files through AWS EMRFS.
When I tried to read ORC files over 150M from S3, an ArrayIndexOutOfBounds exception occurred. I'm sure it is an issue on the AWS side, because there was no exception when reading from HDFS or through S3NativeFileSystem.

Parquet works normally, but switching is inconvenient because almost all of our system is based on ORC.

Does anybody know about this issue? I have tried Spark 1.4.1 (EMR 4.0.0), and I could not find anything about this in the 1.5 release notes.

Thank you

--
ca...@korea.com
cazen....@samsung.com
http://www.Cazen.co.kr