Do you have a stack trace of the array out of bounds exception? I don't remember an array out of bounds problem off the top of my head. A stack trace will tell me a lot, obviously.
If you are using Spark 1.4 that implies Hive 0.13, which is pretty old. It may be a problem that we fixed a while ago. Thanks, Owen On Sat, Sep 12, 2015 at 8:15 AM, Cazen Lee <cazen....@gmail.com> wrote: > Good Day! > > I think there are some problems between ORC and AWS EMRFS. > > When I was trying to read "upper 150M" ORC files from S3, ArrayOutOfIndex > Exception occured. > > I'm sure that it's AWS side issue because there was no exception when > trying from HDFS or S3NativeFileSystem. > > Parquet runs ordinarily but it's inconvenience(Almost our system runs > based on ORC) > > Does anybody knows about this issue? > > I've tried spark 1.4.1(EMR 4.0.0) and there are no 1.5 patch-note about > this > > Thank You > --------------------------------------------------------------------- > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > >