I was working on a proof of concept java implementation for LargeList  [1]
implementation (64-bit array offsets).  Our Java implementation doesn't
appear to support Vectors/Arrays larger then Integer.MAX_VALUE addressable
space.

It looks like Message.fbs was updated quite a while ago to support 64-bit
lengths/offsets [2].  I had some questions:

1.  For Java:
  * Is my assessment accurate that is doesn't support 64-bit ranged sizes?
  * Is there a desire to support the 64 bit sizes? (I didn't come across
any JIRAs when I did a search)
 *  Is there a technical blocker for doing so?
 * Any thoughts on approach for doing such a large change (I'm mostly
concerned with breaking existing consumers/performance regressions)?
   - Given that the Java code base appears relatively stable, it might be
that forking and creating a version "2.0" is the best viable option.

2.  For other language implementations, is there support for 64-bit sizes
or only 32-bit?

Thanks,
Micah

P.S. It looks like our spec docs are out of date in regards to this issue,
they still list Int::MAX_VALUE as the largest possible array, it is on my
plate to update and consolidate them.

[1] https://issues.apache.org/jira/browse/ARROW-4810
[2]
https://github.com/apache/arrow/commit/ced9d766d70e84c4d0542c6f5d9bd57faf10781d

Reply via email to