I guess I would be of some help w.r.t Big Endian systems, I in fact want to be part of this development and testing work, quite held-up now for a month. Thanks,Sanjay
> From: wesmck...@gmail.com > Date: Sat, 6 Aug 2016 17:52:26 -0700 > Subject: Re: Is there plan to support BigEndian Systems like SUN SPARC > Hardware ? > To: dev@arrow.apache.org > CC: emkornfi...@gmail.com; jul...@dremio.com > > I suspect that relaxing the constraint to native endianness (and > including this in any IPC/RPC metadata (per ARROW-245) will not cause > too many problems. One of the challenges for us will be testing and > continuous integration -- what are the options for running the test > suite on a regular basis on big endian platforms? I know that in > pandas we occasionally ran into esoteric test failures for the PPC / > big-endian Debian package builds but for the most part there haven't > been any problems. > > - Wes > > On Fri, Aug 5, 2016 at 4:39 AM, Sanjay Rao <getsanjay...@live.com> wrote: > > Some places where explicit check for Little Endian is there- > > ./memory/src/main/java/io/netty/buffer/UnsafeDirectLittleEndian.java: if > > (!NATIVE_ORDER || buf.order() != ByteOrder.BIG_ENDIAN) > > {./memory/src/main/java/io/netty/buffer/UnsafeDirectLittleEndian.java: > > throw new IllegalStateException("Arrow only runs on LittleEndian systems."); > > Sanjay > >> From: pchan...@maprtech.com > >> Date: Thu, 4 Aug 2016 17:04:34 -0700 > >> Subject: Re: Is there plan to support BigEndian Systems like SUN SPARC > >> Hardware ? > >> To: dev@arrow.apache.org; emkornfi...@gmail.com > >> CC: jul...@dremio.com > >> > >> Drill's assumption of little endian is in the ValueVector code, and Arrow > >> has inherited the same assertion. ( > >> https://github.com/apache/arrow/blob/master/java/memory/src/main/java/io/netty/buffer/UnsafeDirectLittleEndian.java#L58 > >> ) > >> > >> In the Java implementation, the underlying Netty implementation handles the > >> conversion between endianness fairly well, so potentially this assert can > >> be removed from here and Drill can move this higher up in the Drill code. > >> > >> > >> Parth > >> > >> On Thu, Aug 4, 2016 at 1:14 PM, Micah Kornfield <emkornfi...@gmail.com> > >> wrote: > >> > >> > Hi Julien, > >> > Thats the theory. I don't think that there is anything in the C++ code > >> > base that should break but we don't have access to hardware to verify > >> > that. > >> > > >> > The java Arrow code currently asserts that it is running on a little > >> > endian > >> > machine. I did a very quick scan of the Java code and didn't see > >> > anything > >> > there would break on a big-endian system, but according to at least one > >> > person who is working on Drill, it seems that Drill assumes little > >> > endianness (I don't know if this is in Arrow/ValueVector code or it is > >> > higher up the stack in the Drill code). > >> > > >> > Thanks, > >> > Micah > >> > > >> > > >> > On Thu, Aug 4, 2016 at 11:36 AM, Julien Le Dem <jul...@dremio.com> wrote: > >> > > >> > > So it sounds like right now it just works as long as there are no > >> > > inter-system communication (with different endianness) because both > >> > > java > >> > > and c++ code just use the underlying endianness. > >> > > Is that correct? > >> > > > >> > > > >> > > On Thu, Aug 4, 2016 at 11:17 AM, Micah Kornfield > >> > > <emkornfi...@gmail.com> > >> > > wrote: > >> > > > >> > >> Hi Sanjay, > >> > >> I think we are trying to work that out now. As you've seen with some > >> > >> of > >> > >> you initial investigation we have no coverage for big-endian machines > >> > yet. > >> > >> But in the long run, we should be able to make it work (it seems like > >> > >> there > >> > >> might be some difference of opinion on how to make it work). > >> > >> > >> > >> Thanks, > >> > >> Micah > >> > >> > >> > >> On Mon, Aug 1, 2016 at 11:16 AM, Sanjay Rao <getsanjay...@live.com> > >> > >> wrote: > >> > >> > >> > >> > Hi Wes, Hi Micah, > >> > >> > I understood what you meant, so point 2. Arrow working with Big > >> > >> > Endian > >> > >> > machine to Big Endian shouldn't be an issue right ? > >> > >> > Please confirm. > >> > >> > Thanks,Sanjay > >> > >> > > From: wesmck...@gmail.com > >> > >> > > Date: Mon, 1 Aug 2016 11:07:07 -0700 > >> > >> > > Subject: Re: Is there plan to support BigEndian Systems like SUN > >> > SPARC > >> > >> > Hardware ? > >> > >> > > To: dev@arrow.apache.org; emkornfi...@gmail.com > >> > >> > > > >> > >> > > hey Micah, > >> > >> > > > >> > >> > > On Mon, Aug 1, 2016 at 11:02 AM, Micah Kornfield < > >> > >> emkornfi...@gmail.com> > >> > >> > wrote: > >> > >> > > > Hi Wes, > >> > >> > > > The point I was trying to argue from an earlier thread is that > >> > >> > > > the > >> > >> most > >> > >> > > > common cases for relocation are: > >> > >> > > > 1. Little endian machine to little endian machine (most likely > >> > same > >> > >> > > > machine) > >> > >> > > > 2. big endian machine to big endian machine (most likely same > >> > >> machine) > >> > >> > > > 3. big endian machine to little endian machine or vice versa > >> > >> > > > > >> > >> > > > The purpose of the metadata would be to make use-cases 1 and 2 > >> > >> possible > >> > >> > > > without byte-swapping. Use case 3 would obviously require byte > >> > >> > swapping > >> > >> > > > but for an initial implementation the code could simply indicate > >> > >> that > >> > >> > it is > >> > >> > > > not supported. > >> > >> > > > > >> > >> > > > This seems less complex to me than actually implementing any > >> > >> > > > sort > >> > of > >> > >> > > > byte-swapping logic while still supporting the widest variety of > >> > >> > hardware > >> > >> > > > with the same code for the most common use-cases. > >> > >> > > > >> > >> > > This makes sense. My comments were for the situation that a big > >> > endian > >> > >> > > system would be exposing memory to an unknown consumer -- for > >> > example, > >> > >> > > if we implemented an RPC wire format for Arrow memory, then in > >> > general > >> > >> > > a big endian system would need to send little-endian integers to > >> > >> > > an > >> > >> > > arbitrary receiver. I'm not sure the best way to provide for easy > >> > >> > > native-endianness support for cases 1/2, but trying to fully solve > >> > >> > > this problem now seems premature until we've established some of > >> > these > >> > >> > > tools (so long as we haven't painted ourselves into a corner). > >> > >> > > > >> > >> > > - Wes > >> > >> > > > >> > >> > > > > >> > >> > > > Thanks, > >> > >> > > > Micah > >> > >> > > > > >> > >> > > > P.S. If anybody can provide pointers I'd be interested to > >> > understand > >> > >> > which > >> > >> > > > pieces of the java code make assumptions about > >> > >> > > > little-endianness. > >> > >> > > >> > >> > > >> > >> > >> > > > >> > > > >> > > > >> > > -- > >> > > Julien > >> > > > >> > > >