[jira] [Created] (ARROW-2017) Array initialization with large (>2**31-1) uint64 values fails

2018-01-22 Thread Ian Roddis (JIRA)
Ian Roddis created ARROW-2017: - Summary: Array initialization with large (>2**31-1) uint64 values fails Key: ARROW-2017 URL: https://issues.apache.org/jira/browse/ARROW-2017 Project: Apache Arrow

Re: Uniform types in Arrow table columns (pyarrow.array) and the case of python dictionaries

2018-01-22 Thread simba nyatsanga
Great! Thanks Wes. It's really great and interesting to see a concerted effort to have a conversion from a language specific implementation of common data structures into a common memory layout that can be consumed by another language (HashMap in Java/ Hash in Ruby etc). Excited to see how the API

Re: Uniform types in Arrow table columns (pyarrow.array) and the case of python dictionaries

2018-01-22 Thread Wes McKinney
Note we have https://issues.apache.org/jira/browse/ARROW-1705 (and maybe some other JIRAs, I'd have to go digging) about improving support for converting Python dicts to the right Arrow memory layout. - Wes On Mon, Jan 22, 2018 at 4:50 PM, simba nyatsanga wrote: > Hi Uwe, > > Thank you very much

Re: Uniform types in Arrow table columns (pyarrow.array) and the case of python dictionaries

2018-01-22 Thread simba nyatsanga
Hi Uwe, Thank you very much for the detailed explanation. I have a much better understanding now. Cheers On Mon, 22 Jan 2018 at 19:37 Uwe L. Korn wrote: > Hello Simba, > > find the answers inline. > > On Mon, Jan 22, 2018, at 7:29 AM, simba nyatsanga wrote: > > Hi Everyone, > > > > I've got tw

Re: Uniform types in Arrow table columns (pyarrow.array) and the case of python dictionaries

2018-01-22 Thread Uwe L. Korn
Hello Simba, find the answers inline. On Mon, Jan 22, 2018, at 7:29 AM, simba nyatsanga wrote: > Hi Everyone, > > I've got two questions that I'd like help with: > > 1. Pandas and numpy arrays can handle multiple types in a sequence eg. a > float and a string by using the dtype=object. From wha