Discussion: Should we make string/binary types first class Arrow Array types?

2016-07-13 Thread Micah Kornfield
Today String and Binary types are represented in memory as list [1] and we use logical types to distinguish between a list of bytes and string type [2]. The question of whether this is sufficient or if we should make a first class string/binary type has come up tangentially on a few threads and w

Re: Installing Arrow

2016-07-13 Thread Manoj Kumar
I stringed together a set of installation instructions to help future Python developers. The PR is here https://github.com/apache/arrow/pull/105 On Wed, Jul 6, 2016 at 2:28 PM, Wes McKinney wrote: > You can look at the Travis CI scripts to see the build procedure for > each component: > > https

[jira] [Created] (ARROW-240) Installation instructions for pyarrow

2016-07-13 Thread Manoj Kumar (JIRA)
Manoj Kumar created ARROW-240: - Summary: Installation instructions for pyarrow Key: ARROW-240 URL: https://issues.apache.org/jira/browse/ARROW-240 Project: Apache Arrow Issue Type: Improvement

Re: Proposed new type: Fixed width list

2016-07-13 Thread Micah Kornfield
Thanks Jacques. I'm ok dropping the fixed width proposal for now and revisiting it at a later point. I'll start a thread later today to break off the discussion on adding string/binary as a primitive type. -Micah On Wed, Jul 13, 2016 at 7:49 AM, Jacques Nadeau wrote: > > On Tue, Jul 12, 2016 a

Re: [JAVA] Contribute to Apache Arrow

2016-07-13 Thread Jacques Nadeau
Hey Kiril, I think our biggest issues on the java side right now are a sample application, some getting started documentation and better tests. It would also be helpful if you could review the changes that Steven posted about cleaning up some of the interfaces. See this pr: https://github.com/apa

Re: Proposed new type: Fixed width list

2016-07-13 Thread Jacques Nadeau
On Tue, Jul 12, 2016 at 10:42 PM, Micah Kornfield wrote: > Two questions come to mind. > 1. Is it useful to have fixed width with list types exclusive of > binary types? > I think "useful" isn't a strong enough reason to add more types. It seems like a fairly rare occurrence and thus a prematur

ApacheCon Europe call for papers open

2016-07-13 Thread Rich Bowen
As you are no doubt already aware, we will be holding ApacheCon in Seville, Spain, the week of November 14th, 2016. The call for papers (CFP) for this event is now open, and will remain open until September 9th. The event is divided into two parts, each with its own CFP. The first part of the eve