subject:"pandas and Apache Arrow in context"

Re: pandas and Apache Arrow in context

2016-02-22 Thread Ted Dunning

Put that answer on the front page of the web site. Well said. On Mon, Feb 22, 2016 at 2:05 PM, Wes McKinney wrote: > hi Stuart, > > Currently pandas and NumPy only support flat, non-nested data. Nested > data includes column value types including arrays, structs, maps, and > unions. This enab

Re: pandas and Apache Arrow in context

2016-02-22 Thread Wes McKinney

hi Stuart, Currently pandas and NumPy only support flat, non-nested data. Nested data includes column value types including arrays, structs, maps, and unions. This enables you to analyze JSON-like data natively in-memory without pre-flattening or normalization. There's also an open question about

Re: pandas and Apache Arrow in context

2016-02-22 Thread Stuart Axelbrooke

Hey Wes, Very exciting to see things moving along on the Python front. As you state in your post, fast, ubiquitous columnar data will be a great foundation, especially for more modern data processing and ETL tools. Though I am a bit curious what you mean by nested columnar data... Thanks, Stuar

pandas and Apache Arrow in context

2016-02-22 Thread Wes McKinney

hi all, I did a little bit of analysis of the costs of serialization bottlenecks in data access for Python pandas users and how (at a high level, no perf numbers yet!) Apache Arrow will help: http://wesmckinney.com/blog/pandas-and-apache-arrow/ Feedback and comments welcome. cheers, Wes

Re: pandas and Apache Arrow in context

Re: pandas and Apache Arrow in context

Re: pandas and Apache Arrow in context

pandas and Apache Arrow in context

4 matches

Site Navigation

Mail list logo

Footer information