hi all,

I did a little bit of analysis of the costs of serialization bottlenecks in data access for Python pandas users and how (at a high level, no perf numbers yet!) Apache Arrow will help:

http://wesmckinney.com/blog/pandas-and-apache-arrow/

Feedback and comments welcome.

cheers,
Wes