Hi folks, There was some discussion on here a couple of weeks ago about using the Apache Arrow in memory format for Cassandra data so I thought I'd share the following posts / code we just released as alpha (apache 2 license).
Code: https://github.com/datastax/sstable-to-arrow Post Part 1: https://www.datastax.com/blog/analyzing-cassandra-data-using-gpus-part-1 Post Part 2: https://www.datastax.com/blog/analyzing-cassandra-data-using-gpus-part-2 I also think the cross language sstable parsing code and visual documentation is a tremendous contribution for the project and would love to see more folks pick it up and use it for other purposes. If anyone is interested feel free to reach out or join our live workshop on this topic in mid August: https://www.eventbrite.com/e/workshop-analyzing-cassandra-data-with-gpus-tickets-164294668777 --Seb