> On Wed, May 11, 2011 at 10:23 AM, Luke Biddell <luke.bidd...@gmail.com> wrote: >> I wouldn't mind knowing how other people are approaching this problem too. >> >> On 11 May 2011 11:27, Oliver Dungey <oliver.dun...@gmail.com> wrote: >>> I am currently working on a system with Cassandra that is written purely in >>> Java. I know our end solution will require other languages to access the >>> data in Cassandra (Python, C++ etc.). What is the best way to store data to >>> ensure I can do this? Should I serialize everything to strings/json/xml >>> prior to the byte conversion? We currently use the Hector serializer, I >>> wondered if we should just switch this to something like Jackson/JAXB? Any >>> thoughts very welcome. >> >
We have clients that use python and C++. We just generated thrift bindings for C++ and use thrift directly - it really is not bad at all, the cassandra.h file generated defines most all methods that any higher level API would, it just doesn't have built in pooling, or reconnect etc. As far as data formats go - we are capturing data (packets) using libpcap and storing the raw bytes (literally the entire raw packet including ethernet and ip headers) into cassandra which are later read out by python scripts for ascii display, or conversion to pcap files to open in wireshark. -Eric