Astyanax provides a streaming file feature and was written by netflix who is 
storing probably a huge amount of files with that feature.  I was going to use 
that feature for one product but I never got around to creating the 
product…..but I still use astyanax under the hood of PlayOrm  (we kind of use a 
combination so we can put some relational data in cassandra with PlayOrm and 
then do our own thing as well with noSQL with the raw astyanx apis as 
well)…..it gets rid of us needing the RDBMS at all which is nice.

Later,
Dean

From: Vasileios Vlachos 
<vasileiosvlac...@gmail.com<mailto:vasileiosvlac...@gmail.com>>
Reply-To: "user@cassandra.apache.org<mailto:user@cassandra.apache.org>" 
<user@cassandra.apache.org<mailto:user@cassandra.apache.org>>
Date: Tuesday, October 16, 2012 9:49 AM
To: "user@cassandra.apache.org<mailto:user@cassandra.apache.org>" 
<user@cassandra.apache.org<mailto:user@cassandra.apache.org>>
Subject: Using Cassandra to store binary files?

ed to store about 40G of binary files in a redundant way and since we are 
already using Cassandra for other applications we were thinking that we could 
just solve that problem using the same Cassandra cluster. Each individual File 
will be approximately 1MB.

We are thinking that the data structure should be very simple for this case, 
using one CF with just one column which will contain the actual files. The row 
key should then uniquely identify each file. Speed is not an issue when we 
retrieving the files. Impacting other applications using Cassandra is more 
important for us. In order to prevent performance issues with other 
applications using our Cassandra cluster at the moment, we think we should 
disable key_cache and row_cache for this column family.

Anyone tried this before or anyone thinks this is going to be a bad idea? Do 
you think our current plan is sensible? Any input would be much appreciated. 
Thank you in adv

Reply via email to