Re: Archive cassandra old data into Hadoop

2018-03-12 Thread Rahul Singh
HDFS / S3 is a great place to dump this data. You can also consider other types of compaction strategies for “COLD DATA” in not so powerful C* clusters for which the purpose is write only. C* is still better in my opinion for data management than S3/HDFS.  It depends on how easy you want the ret

Archive cassandra old data into Hadoop

2018-03-12 Thread Javier Pareja
Hi, I understand that a well designed cassandra system will allow to query ANY data within it at an incredible speed as well as ingesting data at a very fast pace. However this data is going to grow until it is archived. As I see it, data has two stages, HOT DATA when data is accessible to be que