Hi all, We are investigating using Cassandra in our data platform. We would like data to go into Cassandra first and to eventually be replicated into our data lake in HDFS for long term cold storage. Does anyone know of a good way of doing this? We would rather not have parallel writes to HDFS and Cassandra because we were hoping that we could use Cassandra primary keys to de-duplicate events.
Thanks, -- <http://shopkick.com/> *BENJAMIN VOGAN* | Data Platform Team Lead shopkick <http://www.shopkick.com/> <http://facebook.com/shopkick> <http://instagram.com/shopkick> <http://pinterest.com/shopkick> <http://twitter.com/shopkick> <https://www.linkedin.com/company/831240?trk=tyah&trkInfo=clickedVertical%3Acompany%2CentityType%3AentityHistoryName%2CclickedEntityId%3Acompany_831240%2Cidx%3A0> The indispensable app that rewards you for shopping.