Hi all,

We are investigating using Cassandra in our data platform.  We would like
data to go into Cassandra first and to eventually be replicated into our
data lake in HDFS for long term cold storage.  Does anyone know of a good
way of doing this?  We would rather not have parallel writes to HDFS and
Cassandra because we were hoping that we could use Cassandra primary keys
to de-duplicate events.

Thanks,
-- 
<http://shopkick.com/>
*BENJAMIN VOGAN* | Data Platform Team Lead
shopkick <http://www.shopkick.com/>
<http://facebook.com/shopkick> <http://instagram.com/shopkick>
<http://pinterest.com/shopkick> <http://twitter.com/shopkick>
<https://www.linkedin.com/company/831240?trk=tyah&trkInfo=clickedVertical%3Acompany%2CentityType%3AentityHistoryName%2CclickedEntityId%3Acompany_831240%2Cidx%3A0>

The indispensable app that rewards you for shopping.

Reply via email to