Agreed w/ ES not being the durable data store.  I would recommend treating it 
as ephemeral, and using Cassandra as your source of truth.  Keep in mind if you 
change your ES index mapping, you’ll require a full reindex in order to search 
the data properly.  It’s not like adding a secondary index w/ a DB, where it’ll 
go back and take care of it for you.

Jon

On May 3, 2014, at 12:31 AM, DuyHai Doan <doanduy...@gmail.com> wrote:

> Hello Tim
> 
>  You're absolutely right about ES for the query part. This is the perfect fit 
> for complex queries. Now regarding your question:
> 
> "What advantages does Cassandra give me over ES?" --> linear scalability & 
> durability. ES is just a super index cluster. I've talked to ES guys. If they 
> do not sell ES right now as a "database for complex search" it's because 
> there is no strong guarantee about durability for your data. Many people just 
> live with it and it's fine. Also, if you store the original data and just 
> pump it into ES it's also fine.
> 
> 
> 
> 
> On Sat, May 3, 2014 at 9:14 AM, Tim Uckun <timuc...@gmail.com> wrote:
> Hey all.
> 
> I have been trying out some data stores for time series data and Cassandra 
> was the first on my list because so many people are using it for the same 
> purpose.  I have read many articles on how to model my time series data and 
> tried several variations of schemas which I thought made sense for my data 
> but I have really struggled to run some complex queries I need to run.  This 
> has led me down a kind of a rabbit hole of trying to create various 
> "materialized views" and shotgunning the data into multiple tables which 
> might be able to run my queries.
> 
> In the mean time I also took the same data and pumped it into Elasticsearch 
> and was able to run almost all the queries I needed without doing anything 
> fancy. Just put the data in, and run your query. The new aggregations in ES 
> are pretty slick although they don't seem to be 100% accurate compared to 
> running the same query in Postgres.
> 
> My question is this.  What advantages does Cassandra give me over ES?  Does 
> it compact the data better? Is it faster to query once your data sizes are 
> huge? Does it use less bandwidth? Is it easier to administer? 
> 
> I know there must be very compelling reasons to use C* because so many 
> companies are depending on it for their bread and butter so I'd love to hear 
> your take.
> 
> Thanks.
> 

Reply via email to