That’s a key advantage of DataStax Enterprise – Solr is fully integrated into 
the Casssandra cluster, so there is only a single infrastructure.

-- Jack Krupansky

From: Tim Uckun 
Sent: Sunday, May 4, 2014 6:36 AM
To: user@cassandra.apache.org 
Subject: Re: Cassandra vs Elasticsearch.

I am hesitant about keeping both a cassandra and ES cluster because it 
effectively doubles my infrastructure costs.  It may be much cheaper to keep 
the data in log files and have ES index them for searching.    Thanks for the 
input everybody, there is much to think about here.






On Sat, May 3, 2014 at 7:31 PM, DuyHai Doan <doanduy...@gmail.com> wrote:

  Hello Tim


  You're absolutely right about ES for the query part. This is the perfect fit 
for complex queries. Now regarding your question:

  "What advantages does Cassandra give me over ES?" --> linear scalability & 
durability. ES is just a super index cluster. I've talked to ES guys. If they 
do not sell ES right now as a "database for complex search" it's because there 
is no strong guarantee about durability for your data. Many people just live 
with it and it's fine. Also, if you store the original data and just pump it 
into ES it's also fine.






  On Sat, May 3, 2014 at 9:14 AM, Tim Uckun <timuc...@gmail.com> wrote:

    Hey all.


    I have been trying out some data stores for time series data and Cassandra 
was the first on my list because so many people are using it for the same 
purpose.  I have read many articles on how to model my time series data and 
tried several variations of schemas which I thought made sense for my data but 
I have really struggled to run some complex queries I need to run.  This has 
led me down a kind of a rabbit hole of trying to create various "materialized 
views" and shotgunning the data into multiple tables which might be able to run 
my queries.


    In the mean time I also took the same data and pumped it into Elasticsearch 
and was able to run almost all the queries I needed without doing anything 
fancy. Just put the data in, and run your query. The new aggregations in ES are 
pretty slick although they don't seem to be 100% accurate compared to running 
the same query in Postgres.


    My question is this.  What advantages does Cassandra give me over ES?  Does 
it compact the data better? Is it faster to query once your data sizes are 
huge? Does it use less bandwidth? Is it easier to administer? 


    I know there must be very compelling reasons to use C* because so many 
companies are depending on it for their bread and butter so I'd love to hear 
your take.


    Thanks.


Reply via email to