And there is this https://github.com/vroyer/elassandra which is still under
active development and not sure how they plan to keep up with Apache
Cassandra moving forward.

On Mon, Nov 7, 2016 at 9:36 AM, kant kodali <kanth...@gmail.com> wrote:

> Fixing typo's
>
> Hi Tauzell,
>
> Yeah our users want to query, do aggregations on Elastic Search directly
> and we cannot have inconsistent data  because say the writes didn't make it
> into Cassandra but made it to Elastic search then a simple aggregations
> like count will lead to a wrong answer but again as @Hans pointed out this
> is no longer a Kafka question and also your solution has merits in its own
> way which I really appreciate it! your solution does make writes faster and
> probably some performance penalty on the read side given repairs happen
> during the read stage in Cassandra (We could check in both but since our
> users query elastic search directly there is no way for us to check it in
> Cassandra else we could go with your solution as well).
>
> Basically, we use ES as an index for Cassandra since secondary indexes in
> Cassandra (including the latest implementation SASI) doesn't work with our
> use case since we have high cardinality columns (which means every row in a
> column is unique so index on a high cardinality column is not very
> efficient given the underlying data structure used by SASI, but with
> inverted index which is used by ES is much faster).
>
> We do use Apache Spark along with Cassandra and I am trying to explore
> Succint http://succinct.cs.berkeley.edu/wp/wordpress/ and if everything
> works out with Succint we can get rid of elastic search. The only thing
> that I worry and still testing with Spark, Cassandra and Succint is whether
> If the aggregations/computations of a column or search on particular
> Cassandra field/column  can happen in real time given a big dataset (with
> ES it does so the goal is to see if we can get somewhere close or perform
> even better).
>
> Thanks!
>
>
>

Reply via email to