The vendor application is not likely to change a tad.

There are 200+ times more updates and 50x inserts than analytical loads.

 
I can simply remove the indexes (in the RDBMS) and thus avoid the issue 
altogether, but I expect the analytical loads to suffer.

In Cassandra to just be able to query (in CQL) on a column I have to have an 
index, the question is what tall the fragmentation coming from the frequent 
updates and inserts has on a CF? Do I also need to manually defrug? 
Or it is more or less manageable?

 

Regards,

Arthur


 

 

---- Original Message ----
From: Jack Krupansky <j...@basetechnology.com>
To: user <user@cassandra.apache.org>
Sent: Sun, Sep 28, 2014 11:41 am
Subject: Re: Indexes Fragmentation




It’s always a tradeoff between the level of sophistication of the platform and 
how much work you want to do in the application itself.
 
But, yes, secondary indexing is always added overhead, and added complexity.
 
And index tables are a viable approach as well. Again, trading off a simpler 
platform for added complexity in the application.
 
Which way to go? As we say in data modeling, always start by looking at what 
queries and access patterns you expect to be using.
 
So, how many different ways do you expect to query?
 
Your original inquiry related to fragmentation due to heavy updates, but the 
background question remains how you intend to access that updated data? I mean, 
any perceived fragmentation may just be statistical noise compared to access 
efficiency overall.
 
-- Jack Krupansky

 

From: Arthur Zubarev 
Sent: Sunday, September 28, 2014 11:19 AM
To: user@cassandra.apache.org 
Subject: Re: Indexes Fragmentation

 

Thank you Jack,

But I am afraid it may be an overhead. Added complexity.

/Arthur

 
 
---- Original Message ----
From: Jack Krupansky <j...@basetechnology.com>
To: user <user@cassandra.apache.org>
Sent: Sun, Sep 28, 2014 11:03 am
Subject: Re: Indexes Fragmentation




Take a look at DataStax Enterprise as well, with its integrated Solr indexing 
of Cassandra data.
 
-- Jack Krupansky

 

From: Arthur Zubarev 
Sent: Sunday, September 28, 2014 10:55 AM
To: user@cassandra.apache.org 
Subject: Indexes Fragmentation

 

Hi all:

A client on a RDBMS faces quick index fragmentations, statistics become 
inaccurate. Many within 4 hours (fast updates + writes, but mostly updates).

I am looking into replacing the RDBMS with Cassandra.

Will I face the same issue with indexes with Cassandra?

Thank you!

Regards,

Arthur 






Reply via email to