Re: Drop TTLd rows: upgradesstables -a or scrub?

2018-09-10 Thread Charulata Sharma (charshar)
Scrub takes a very long time and does not remove the tombstones. You should do garbage cleaning. It immediately removes the tombstones. Thaks, Charu From: Oleksandr Shulgin Reply-To: "user@cassandra.apache.org" Date: Monday, September 10, 2018 at 6:53 AM To: "user@cassandra.apache.org" Subjec

Re: Too many tombstones using TTL

2018-09-07 Thread Charulata Sharma (charshar)
Hi, I have struggled a lot with tombstones and finally learnt the following: - Deletes are not the only operation that cause tombstones. Check if you are inserting any nulls in any of the table columns. If yes then if you use Prepared statements, then you can unset the null value.

Tombstone experience

2018-08-23 Thread Charulata Sharma (charshar)
Hi All, I have shared my experience of tombstone clearing in this blog post. Sharing it in this forum for wider distribution. https://medium.com/cassandra-tombstones-clearing-use-case/the-curios-case-of-tombstones-d897f681a378 Thanks, Charu

Strange metadata being appended in some rows

2018-05-30 Thread Charulata Sharma (charshar)
Hi, I am observing a very strange behavior in our cluster. Metadata is being prefixed in some rows. This metadata cannot be sent by application primarily because application writing to C* will not have this data, and also applications use custom Java objects and this metadata doesn’t fall in

Re: Using Spark to delete from Transactional Cluster

2018-03-23 Thread Charulata Sharma (charshar)
is an old way of thinking. Why not store the data and just use the relevant data .. let really old data expire .. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 23, 2018, 11:38 AM -0700, Charulata Sharma (charshar) , wrote: Hi Rahul, Thanks for your answer. Why do you s

Re: Using Spark to delete from Transactional Cluster

2018-03-23 Thread Charulata Sharma (charshar)
#x27;s the same thing - normal deletion with tombstones replicated. Is it that you're doing deletes in the analytics DC instead of your real time one? On Fri, Mar 23, 2018 at 11:38 AM Charulata Sharma (charshar) mailto:chars...@cisco.com>> wrote: Hi Rahul, Thanks for your answer. W

Re: Using Spark to delete from Transactional Cluster

2018-03-23 Thread Charulata Sharma (charshar)
spread your deletes over days. Another thing to consider is instead of deleting setting a TTL which will eventually get cleansed. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 22, 2018, 2:19 PM -0500, Charulata Sharma (charshar) , wrote: Hi, Wanted to know the community’s exp

Using Spark to delete from Transactional Cluster

2018-03-22 Thread Charulata Sharma (charshar)
Hi, Wanted to know the community’s experiences and feedback on using Apache Spark to delete data from C* transactional cluster. We have spark installed in our analytical C* cluster and so far we have been using Spark only for analytics purposes. However, now with advanced features of Spark 2.

Re: One time major deletion/purge vs periodic deletion

2018-03-06 Thread Charulata Sharma (charshar)
val and you just focus on insertion. On Tue, Mar 6, 2018, 07:39 Charulata Sharma (charshar) mailto:chars...@cisco.com>> wrote: Hi, Wanted the community’s feedback on deciding the schedule of Archive and Purge job. Is it better to Purge a large volume of data at regular intervals (like

One time major deletion/purge vs periodic deletion

2018-03-05 Thread Charulata Sharma (charshar)
Hi, Wanted the community’s feedback on deciding the schedule of Archive and Purge job. Is it better to Purge a large volume of data at regular intervals (like run A&P jobs once in 3 months ) or purge smaller amounts more frequently (run the job weekly??) Some estimates on the number of d

Re: Right sizing Cassandra data nodes

2018-02-19 Thread Charulata Sharma (charshar)
2018, 3:08 PM -0500, Charulata Sharma (charshar) , wrote: Hi All, Looking for some insight into how application data archive and purge is carried out for C* database. Are there standard guidelines on calculating the amount of space that can be used for storing data in a specific node. Some point

Right sizing Cassandra data nodes

2018-02-19 Thread Charulata Sharma (charshar)
Hi All, Looking for some insight into how application data archive and purge is carried out for C* database. Are there standard guidelines on calculating the amount of space that can be used for storing data in a specific node. Some pointers that I got while researching are; - Alloca

Re: Async queries

2017-10-02 Thread Charulata Sharma (charshar)
ionStage/CompletableFuture (java 8 futures), although we'll probably provide a guava extension as well for those who still want to use ListenableFuture. Thanks, Andy On Mon, Oct 2, 2017 at 6:44 PM Charulata Sharma (charshar) mailto:chars...@cisco.com>> wrote: Hi , We are observing som

Async queries

2017-10-02 Thread Charulata Sharma (charshar)
Hi , We are observing some performance issues when executing a large number of read/write queries. We use executeAsync query for most of our read and write requests and then future.getUninterruptibly() methods before returning to the client application. Code snippet is: (In the bind portion

Re: Lightweight transaction in Multi DC

2017-09-08 Thread Charulata Sharma (charshar)
p 8, 2017 at 2:33 PM, Charulata Sharma (charshar) mailto:chars...@cisco.com>> wrote: Yes …it is with LOCAL_SERIAL. Should I be using SERIAL ? Thanks, Charu From: DuyHai Doan mailto:doanduy...@gmail.com>> Reply-To: "user@cassandra.apache.org<mailto:user@cassandra.apache.org>

Re: Lightweight transaction in Multi DC

2017-09-08 Thread Charulata Sharma (charshar)
sing CAS with SERIAL consistency level for your multi-DC setup ? On Fri, Sep 8, 2017 at 9:27 PM, Charulata Sharma (charshar) mailto:chars...@cisco.com>> wrote: Hi, We are facing a serious issue with CAS in a multi DC setup and I wanted to get some input on it from the forum. We have a Column

Lightweight transaction in Multi DC

2017-09-08 Thread Charulata Sharma (charshar)
Hi, We are facing a serious issue with CAS in a multi DC setup and I wanted to get some input on it from the forum. We have a Column family which stores counts for the number of events our application receives. When the counts reach a certain threshold, there is another process which kicks

Simulating Read and WriteTimeouts programatically

2017-07-06 Thread Charulata Sharma (charshar)
Hi All, Recently I learnt that in C* driver versions 3.1.x and up, there is a change in retry. If the query statement is not idempotent, there won't be any retry and by default the queries are considered to be non idempotent. Hence, even though Retry policy is mentioned while setting up the C*

Re: Cannot achieve consistency level LOCAL_ONE

2017-07-06 Thread Charulata Sharma (charshar)
Hi, I am facing similar issues with SYSTEM_AUTH keyspace and wanted to know the implication of disabling the "cassandra" superuser. Thanks, Charu From: "wxn...@zjqunshuo.com" mailto:wxn...@zjqunshuo.com>> Date: Wednesday, June 14, 2017 at 2:16 AM To: Oleksandr Shul

Commit Log question

2017-04-27 Thread Charulata Sharma (charshar)
Hi , Can anyone please tell me the implication of increasing the commitlog_segment_size_in_mb from the default value of 32 MB to a higher value? Some of our mutations are > 16MB, so the writes are failing. This is because of the way we store data in our Column families. 95% of the data is <

Re: Read after Write inconsistent at times

2017-02-24 Thread Charulata Sharma (charshar)
estamps: https://docs.datastax.com/en/developer/java-driver/3.1/manual/query_timestamps/ (of course only when you are using a single client or all clients are in sync via ntp). Am 24.02.2017 um 07:29 schrieb Charulata Sharma (charshar): Hi All, In my application sometimes I cannot read d

Read after Write inconsistent at times

2017-02-23 Thread Charulata Sharma (charshar)
Hi All, In my application sometimes I cannot read data that just got inserted. This happens very intermittently. Both write and read use LOCAL QUOROM. We have a cluster of 12 nodes which spans across 2 Data Centers and a RF of 3. Has anyone encountered this problem and if yes what steps have y

Multi DC setup question

2016-06-27 Thread Charulata Sharma (charshar)
Hi All, We are setting up another Data Center and have the following question: 6 nodes in each DC Cassandra cluster. All key spaces have an RF of 3 Our scenario is Apps node connect to Cassandra cluster using LOCAL_QUORUM consistency. We want to ensure that If 5 nodes out of the 6 are a

RE: Cassandra Data Audit

2016-02-24 Thread Charulata Sharma (charshar)
6:36 GMT+01:00 Charulata Sharma (charshar) mailto:chars...@cisco.com>>: To all Cassandra experts out there, Can you please let me know if there is any inbuilt Cassandra feature that allows audits on Column family data ? When I change any data in a CF, I want to record that c

Cassandra Data Audit

2016-02-23 Thread Charulata Sharma (charshar)
To all Cassandra experts out there, Can you please let me know if there is any inbuilt Cassandra feature that allows audits on Column family data ? When I change any data in a CF, I want to record that change. Probably store the old value as well as the changed one. One way of doing