design principle to manage roll back

2020-07-13 Thread Manu Chadha
Hi What are the design approaches I can follow to ensure that data is consistent from an application perspective (not from individual tables perspective). I am thinking of issues which arise due to unavailability of rollback or executing atomic transactions in Cassandra. Is Cassandra not suitab

Relation between num_tokens and cluster extend limitations

2020-07-13 Thread onmstester onmstester
Hi, I'm using allocate_tokens_for_keyspace and num_tokens=32 and i wan't to extend the size of some clusters. I read in articles that for num_tokens=4, one should add more 25% of cluster size for the cluster to become balanced again. 1. For example, with num_tokens=4 and already have 16 n

Re: Cqlsh copy command on a larger data set

2020-07-13 Thread Kiran mk
I wouldn't say it's good approach for that size. But you can try dsbulk approach too. Try to split output into multiple files. Best Regards, Kiran M K On Tue, Jul 14, 2020, 5:17 AM Jai Bheemsen Rao Dhanwada < jaibheem...@gmail.com> wrote: > Hello, > > I would like to copy some data from one ca

Cqlsh copy command on a larger data set

2020-07-13 Thread Jai Bheemsen Rao Dhanwada
Hello, I would like to copy some data from one cassandra cluster to another cassandra cluster using the CQLSH copy command. Is this the good approach if the dataset size on the source cluster is very high(500G - 1TB)? If not what is the safe approach? and are there any limitations/known issues to

Re: Running Large Clusters in Production

2020-07-13 Thread Reid Pinchback
I don’t know if it’s the OPs intent in this case, but the response latency profile will likely be different for two clusters equivalent in total storage but different in node count. Multiple reasons for that, but probably the biggest would be that you’re changing a divisor in I/O queuing statist

RE: Upgrading cassandra cluster from 2.1 to 3.X when using custom TWCS

2020-07-13 Thread Marie-Anne
How do I get off this listserv? From: Gil Ganz [mailto:gilg...@gmail.com] Sent: Monday, July 13, 2020 11:30 AM To: user@cassandra.apache.org Subject: Re: Upgrading cassandra cluster from 2.1 to 3.X when using custom TWCS Jon, great advice, I ended up doing just that, if anyone else needs t

Re: Upgrading cassandra cluster from 2.1 to 3.X when using custom TWCS

2020-07-13 Thread Gil Ganz
Jon, great advice, I ended up doing just that, if anyone else needs to do the same thing, here is a 3.11.6 version https://github.com/gilganz/twcs/tree/cassandra-3.11 Indeed switching between twcs strategies did not trigger any sstable change. Jeff, nothing to be sorry about, gave me an opportun

Remove from listserv

2020-07-13 Thread Marie-Anne
Remove me from Cassandra listserv. maharkn...@comcast.net

RE: Running Large Clusters in Production

2020-07-13 Thread Durity, Sean R
I’m curious – is the scaling needed for the amount of data, the amount of user connections, throughput or what? I have a 200ish cluster, but it is primarily a disk space issue. When I can have (and administer) nodes with large disks, the cluster size will shrink. Sean Durity From: Isaac Reath

Re: Running Large Clusters in Production

2020-07-13 Thread Isaac Reath (BLOOMBERG/ 919 3RD A)
Thanks for the info Jeff, all very helpful! From: user@cassandra.apache.org At: 07/11/20 12:30:36To: user@cassandra.apache.org Subject: Re: Running Large Clusters in Production Gossip related stuff eventually becomes the issue For example, when a new host joins the cluster (or replaces a faile