Re: Cassandra crashes after loading data with sstableloader

2018-07-29 Thread Rahul Singh
What does the “hash” data look like? Rahul On Jul 24, 2018, 11:30 AM -0400, Arpan Khandelwal wrote: > I need to clone data from one keyspace to another keyspace. > We do it by taking a snapshot of keyspace1 and restoring it in keyspace2 using > sstableloader. > > Suppose we have the following table with ind

Re: Cassandra crashes after loading data with sstableloader

2018-07-29 Thread Jeff Jirsa
What’s the cardinality of hash? Do they have the same schema? If so you may be able to take a snapshot and hardlink it in / refresh instead of sstableloader. Alternatively you could drop the index from the destination keyspace and add it back in after the load finishes. How big are the sstabl
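A rough sketch of the hardlink-and-refresh route suggested above, for anyone who wants to try it. This is an illustration under assumptions, not a tested procedure: the helper names `hardlink_snapshot` and `refresh_command` are made up for this example, the directory paths are whatever your install uses, and it assumes both tables have the same schema, the files sit on the same filesystem, and the SSTable file names do not embed the keyspace name.

```python
import os
from pathlib import Path

# Snapshot metadata files that are not SSTable components (assumed names;
# check what your Cassandra version actually writes into the snapshot dir).
SKIP_NAMES = {"manifest.json", "schema.cql"}


def hardlink_snapshot(snapshot_dir, dest_table_dir):
    """Hard-link the regular files of a snapshot into a destination table dir.

    Returns the list of file names that were linked. Subdirectories (for
    example per-index directories) and the metadata files above are skipped.
    Refuses to overwrite a file that already exists in the destination.
    """
    dest = Path(dest_table_dir)
    linked = []
    for src in sorted(Path(snapshot_dir).iterdir()):
        if not src.is_file() or src.name in SKIP_NAMES:
            continue
        target = dest / src.name
        if target.exists():
            raise FileExistsError(f"{target} already exists; rename or clear first")
        os.link(src, target)  # hard link: no data copy, same filesystem only
        linked.append(src.name)
    return linked


def refresh_command(keyspace, table):
    """Build the nodetool invocation that loads newly placed SSTables."""
    return ["nodetool", "refresh", keyspace, table]
```

After linking on each node, the command returned by `refresh_command("keyspace2", "<table>")` would be run on that node (for instance via `subprocess.run`). The other option mentioned, dropping the index on the destination and recreating it after the load, needs no file handling at all.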

Re: cassandro nodes restarts

2018-07-29 Thread Rahul Singh
You need to review Java GC, system, network, disk, memory, node, and table statistics. A lot can be discerned from visually examining the charts, e.g. whether the failing node is the one with the most local reads, the one with the most writes, or completely unrelated. Since it’s a distributed syst

Re: optimization to cassandra-env.sh

2018-07-29 Thread Rahul Singh
It depends on which GC you are using, but you can definitely manage GC, although you will always be bound by the upper limit of memory. I found the HubSpot GC visualizer and the associated blog post very helpful in the past. https://github.com/HubSpot/gc_log_visualizer/blob/master/README.md https://p

Re: Data model storage optimization

2018-07-29 Thread Rahul Singh
How many rows on average per partition? Let me get this straight: you are bifurcating your partitions on either email or username, essentially potentially doubling the data, because you don’t have a way to manage a central system of record of users? I would do this (my opinion): Migrate to a

Server kernal Parameters for cassandra

2018-07-29 Thread rajasekhar kommineni
Hello, Do we have any standard values for server kernel parameters to run Cassandra? Please share some insight. Thanks,

Fwd: Re: Data model storage optimization

2018-07-29 Thread onmstester onmstester
How many rows on average per partition? Around 10K. Let me get this straight: you are bifurcating your partitions on either email or username, essentially potentially doubling the data, because you don’t have a way to manage a central system of record of users? We are just analyzing output log

Re: Auto Compactions not running on Cassandra 3.10

2018-07-29 Thread Anshul Rathore
Thanks, Jeff, for your response, and apologies for such a delayed reply; I had a personal emergency. The following is the config we are using for that table: PRIMARY KEY ((customer_app_prefix, customer_session_id), beacon_client_type, sim_created_at) ) WITH CLUSTERING ORDER BY (beacon_client_