Re: snapshot strategy?

2018-11-05 Thread Lou DeGenaro
The issue really is how to manage disk space. It is certainly possible to take snapshots by name and delete them by name, perhaps one for each day of the week. But how do you clear the automatic ones (e.g. names unknown) without clearing the named ones? Thanks. Lou. On Fri, Nov 2, 2018 at 12:2

Multiple cluster for a single application

2018-11-05 Thread onmstester onmstester
Hi, One of my applications requires to create a cluster with more than 100 nodes, I've read documents recommended to use clusters with less than 50 or 100 nodes (Netflix got hundreds of clusters with less 100 nodes on each). Is it a good idea to use multiple clusters for a single application, ju

Info about sstableloader

2018-11-05 Thread Kalyan Chakravarthy
Hi, I’m new to Cassandra, please help me with sstableloader. Thank you in advance. I’m trying to migrate data between two clusters which are on different networks. Migrating data from ‘c1’ to ‘c2’ Which one will be the source and which one will be destination?? And where should I run sstablel

Exception when running sstableloader

2018-11-05 Thread Kalyan Chakravarthy
I’m trying to migrate data between two clusters on different networks. Ports: 7001,7199,9046,9160 are open between them. But port:7000 is not open. When I run sstableloader command, got the following exception. Command: :/a/cassandra/bin# ./sstableloader -d 192.168.98.99/abc/cassandra/data/app

Compacting more than the actual used space

2018-11-05 Thread Pedro Gordo
Hi We have an ongoing compaction for roughly 2.5 TB, but "nodetool status" reports a load of 1.09 TB. Even if we take into account that the load presented by "nodetool status" is the compressed size, I very much doubt that compression would work to reduce from 2.5 TB to 1.09. We can also take into

Re: Compacting more than the actual used space

2018-11-05 Thread Alexander Dejanovski
You can check cfstats to see what's the compression ratio. It's totally possible to have the values you're reporting as a compression ratio of 0.2 is quite common depending on the data you're storing (compressed size is then 20% of the original data). Compaction throughput changes are taken into a

Re: Compacting more than the actual used space

2018-11-05 Thread Pedro Gordo
Hi Alexander Thanks. Using the compression ratio, the sizes check out. Regarding the new values for compaction throughput, that explains it then. We are using 2.1. :-) Cheers Pedro Gordo On Mon, 5 Nov 2018 at 19:53, Alexander Dejanovski wrote: > You can check cfstats to see what's the compre

Re: snapshot strategy?

2018-11-05 Thread Alain RODRIGUEZ
Hello Lou, how do you clear the automatic ones (e.g. names unknown) without clearing > the named ones? > The option '-t' might be what you are looking for: 'nodetool clearsnapshot -t nameOfMySnapshot'. >From the documentation here: http://cassandra.apache.org/doc/latest/tools/nodetool/clearsnaps

Re: snapshot strategy?

2018-11-05 Thread Lou DeGenaro
Alain, Thanks for the suggestion, but I think I did not make myself clear. In order to utilize disk space efficiently, we want to keep snapshots that are no older than X days old while purging the older ones. My understanding is that there are 2 kinds of snapshots : (a) those created on demand

Query With Limit Clause

2018-11-05 Thread shalom sagges
Hi All, If I run for example: select * from myTable limit 3; Does Cassandra do a full table scan regardless of the limit? Thanks!