Re: Query With Limit Clause

2018-11-07 Thread shalom sagges
Thanks a lot for the info :) On Tue, Nov 6, 2018 at 11:11 AM DuyHai Doan wrote: > Cassandra will execute such request using a Partition Range Scan. > > See more details here http://www.doanduyhai.com/blog/?p=13191, chapter E > Cluster Read Path (look at the formula of Concurrency Factor) > > > >

Cassandra 2.1 bootstrap - No streaming progress from one node

2018-11-07 Thread Steinmaurer, Thomas
Hello, while bootstrapping a new node into an existing cluster, a node which is acting as source for streaming got restarted unfortunately. Since then, from nodetool netstats I don't see any progress for this particular node anymore. E.g.: /X.X.X.X Receiving 94 files, 260.09 GB total.

Cassandra crashed during major compaction

2018-11-07 Thread Gabriel Giussi
After a bulk load of writes to existing partition keys (with a higher timestamp), I wanted to free disk space, suspecting that rows will be in the highest levels and it would take a time until they were compacted. I've started a major compaction, and the disk usage went from ~30% to ~40% (as expect

RE: Cassandra 2.1 bootstrap - No streaming progress from one node

2018-11-07 Thread Durity, Sean R
I would wipe the new node and bootstrap again. I do not know of any way to resume the streaming that was previously in progress. Sean Durity From: Steinmaurer, Thomas Sent: Wednesday, November 07, 2018 5:13 AM To: user@cassandra.apache.org Subject: [EXTERNAL] Cassandra 2.1 bootstrap - No stream

Re: Multiple cluster for a single application

2018-11-07 Thread Eric Stevens
We are engaging in both strategies at the same time: 1) We call it functional sharding - we write to clusters targeted according to the type of data being written. Because different data types often have different workloads this has the nice side effect of being able to tune each cluster accordin

Re: Multiple cluster for a single application

2018-11-07 Thread Jonathan Haddad
Interesting approach Eric, thanks for sharing that. Regarding this: > I've read documents recommended to use clusters with less than 50 or 100 nodes (Netflix got hundreds of clusters with less 100 nodes on each). Not sure where you read that, but it's nonsense. We work with quite a few clusters

Re: Multiple cluster for a single application

2018-11-07 Thread Ben Slater
I tend to recommend an approach similar to Eric’s functional sharding although I describe it at quality of service sharding - group your small, hot data into one cluster and your large, cooler data into another so you can provision infrastructure and tune according. I guess it depends on you manage