Disc size for cluster

2017-01-26 Thread Raphael Vogel
Hi Just want to validate my estimation for a C* cluster which should have around 3 TB of usable storage. Assuming a RF of 3 and SizeTiered Compaction Strategy. Is it correct, that SizeTiered Compaction Strategy needs (in the worst case) 50% free disc space during compaction?   So this would

Re: Disc size for cluster

2017-01-26 Thread Benjamin Roth
Hi! This is basically right, but: 1. How do you know the 3TB storage will be 3TB on cassandra? This depends how the data is serialized, compressed and how often it changes and it depends on your compaction settings 2. 50% free space on STCS is only required if you do a full compaction of a single

Re: Disc size for cluster

2017-01-26 Thread Anuj Wadehra
Adding to what Benjamin said.. It is hard to estimate disk space if you are using STCS for a table where rows are updated frequently leading to lot of fragmentation. STCS may also lead to scenarios where tombstones are not evicted for long times. You may go live and everything goes well for mont

Re: Expensive to run nodetool status often?

2017-01-26 Thread Eric Evans
On Wed, Jan 25, 2017 at 11:20 AM, Xiaolei Li wrote: > Thanks for the advice! > > I do export a lot via JMX already. But I couldn't find the equivalent of the > Status column (Up/Down + Normal/Leaving/Joining/Moving) from the status > output. Does anyone know if those are available via JMX? I've b

Re: Expensive to run nodetool status often?

2017-01-26 Thread Jonathan Haddad
Very cool! On Thu, Jan 26, 2017 at 8:53 AM Eric Evans wrote: > On Wed, Jan 25, 2017 at 11:20 AM, Xiaolei Li > wrote: > > Thanks for the advice! > > > > I do export a lot via JMX already. But I couldn't find the equivalent of > the > > Status column (Up/Down + Normal/Leaving/Joining/Moving) from

Cassandra ad hoc search options

2017-01-26 Thread Yu, John
Hi All, Hope I can get some help here. We're using Cassandra for services, and recently we're adding UI support. With Cassandra, what are the options for ad hoc query/search similar to RDBMS? We love the features of Cassandra but it seems it's a known "weakness" that it doesn't come with strong

Re: Cassandra ad hoc search options

2017-01-26 Thread Jonathan Haddad
> With Cassandra, what are the options for ad hoc query/search similar to RDBMS? Your best options are Spark w/ the DataStax connector or Presto. Cassandra isn't built for ad-hoc queries so you need to use other tools to make it work. On Thu, Jan 26, 2017 at 2:22 PM Yu, John wrote: > Hi All, >

Re: Expensive to run nodetool status often?

2017-01-26 Thread Xiaolei Li
Nice! Will take a look. Best, x. On Thu, Jan 26, 2017 at 10:30 AM, Jonathan Haddad wrote: > Very cool! > > On Thu, Jan 26, 2017 at 8:53 AM Eric Evans > wrote: > >> On Wed, Jan 25, 2017 at 11:20 AM, Xiaolei Li >> wrote: >> > Thanks for the advice! >> > >> > I do export a lot via JMX already. B

RE: [External] Re: Cassandra ad hoc search options

2017-01-26 Thread Yu, John
Thanks a lot. Mind sharing a couple of points where you feel it’s better than the alternatives. Regards, John From: Jonathan Haddad [mailto:j...@jonhaddad.com] Sent: Thursday, January 26, 2017 2:33 PM To: user@cassandra.apache.org Subject: [External] Re: Cassandra ad hoc search options > With C