Any excellent tutorials or automated scripts for cluster setup on EC2?

2016-01-27 Thread Richard L. Burton III
I'm curious to see if there are automated scripts or tutorials on setting up Cassandra on EC2 with security taken care of, etc. Thanks, -- -Richard L. Burton III @rburton

Re: why one of the new added nodes' bootstrap is very slow?

2016-01-27 Thread 土卜皿
Hi Alain, Thank you very much! > UJ 192.21.0.185 299.22 GB 256 ? 84c0dd16-6491-4bfb-b288-d4e410cd8c2a RAC1 > UN 192.21.0.184 670.14 MB 256 ? 4041c232-c110-4315-89a1-23ca53b851c2 RAC1 > Obviously .184 didn't bootstrap correctly. When a node is added, it

Re: Production with Single Node

2016-01-27 Thread Nikolay Mihaylov
Hi, we have 2-3 installations with single-node Cassandra. They are working fine, with no problems, except that if Cassandra stops, everything stops. Even on one node, we are usually "rolling" 500-600 GB of data, sometimes even 2-3 TB. We use a mostly standard configuration with almost no changes. Here are

Re: Rename Keyspace offline

2016-01-27 Thread Robert Coli
On Wed, Jan 27, 2016 at 6:49 AM, Jean Tremblay <jean.tremb...@zen-innovations.com> wrote: > Since it takes me 2 days to load my data, I was planning to load the new set on a new keyspace (KS-Y), and when loaded drop KS-X and rename KS-Y to KS-X. Why bother with the rename? Just have two ke

Logging configuration (security)

2016-01-27 Thread oleg yusim
Greetings, I decided to put together a separate thread with the logging configuration questions I have (I'm trying to figure out which security best practices on logging Cassandra can and can't support): 1) Can Cassandra log the IP and hostname of the host the DB resides on? 2) Can Cassandra log IP and hostn

Re: Embedded cassandra

2016-01-27 Thread Jack Krupansky
And do share your conclusions with us, so others can benefit. -- Jack Krupansky On Wed, Jan 27, 2016 at 1:28 AM, Enrico Olivelli wrote: > Thank you all for your feedback. > My team and I will take all these suggestions into account. > > Cheers > Enrico > > On Tue, 26 Jan 2016 at 23:51, Jack K

Re: Need Column Family Schema Suggestion

2016-01-27 Thread Jack Krupansky
If the goal is to maximize performance/throughput, you need to ensure that data is as contiguous as possible. In other words, you want to be able to ask Cassandra for a slice of consecutive rows rather than require slow and expensive scanning. Typically this means careful attention to partition keys so that the data

Re: Logging

2016-01-27 Thread oleg yusim
Sam, Paulo, One more question on logging. Can I add the IP and hostname to the log message? If it is possible, can you give me an example of how I would need to change %-5level %date{HH:mm:ss,SSS} %msg%n to add this information? Thanks, Oleg On Tue, Jan 26, 2016 at 4:42 PM, oleg yusim wrote: > Sam

Re: Rename Keyspace offline

2016-01-27 Thread Jack Krupansky
If you are doing this full bulk reload a lot, it may make more sense to use a separate cluster to bring up the new data and then atomically switch your clients/apps to the IP address of the new cluster once you've validated it, and then decommission and recycle the machines of the old cluster. This

Re: Rename Keyspace offline

2016-01-27 Thread Alain RODRIGUEZ
> > 3.1) rm -r data/KS-X > 3.2) mv data/KS-Y data/KS-X This won't work: sstable names contain the keyspace name. I had this issue too (I wanted to split a keyspace into multiple ones, use the occasion to rename tables, etc.). I finally ended up writing a small Python script there: https://github.co
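To illustrate why a plain `mv` of the directory is not enough, here is a minimal sketch of the file-name rewrite that would also be needed. It is not the script linked above. It assumes the legacy layout in which each sstable file name starts with `<keyspace>-<table>-`; the function name `renamed_sstable` and the sample names `ks_y`, `ks_x` and `users` are hypothetical, and the code only computes names without touching the disk. Check how your Cassandra version names its files, and whether renaming files is sufficient at all, before relying on anything like this.

```python
def renamed_sstable(filename: str, old_keyspace: str, new_keyspace: str) -> str:
    """Return the file name with the leading keyspace prefix swapped.

    Assumes (hypothetically) names of the form
    <keyspace>-<table>-<version>-<generation>-<component>.
    """
    prefix = old_keyspace + "-"
    if not filename.startswith(prefix):
        # Refuse to guess when the file does not follow the assumed layout.
        raise ValueError(f"{filename!r} does not start with {prefix!r}")
    return new_keyspace + "-" + filename[len(prefix):]


if __name__ == "__main__":
    # Only the keyspace prefix changes; the rest of the name is kept.
    print(renamed_sstable("ks_y-users-ka-1-Data.db", "ks_y", "ks_x"))
```

Every component file of every sstable would need the same treatment, which is why moving the directory alone leaves the names inconsistent.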

Re: Rename Keyspace offline

2016-01-27 Thread Jonathan Haddad
Why rename the keyspace? If it was me I'd just give it a name that includes the date or some identifier and include that logic in my app. That's way easier. On Wed, Jan 27, 2016 at 6:49 AM Jean Tremblay < jean.tremb...@zen-innovations.com> wrote: > Hi, > > I have a huge set of data, which takes ab
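The "name that includes the date" idea could look roughly like the sketch below. The helper names `dated_keyspace` and `latest_keyspace` and the base name `ks` are assumptions made for illustration, not something from the thread. The sketch relies on keyspace names being limited to alphanumeric characters and underscores with a 48-character maximum, and it conservatively requires a leading letter. How the application obtains the list of existing keyspaces is left out.

```python
import re
from datetime import date

MAX_KEYSPACE_NAME_LENGTH = 48  # Cassandra's limit on keyspace name length


def dated_keyspace(base: str, load_date: date) -> str:
    """Build a keyspace name such as 'ks_20160127' for a given load date."""
    name = f"{base}_{load_date:%Y%m%d}".lower()
    # Conservative check: a letter first, then letters, digits or underscores.
    if not re.fullmatch(r"[a-z][a-z0-9_]*", name):
        raise ValueError(f"invalid keyspace name: {name!r}")
    if len(name) > MAX_KEYSPACE_NAME_LENGTH:
        raise ValueError(f"keyspace name too long: {name!r}")
    return name


def latest_keyspace(base: str, existing: list[str]) -> str:
    """Pick the most recent dated keyspace for `base` from a list of names."""
    pattern = re.compile(rf"{re.escape(base.lower())}_\d{{8}}")
    candidates = [name for name in existing if pattern.fullmatch(name)]
    if not candidates:
        raise LookupError(f"no dated keyspace found for {base!r}")
    # YYYYMMDD suffixes sort lexicographically in date order.
    return max(candidates)


if __name__ == "__main__":
    print(dated_keyspace("ks", date(2016, 1, 27)))
    print(latest_keyspace("ks", ["ks_20160120", "ks_20160127", "other_20160201"]))
```

With this scheme the new data set is loaded into a fresh dated keyspace, clients switch to whatever `latest_keyspace` returns once the load has been validated, and the old keyspace can be dropped afterwards, so no rename is needed.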

Re: why one of the new added nodes' bootstrap is very slow?

2016-01-27 Thread Alain RODRIGUEZ
Hi Dillon, 2 emails again for the same issue, just saying :-). I'll add something I forgot when answering the other email: > UJ 192.21.0.185 299.22 GB 256 ? 84c0dd16-6491-4bfb-b288-d4e410cd8c2a RAC1 > UN 192.21.0.184 670.14 MB 256 ? 4041c232-c110-4315-89a1-23ca53b851c2 RAC1
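The comparison being made by eye here (670.14 MB on .184 against 299.22 GB on .185) can be sketched as a small check over `nodetool status` lines. This is only an illustration: the function names, the unit table and the 10% threshold are assumptions, and the parser only handles data lines shaped like the ones quoted in this thread (state, address, load, unit, then the remaining columns). A low load is a hint to look at the node, not proof that bootstrap failed.

```python
# Assumed unit spellings; both older (KB) and newer (KiB) styles are accepted.
UNITS = {
    "bytes": 1,
    "KB": 1024, "MB": 1024**2, "GB": 1024**3, "TB": 1024**4,
    "KiB": 1024, "MiB": 1024**2, "GiB": 1024**3, "TiB": 1024**4,
}


def parse_status_line(line: str) -> dict:
    """Extract state, address and load (in bytes) from one status data line."""
    fields = line.split()
    state, address, size, unit = fields[0], fields[1], float(fields[2]), fields[3]
    return {"state": state, "address": address, "load_bytes": size * UNITS[unit]}


def underloaded(lines: list[str], ratio: float = 0.1) -> list[str]:
    """Return addresses whose load is below `ratio` times the largest load."""
    nodes = [parse_status_line(line) for line in lines]
    largest = max(node["load_bytes"] for node in nodes)
    return [n["address"] for n in nodes if n["load_bytes"] < ratio * largest]


if __name__ == "__main__":
    status = [
        "UJ 192.21.0.185 299.22 GB 256 ? 84c0dd16-6491-4bfb-b288-d4e410cd8c2a RAC1",
        "UN 192.21.0.184 670.14 MB 256 ? 4041c232-c110-4315-89a1-23ca53b851c2 RAC1",
    ]
    print(underloaded(status))
```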

Re: About cassandra's reblance when adding one or more nodes into the existed cluster?

2016-01-27 Thread Alain RODRIGUEZ
> > I am so sorry for this issue! I should not use the command "nodetool move" > because I set "num_tokens: 256" in every node's cassandra.yaml. > I was guessing this was the issue. > So I restarted it and the join continued! I don't know why there is the > difference between the two nodes? > My

Rename Keyspace offline

2016-01-27 Thread Jean Tremblay
Hi, I have a huge set of data, which takes about 2 days to bulk load on a Cassandra 3.0 cluster of 5 nodes. That is about 13 billion rows. Quite often I need to reload this data, because the structure is new or the data has been reorganised. There are clients reading from a given keyspace (KS-X). Since it takes me 2 d

why one of the new added nodes' bootstrap is very slow?

2016-01-27 Thread 土卜皿
Hi, Cassandra version: 2.1.11. The existing cluster has three nodes: [root@report-02 cassandra]# bin/nodetool status UN 192.21.0.135 120.85 GB 512 ? 11e1e80f-9c5f-4f7c-81f2-42d3b704d8e3 RAC1 UN 192.21.0.133 129.13 GB 512 ? 3e662ccb-fa2b-427b-9ca1-c2d3468bfbc9 RAC1 UN 192.21.0.131