Hey all, I've been experimenting with Cassandra on a small scale and in my own sandbox for a while now. I'm pretty used to working with it to get small clusters up and running and gossiping with each other.
But I just had a new project at work drop into my lap that requires a NoSQL data store. And the developers have selected... you guessed it! Cassasndra as their back end database. So I'll be asked to setup a 6 node cluster all hosted in one data center. I want to just make sure that I understand the concept of seeds correctly. I think since we'll be dealing with 6 nodes, what I'll want to do is have 2 seeds. And have each seed seeing each other as it's own seed. Then the other 2 nodes in each sub-group will have the IP for it's seed on each of it's cassandra.yml files. Then I'll want to set the replication factor to 5. Since it'll be the total number of nodes -1. I just want to make sure I have all that right. Another thing that will have to happen is that I will need to connect Cassandra into a 4 node ElasticSearch cluster. I think there are a few options for doing that. I've seen names like Titan and Gremlin. And I was wondering if anyone has any recommendations there. And lastly I'd like to point out that I know literally nothing about the data that will be stored there just as of yet. The first meeting about the project will be tomorrow. My manager gave me an advanced heads up about what will be required. Thank you, Tim -- GPG me!! gpg --keyserver pool.sks-keyservers.net --recv-keys F186197B