Hi guys, Thanks for your reply. It's very helpful. I agree with Plotnik on the scaling part.
For the business logic, it sounds obvious that "it make sense to divide, i.e. metadata and really BIG data into different clusters, of course." as you mentioned. But after I think about it a bit more, what is the real reason for that if the cluster can be scaled horizontally? Each node still has the same amount of data, what is the benefit of having a separate cluster? For analytics application, it has to be on a separate cluster as Paulo pointed out. But if all of use cases are for web application (as of now) what are the drawbacks to put everything into one big cluster? Thanks. -Wei On Monday, October 14, 2013 4:15 AM, Paulo Motta <pauloricard...@gmail.com> wrote: By clusters do you mean data centers? If so, I think it depends on your use case and application requirements. For instance, if you have a web application and and analytics application (hadoop), you would want to separate your cluster in 2 different data centers (even if they're located in the same physical zone). If it's just one application then you can start with one data center and add more data centers later if needed. 2013/10/14 Plotnik, Alexey <aplot...@rhonda.ru> If you are talking about scaling: Cassandra scaling is absolutely horizontal without namenodes or other Mongo-bulshit-like intermediate daemons. And that’s why one big cluster has the same throughput as many smaller clusters. >What will you do when your small clusters will exceed it’s capacity? Cassandra >is designed for very large data so feel free to utilize it’s capabilities. > >If you are talking in terms of business logic: it make sense to divide, i.e. >metadata and really BIG data into different clusters, of course. > >From:Wz1975 [mailto:wz1...@yahoo.com] >Sent: 14 октября 2013 г. 7:20 >To: user@cassandra.apache.org > >Subject: Re: one big cluster vs multiple smaller clusters >Importance: Low > >we have choices of making one big cluster vs a few small clusters. I am trying >to get pros and cons for both options in genera. > > >Thanks. >-Wei > >Sent from my Samsung smartphone on AT&T > > >-------- Original message -------- >Subject: Re: one big cluster vs multiple smaller clusters >From: Jon Haddad <j...@jonhaddad.com> >To: user@cassandra.apache.org >CC: > > >This is a pretty vague question. What are you trying to achieve? > >On Oct 12, 2013, at 9:05 PM, Wei Zhu <wz1...@yahoo.com> wrote: > > > >Hi, >As we bring more use cases to Cassandra, we have been thinking about the best >way to host it. Let's say we will have 15 physical machines available, we can >use all of them to form a big cluster or divide them into 3 clusters with 5 >nodes each. As we will deploy to 1.2, it becomes easier to expand the cluster >with vnodes. I really don't see any good reasons to make 3 smaller clusters. >Did I miss anything obvious? > >Thanks. >-Wei > -- Paulo Ricardo -- European Master in Distributed Computing Royal Institute of Technology - KTH Instituto Superior Técnico - IST http://paulormg.com