hi, Thank you for your answers, starting with the most important point from your answers I understand that
"it is OK to go more than 1 TB in disk usage" so in this case if I am going to use the 50% of the disk capacity I will end up having around 3 TB per node which in this case I will not need to use a docker solution which is a very good usa case for us. The goal of my setup is to save large data volumes in every node (~ 3 TB - 50% usage of HD) with the current hardware that we possess. The high availability I consider it standard since we are going to have 2 DCs with RF3. I also have to note that Datastax also recommends usage no more than 500 GB - 1 TB. Cheers, Vasilis Sent with [ProtonMail](https://protonmail.com) Secure Email. ‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐ On Thursday, April 18, 2019 6:56 PM, Jacques-Henri Berthemet <jacques-henri.berthe...@genesys.com> wrote: > So how much data can you safely fit per node using SSDs with Cassandra 3.11? > How much free space do you need on your disks? > > There should be some recommendations on node sizes on: > > http://cassandra.apache.org/doc/latest/operating/hardware.html > > [Documentation - Apache > Cassandra](http://cassandra.apache.org/doc/latest/operating/hardware.html) > cassandra.apache.org > The Apache Cassandra database is the right choice when you need scalability > and high availability without compromising performance. Linear scalability > and proven fault-tolerance on commodity hardware or cloud infrastructure make > it the perfect platform for mission-critical data. Cassandra's support for > replicating across multiple datacenters is best-in-class, providing lower > latency for your ... > > --------------------------------------------------------------- > > From: Jon Haddad <j...@jonhaddad.com> > Sent: Thursday, April 18, 2019 6:43:15 PM > To: user@cassandra.apache.org > Subject: Re: [EXTERNAL] multiple Cassandra instances per server, possible? > > Agreed with Jeff here. The whole "community recommends no more than > 1TB" has been around, and inaccurate, for a long time. > > The biggest issue with dense nodes is how long it takes to replace > them. 4.0 should help with that under certain circumstances. > > On Thu, Apr 18, 2019 at 6:57 AM Jeff Jirsa <jji...@gmail.com> wrote: >> >> Agreed that you can go larger than 1T on ssd >> >> You can do this safely with both instances in the same cluster if you >> guarantee two replicas aren’t on the same machine. Cassandra provides a >> primitive to do this - rack awareness through the network topology snitch. >> >> The limitation (until 4.0) is that you’ll need two IPs per machine as both >> instances have to run in the same port. >> >> >> -- >> Jeff Jirsa >> >> >> On Apr 18, 2019, at 6:45 AM, Durity, Sean R <sean_r_dur...@homedepot.com> >> wrote: >> >> What is the data problem that you are trying to solve with Cassandra? Is it >> high availability? Low latency queries? Large data volumes? High concurrent >> users? I would design the solution to fit the problem(s) you are solving. >> >> >> >> For example, if high availability is the goal, I would be very cautious >> about 2 nodes/machine. If you need the full amount of the disk – you *can* >> have larger nodes than 1 TB. I agree that administration tasks (like >> adding/removing nodes, etc.) are more painful with large nodes – but not >> impossible. For large amounts of data, I like nodes that have about 2.5 – 3 >> TB of usable SSD disk. >> >> >> >> It is possible that your nodes might be under-utilized, especially at first. >> But if the hardware is already available, you have to use what you have. >> >> >> >> We have done multiple nodes on single physical hardware, but they were two >> separate clusters (for the same application). In that case, we had a >> different install location and different ports for one of the clusters. >> >> >> >> Sean Durity >> >> >> >> From: William R <tri...@protonmail.com.INVALID> >> Sent: Thursday, April 18, 2019 9:14 AM >> To: user@cassandra.apache.org >> Subject: [EXTERNAL] multiple Cassandra instances per server, possible? >> >> >> >> Hi all, >> >> >> >> In our small company we have 10 nodes of (2 x 3 TB HD) 6 TB each, 128 GB ram >> and 64 cores and we are thinking to use them as Cassandra nodes. From what I >> am reading around, the community recommends that every node should not keep >> more than 1 TB data so in this case I am wondering if it is possible to >> install 2 instances per node using docker so each docker instance can write >> to its own physical disk and utilise more efficiently the rest hardware (CPU >> & RAM). >> >> >> >> I understand with this setup there is the danger of creating a single point >> of failure for 2 Cassandra nodes but except that do you think that is a >> possible setup to start with the cluster? >> >> >> >> Except the docker solution do you recommend any other way to split the >> physical node to 2 instances? (VMWare? or even maybe 2 separate >> installations of Cassandra? ) >> >> >> >> Eventually we are aiming in a cluster consisted of 2 DCs with 10 nodes each >> (5 baremetal nodes with 2 Cassandra instances) >> >> >> >> Probably later when we will start introducing more nodes to the cluster we >> can decommissioning the "double-instaned" ones and aim for a more >> homogeneous solution.. >> >> >> >> Thank you, >> >> >> >> Wil >> >> >> ________________________________ >> >> The information in this Internet Email is confidential and may be legally >> privileged. It is intended solely for the addressee. Access to this Email by >> anyone else is unauthorized. If you are not the intended recipient, any >> disclosure, copying, distribution or any action taken or omitted to be taken >> in reliance on it, is prohibited and may be unlawful. When addressed to our >> clients any opinions or advice contained in this Email are subject to the >> terms and conditions expressed in any applicable governing The Home Depot >> terms of business or client engagement letter. The Home Depot disclaims all >> responsibility and liability for the accuracy and content of this attachment >> and for any damages or losses arising from any inaccuracies, errors, >> viruses, e.g., worms, trojan horses, etc., or other items of a destructive >> nature, which may be contained in this attachment and shall not be liable >> for direct, indirect, consequential or special damages in connection with >> this e-mail message or its attachment. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org > For additional commands, e-mail: user-h...@cassandra.apache.org