Thank you Sean and Yakir. Is 4.x the same?
So if you were to build a 1PByte system, you would want 512-1024 nodes?
That doesn't seem space-efficient versus, say, 48TByte nodes, where you
would need ~21 machines.
What would you do to build a 1PByte configuration? I know there is a
lot of "it depends" in that question, but say it was a write-heavy,
light-read setup. Thank you!
-Joe
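
The node counts in the question come from dividing the target capacity by the per-node data size. Below is a minimal sketch of that arithmetic, assuming 1 PByte = 1024 TByte and ignoring replication factor and compaction headroom (both would raise the real count). The helper name `nodes_needed` is made up for illustration.

```python
import math

TB_PER_PB = 1024  # assumption: binary units, 1 PByte = 1024 TByte


def nodes_needed(total_tb: float, per_node_tb: float) -> int:
    """Smallest number of nodes whose combined data capacity covers total_tb."""
    if per_node_tb <= 0:
        raise ValueError("per_node_tb must be positive")
    return math.ceil(total_tb / per_node_tb)


if __name__ == "__main__":
    total = 1 * TB_PER_PB
    # Per-node sizes mentioned in this thread: 1-2 TB, 3.5 TB, 48 TB
    for size in (1, 2, 3.5, 48):
        print(f"{size} TB/node -> {nodes_needed(total, size)} nodes")
```

Because the function rounds up, 48 TByte nodes come out as 22 rather than the ~21 quoted above. Multiply the total by the replication factor to get a more realistic figure.
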
On 1/20/2021 10:06 AM, Durity, Sean R wrote:
Yakir is correct. While it is feasible to have large disk nodes, the
practical aspect of managing them is an issue. With the current
technology, I do not build nodes with more than about 3.5 TB of disk
available. I prefer 1-2 TB, but costs/number of nodes can change the
considerations.
Putting more than 1 node of Cassandra on a given host is also
possible, but you will want to consider your availability if that
hardware goes down. Losing 2 or more nodes with one failure is usually
not good.
NOTE: DataStax has some new features for supporting much larger disks
and alleviating many of the admin pains associated with it. I don’t
have personal experience with it, yet, but I will be testing it soon.
In my understanding it is for use cases with massive needs for disk,
but low to moderate throughput (i.e., where node expansion is only for
disk, not additional traffic).
Sean Durity
*From:* Yakir Gibraltar <yaki...@gmail.com>
*Sent:* Wednesday, January 20, 2021 9:21 AM
*To:* user@cassandra.apache.org
*Subject:* [EXTERNAL] Re: Node Size
It is possible to use large nodes, and it will work. The problems with
large nodes will be:
* Maintenance such as joining/removing nodes will take more time.
* Larger heap
* etc.
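
To give a feel for the first bullet: the time to join or replace a node grows roughly linearly with the data it has to stream. Here is a rough sketch, assuming streaming is the only bottleneck and runs at a steady rate. The helper `stream_hours` is hypothetical, and the 200 Mbit/s figure is only an illustrative input; check the streaming throughput setting in your own cassandra.yaml.

```python
def stream_hours(data_tb: float, throughput_mbit_s: float) -> float:
    """Hours to move data_tb (decimal terabytes) at throughput_mbit_s megabits/s."""
    if throughput_mbit_s <= 0:
        raise ValueError("throughput_mbit_s must be positive")
    bits = data_tb * 1e12 * 8                    # terabytes -> bits
    seconds = bits / (throughput_mbit_s * 1e6)   # megabits/s -> bits/s
    return seconds / 3600


if __name__ == "__main__":
    # Assumed per-node data sizes; 200 Mbit/s is an illustrative rate only
    for size_tb in (1, 2, 48):
        print(f"{size_tb} TB -> {stream_hours(size_tb, 200):.1f} hours")
```

Under these assumptions a 1 TB node takes about 11 hours and a 48 TB node about 22 days. Real rebuilds can be faster (streaming from several peers) or slower (compaction, network contention), so treat this only as an order-of-magnitude guide.
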
On Wed, Jan 20, 2021 at 3:54 PM Joe Obernberger
<joseph.obernber...@gmail.com> wrote:
Anyone know where I could find out more information on this?
Thanks!
-Joe
On 1/13/2021 8:42 AM, Joe Obernberger wrote:
> Reading the documentation on Cassandra 3.x, there are recommendations
> that node size should be ~1TByte of data. Modern servers can have 24
> SSDs, each at 2TBytes in size for data. Is that a bad idea for
> Cassandra? Does 4.0beta4 handle larger nodes?
> We have machines that have 16 8TByte SATA drives - would that be a
> bad server for Cassandra? Would it make sense to run multiple copies
> of Cassandra on the same node in that case?
>
> Thanks!
>
> -Joe
>
--
*Best regards,*
*Yakir Gibraltar*