We use Ansible to manage a fairly large cluster (200+ nodes). We wrote
our own Ansible playbooks for common tasks, such as rolling restarts. We
also use Cassandra Reaper to schedule and run repairs on the same
cluster. We occasionally also use pssh (parallel SSH) for inspecting the
logs or configurations on selected nodes. Running pssh on a very large
number of servers is not practical, because the output no longer fits
in the available screen space.
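
For anyone who wants to see the shape of a rolling restart without
Ansible, here is a minimal Python sketch. It is not our playbook, only an
illustration of the usual one-node-at-a-time pattern: drain, restart, wait
until the node serves clients again, then move on. The function names
(ssh_run, rolling_restart), the service name "cassandra" and the retry
settings are all assumptions made for the example; nodetool drain and
nodetool statusbinary are standard nodetool subcommands.

```python
import subprocess
import time
from typing import Callable, Iterable, List, Tuple

# A runner takes (host, command) and returns (exit_code, stdout).
Runner = Callable[[str, str], Tuple[int, str]]


def ssh_run(host: str, command: str) -> Tuple[int, str]:
    """Hypothetical helper: run a command on a host over plain ssh."""
    proc = subprocess.run(["ssh", host, command], capture_output=True, text=True)
    return proc.returncode, proc.stdout


def rolling_restart(
    hosts: Iterable[str],
    run: Runner = ssh_run,
    # Assumption: Cassandra runs as a systemd unit named "cassandra".
    restart_cmd: str = "sudo systemctl restart cassandra",
    retries: int = 30,
    delay: float = 10.0,
    sleep: Callable[[float], None] = time.sleep,
) -> List[str]:
    """Restart nodes one at a time; stop at the first node that fails."""
    restarted: List[str] = []
    for host in hosts:
        # Flush memtables and stop accepting writes before the restart.
        code, _ = run(host, "nodetool drain")
        if code != 0:
            raise RuntimeError(f"drain failed on {host}")

        code, _ = run(host, restart_cmd)
        if code != 0:
            raise RuntimeError(f"restart failed on {host}")

        # Poll until the native transport reports "running" again.
        for _ in range(retries):
            code, out = run(host, "nodetool statusbinary")
            if code == 0 and out.strip() == "running":
                break
            sleep(delay)
        else:
            raise RuntimeError(f"{host} did not come back up in time")

        restarted.append(host)
    return restarted
```

Calling rolling_restart(["node1", "node2"]) with hypothetical host names
would process the nodes strictly in order and raise on the first failure,
so a bad config change takes down at most one node. The runner is passed
in as a parameter so the same loop can be dry-run with a fake runner.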
On 28/02/2022 21:59, Joe Obernberger wrote:
Hi all - curious what tools are folks using to manage large Cassandra
clusters? For example, to do tasks such as nodetool cleanup after a
node or nodes are added to the cluster, or simply rolling start/stops
after an update to the config or a new version?
We've used puppet before; is that what other folks are using?
Thanks for any suggestions.
-Joe