[
https://issues.apache.org/jira/browse/CASSANDRA-20233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andy Tolbert updated CASSANDRA-20233:
-------------------------------------
Resolution: Fixed
Status: Resolved (was: Triage Needed)
I'm closing this out as a fixed by [CASSANDRA-20421] as we added a section
{{Enabling Incremental Repair on existing clusters with a large amount of
data}} in auto_repair.adoc and also refer to various configuration such as
{{reject_repair_compaction_threshold}} and
{{incremental_repair_disk_headroom_reject_ratio}} in the guide.
> Add guidance on enabling Incremental Repair using AutoRepair on an existing
> data set
> ------------------------------------------------------------------------------------
>
> Key: CASSANDRA-20233
> URL: https://issues.apache.org/jira/browse/CASSANDRA-20233
> Project: Apache Cassandra
> Issue Type: Improvement
> Components: Documentation
> Reporter: Andy Tolbert
> Assignee: Andy Tolbert
> Priority: Normal
>
> CASSANDRA-20184 added an overview document of AutoRepair and some guidance in
> cassandra.yaml and how to tune it.
> Granted a cluster is not massively out of sync, I would expect one could turn
> on AutoRepair for full repair and the defaults would do a relatively good job
> at not overwhelming a cluster.
> For incremental repair on the other hand, while AutoRepair does its best to
> tune it out of the box to reduce impact, there are still a bunch of
> considerations to tune it effectively on an existing cluster with data.
> There is some existing guidance on enabling incremental repair for an
> existing cluster in cassandra.yaml:
> {{When turning on incremental repair for the first time with a decent amount
> of data it may be advisable to increase this interval to 24h or longer to
> reduce the impact of anticompaction caused by incremental repair.}}
> There are enough considerations for enabling incremental repair that it's
> worth covering it in detail in its own section. The following come to mind.
> # Define what anticompaction is and how it should impact how you tune auto
> repair's incremental repair overrides. For example, one might thing that
> reducing the {{max_bytes_per_schedule}} would be an intuitive configuration,
> but this could possibly cause a lot of anticompaction for large SSTables.
> # Define what the repaired and unrepaired data set means.
> # Cover how compaction may interact with incremental repair.
> LeveledCompactionStrategy tends to be better suited than SizeTieredCompaction
> for incremental repair because partitions tend to only exist in 1 SSTable per
> level, and fixed sized SSTables reduce the possible impact of anticompaction.
> Consider adding some guidance for UnifiedCompactionStrategy.
> # Reference other properties that might act as good guardrails, e.g.:
> {{auto_repair.sstable_upper_threshold}} and
> {{{}reject_repair_compaction_threshold{}}}.
> # Reference metrics that are worth monitoring. ({{{}PercentRepaired{}}},
> {{{}BytesAnticompacted{}}}, {{{}BytesMutatedAnticompaction{}}},
> {{{}AnticompactionTime{}}}.
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]