Wei-Chiu Chuang created HDDS-13268:
--------------------------------------

             Summary: [Docs] Advanced topics: troubleshooting OM snapshot 
installation
                 Key: HDDS-13268
                 URL: https://issues.apache.org/jira/browse/HDDS-13268
             Project: Apache Ozone
          Issue Type: Task
          Components: OM HA, Ozone Manager
            Reporter: Wei-Chiu Chuang


[https://github.com/apache/ozone/pull/8600#pullrequestreview-2925327285]

Create a new user doc page for "Troubleshooting OM HA snapshot installation 
issues"
{quote}I think it is worth adding how to handle an issue that we encountered 
before: because of a huge OM DB, while the follower was installing the 
snapshot from the leader (i.e. downloading the OM DB), the leader's Raft log 
for that particular snapshot index had already been purged. So after the OM 
follower finished downloading the OM DB, the leader could not send the purged 
logs and instead requested the follower to re-download the OM DB.

Currently, this can be handled by setting the configurations introduced in 
HDDS-8131. So either:
 # Set {{ozone.om.ratis.log.purge.preservation.log.num}} to a high enough value 
(e.g. 1000000) so that the OM leader will not purge the last N logs.
 # Set {{ozone.om.ratis.log.purge.upto.snapshot.index}} to false, which causes 
OM leader logs not to be purged until all the followers have caught up.

You can refer to the ticket for a full explanation and tradeoffs.
{quote}
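
As a rough illustration of the two options quoted above, the fragment below 
sketches how they might look in the OM's {{ozone-site.xml}}. The property 
names and the example value 1000000 come from the quoted review comment; the 
file placement and the need to restart the OMs afterwards are assumptions 
that should be checked against HDDS-8131. Only one of the two properties is 
expected to be needed.
{code:xml}
<!-- Option 1: keep the last N Raft log entries on the OM leader.
     1000000 is the example value from the review comment, not a tuned
     recommendation. -->
<property>
  <name>ozone.om.ratis.log.purge.preservation.log.num</name>
  <value>1000000</value>
</property>

<!-- Option 2 (alternative to option 1): do not purge leader logs up to the
     snapshot index until the followers have caught up. -->
<property>
  <name>ozone.om.ratis.log.purge.upto.snapshot.index</name>
  <value>false</value>
</property>
{code}
Both options presumably trade extra Raft log disk usage on the leader for a 
lower chance of repeated OM DB downloads; HDDS-8131 has the full discussion 
of the tradeoffs.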



