On Wed, Nov 15, 2017 at 9:06 AM, Masahiko Sawada <sawada.m...@gmail.com> wrote: >> On Nov 15, 2017 2:59 AM, "Fujii Masao" <masao.fu...@gmail.com> wrote: >> + /* Quick exit if we have done the backup */ >> + if (XLogCtl->Insert.exclusiveBackupState == EXCLUSIVE_BACKUP_NONE) >> + return; >> >> This quick exit seems to cause another problem. Please imagine the >> case where there is no exclusive backup running, a backend starts >> non-exclusive backup and is terminated before calling pg_stop_backup(). >> In this case, do_pg_abort_backup() should decrement >> XLogCtl->Insert.nonExclusiveBackups, but, with the patch, because of >> the above quick exit, do_pg_abort_backup() fails to decrement that. > > Hmm, I think that in this case because exclusiveBackupState is not > EXCLUSIVE_BACKUP_NONE, nonExclusiveBackups is surely decremented. Am I > missing something?
Nah. Fujii-san is right here as exclusiveBackupState is never updated for non-exclusive backups. You need an extra check on sessionBackupState to make sure that even for non-exclusive backups the counter is correctly decremented if a non-exclusive session lock is hold. For an exclusive backup, the session lock can be either SESSION_BACKUP_EXCLUSIVE if an exclusive backup is stopped on the same session as the start phase, or SESSION_BACKUP_NONE if the exclusive backup is stopped from a different session. So you'd basically need that: + /* + * Quick exit if we have done the exclusive backup or if session is + * not keeping around a started non-exclusive backup. + */ + if (XLogCtl->Insert.exclusiveBackupState == EXCLUSIVE_BACKUP_NONE && + sessionBackupState != SESSION_BACKUP_NON_EXCLUSIVE) + return; At the same time it would be safer at startup phase to update sessionBackupState when holding the WAL insert lock to prevent other CHECK_FOR_INTERRUPTS hazards. Suggestion of patch attached. -- Michael
fix_do_pg_abort_backup_v4.patch
Description: Binary data