Hi, On Wed, Sep 26, 2018 at 10:46:06AM +0200, Fabien COELHO wrote: > The xml documentation should be updated! (It is kind of hard to notice what > is not there:-) > > See "doc/src/sgml/ref/pg_verify_checksums.sgml".
Right, I've added a paragraph. > >>The time() granularity to the second makes the result awkward on small > >>tests: > >> > >> 8/1554552 kB (0%, 8 kB/s) > >> 1040856/1554552 kB (66%, 1040856 kB/s) > >> 1554552/1554552 kB (100%, 1554552 kB/s) > >> > >>I'd suggest to reuse the "portability/instr_time.h" stuff which allows a > >>much better precision. > > I still think it would make sense to use that instead of low-precision > time(). As the use-case of this is not for small tests, I don't think it is useful to make the code more complicated for this. > >>The "kB" kilo-byte symbols stands for 1000 bytes, but you are displaying > >>1024-bytes kilos, which symbol is either "KiB" or "KB". Given that we are > >>about storage which is usually measured in powers of 1,000, so I'd suggest > >>to use thousands. > >> > >>Another reserve I have is that on any hardware it is likely that there will > >>be a lot of kilos, so maybe megas (MB) should be used instead. > > > >The display is exactly the same as in pg_basebackup (except the > >bandwith is shown as well), so right now I think it is more useful to be > >consistent here. > > Hmmm... units are units, and the display is wrong. The fact that other > commands are wrong as well does not look like a good argument to persist in > an error. I've had a look around, and "kB" seems to be a common unit for 1024 bytes, e.g. in pg_test_fsync etc., unless I am mistaken? > >So if we change that, pg_basebackup should be changed as well I think. > > I'm okay with fixing pg_basebackup as well! I'm unsure about the best place > to put such a function, though. Probably under "src/common/" where there is > already some front-end specific code ("fe_memutils.c"). > > >Maybe the code could be factored out to some common file in the future. > > I would be okay with a progress printing function shared between some > commands. It just needs some place. pg_rewind also has a --rewind option. I guess you mean pg_rewind also has a --progress option. I do agree it makes sense to refactor that, but I don't think this should be part of this patch. > >> + memset(&act, '\0', sizeof(act)); > >> > >>pg uses 0 instead of '\0' everywhere else. > > > >Ok. > > Not '0', plain 0 (the integer of value zero). Oops, thanks for spotting that. I've attached v4 of the patch. Michael -- Michael Banck Projektleiter / Senior Berater Tel.: +49 2166 9901-171 Fax: +49 2166 9901-100 Email: michael.ba...@credativ.de credativ GmbH, HRB Mönchengladbach 12080 USt-ID-Nummer: DE204566209 Trompeterallee 108, 41189 Mönchengladbach Geschäftsführung: Dr. Michael Meskes, Jörg Folz, Sascha Heuer Unser Umgang mit personenbezogenen Daten unterliegt folgenden Bestimmungen: https://www.credativ.de/datenschutz
diff --git a/doc/src/sgml/ref/pg_verify_checksums.sgml b/doc/src/sgml/ref/pg_verify_checksums.sgml index 905b8f1222..5550b7b2bf 100644 --- a/doc/src/sgml/ref/pg_verify_checksums.sgml +++ b/doc/src/sgml/ref/pg_verify_checksums.sgml @@ -80,6 +80,19 @@ PostgreSQL documentation </varlistentry> <varlistentry> + <term><option>-P</option></term> + <term><option>--progress</option></term> + <listitem> + <para> + Enable progress reporting. Turning this on will deliver an approximate + progress report during the checksum verification. Sending the + <systemitem>SIGUSR1</systemitem> signal will toggle progress reporting + on or off during the verification run. + </para> + </listitem> + </varlistentry> + + <varlistentry> <term><option>-V</option></term> <term><option>--version</option></term> <listitem> diff --git a/src/bin/pg_verify_checksums/pg_verify_checksums.c b/src/bin/pg_verify_checksums/pg_verify_checksums.c index 1bc020ab6c..ed0dec2325 100644 --- a/src/bin/pg_verify_checksums/pg_verify_checksums.c +++ b/src/bin/pg_verify_checksums/pg_verify_checksums.c @@ -10,6 +10,8 @@ #include "postgres_fe.h" #include <dirent.h> +#include <signal.h> +#include <time.h> #include <sys/stat.h> #include <unistd.h> @@ -29,9 +31,18 @@ static ControlFileData *ControlFile; static char *only_relfilenode = NULL; static bool verbose = false; +static bool show_progress = false; static const char *progname; +/* + * Progress status information. + */ +int64 total_size = 0; +int64 current_size = 0; +pg_time_t last_progress_update; +pg_time_t scan_started; + static void usage(void) { @@ -42,6 +53,7 @@ usage(void) printf(_(" [-D, --pgdata=]DATADIR data directory\n")); printf(_(" -v, --verbose output verbose messages\n")); printf(_(" -r RELFILENODE check only relation with specified relfilenode\n")); + printf(_(" -P, --progress show progress information\n")); printf(_(" -V, --version output version information, then exit\n")); printf(_(" -?, --help show this help, then exit\n")); printf(_("\nIf no data directory (DATADIR) is specified, " @@ -57,6 +69,71 @@ static const char *const skip[] = { NULL, }; +static void +toggle_progress_report(int signo, + siginfo_t * siginfo, + void *context) +{ + + /* we handle SIGUSR1 only, and toggle the value of show_progress */ + if (signo == SIGUSR1) + show_progress = !show_progress; + +} + +/* + * Report current progress status. Parts borrowed from + * PostgreSQLs' src/bin/pg_basebackup.c + */ +static void +report_progress(bool force) +{ + pg_time_t now = time(NULL); + int total_percent = 0; + + char totalstr[32]; + char currentstr[32]; + char currspeedstr[32]; + + /* Make sure we report at most once a second */ + if ((now == last_progress_update) && !force) + return; + + /* Save current time */ + last_progress_update = now; + + /* + * Calculate current percent done, based on KiB... + */ + total_percent = total_size ? (int64) ((current_size / 1024) * 100 / (total_size / 1024)) : 0; + + /* Don't display larger than 100% */ + if (total_percent > 100) + total_percent = 100; + + /* The same for total size */ + if (current_size > total_size) + total_size = current_size / 1024; + + snprintf(totalstr, sizeof(totalstr), INT64_FORMAT, + total_size / 1024); + snprintf(currentstr, sizeof(currentstr), INT64_FORMAT, + current_size / 1024); + snprintf(currspeedstr, sizeof(currspeedstr), INT64_FORMAT, + (current_size / 1024) / (((time(NULL) - scan_started) == 0) ? 1 : (time(NULL) - scan_started))); + fprintf(stderr, "%s/%s kB (%d%%, %s kB/s)", + currentstr, totalstr, total_percent, currspeedstr); + + /* + * If we are reporting to a terminal, send a carriage return so that we + * stay on the same line. If not, send a newline. + */ + if (isatty(fileno(stderr))) + fprintf(stderr, "\r"); + else + fprintf(stderr, "\n"); +} + static bool skipfile(const char *fn) { @@ -110,6 +187,7 @@ scan_file(const char *fn, BlockNumber segmentno) continue; csum = pg_checksum_page(buf.data, blockno + segmentno * RELSEG_SIZE); + current_size += r; if (csum != header->pd_checksum) { if (ControlFile->data_checksum_version == PG_DATA_CHECKSUM_VERSION) @@ -117,6 +195,9 @@ scan_file(const char *fn, BlockNumber segmentno) progname, fn, blockno, csum, header->pd_checksum); badblocks++; } + + if (show_progress) + report_progress(false); } if (verbose) @@ -126,9 +207,10 @@ scan_file(const char *fn, BlockNumber segmentno) close(f); } -static void -scan_directory(const char *basedir, const char *subdir) +static int64 +scan_directory(const char *basedir, const char *subdir, bool sizeonly) { + int64 dirsize = 0; char path[MAXPGPATH]; DIR *dir; struct dirent *de; @@ -191,16 +273,22 @@ scan_directory(const char *basedir, const char *subdir) /* Relfilenode not to be included */ continue; - scan_file(fn, segmentno); + dirsize += st.st_size; + + if (!sizeonly) + { + scan_file(fn, segmentno); + } } #ifndef WIN32 else if (S_ISDIR(st.st_mode) || S_ISLNK(st.st_mode)) #else else if (S_ISDIR(st.st_mode) || pgwin32_is_junction(fn)) #endif - scan_directory(path, de->d_name); + dirsize += scan_directory(path, de->d_name, sizeonly); } closedir(dir); + return dirsize; } int @@ -208,6 +296,7 @@ main(int argc, char *argv[]) { static struct option long_options[] = { {"pgdata", required_argument, NULL, 'D'}, + {"progress", no_argument, NULL, 'P'}, {"verbose", no_argument, NULL, 'v'}, {NULL, 0, NULL, 0} }; @@ -217,6 +306,8 @@ main(int argc, char *argv[]) int option_index; bool crc_ok; + struct sigaction act; /* to turn progress status info on */ + set_pglocale_pgservice(argv[0], PG_TEXTDOMAIN("pg_verify_checksums")); progname = get_progname(argv[0]); @@ -235,7 +326,7 @@ main(int argc, char *argv[]) } } - while ((c = getopt_long(argc, argv, "D:r:v", long_options, &option_index)) != -1) + while ((c = getopt_long(argc, argv, "D:r:vP", long_options, &option_index)) != -1) { switch (c) { @@ -253,6 +344,9 @@ main(int argc, char *argv[]) } only_relfilenode = pstrdup(optarg); break; + case 'P': + show_progress = true; + break; default: fprintf(stderr, _("Try \"%s --help\" for more information.\n"), progname); exit(1); @@ -306,10 +400,54 @@ main(int argc, char *argv[]) exit(1); } + + /* + * Assign SIGUSR1 signal handler to toggle progress status information + */ + memset(&act, 0, sizeof(act)); + act.sa_sigaction = &toggle_progress_report; + act.sa_flags = SA_SIGINFO; + + /* + * Enable signal handler, but don't treat it as severe if we don't + * succeed here. Just give a message on STDERR. + */ + if (sigaction(SIGUSR1, &act, NULL) < 0) + { + fprintf(stderr, _("%s: could not set signal handler to toggle progress\n"), + progname); + } + + /* + * Iff progress status information is requested, we need to scan the + * directory tree(s) twice, once to get the idea how much data we need + * to scan and finally to do the real legwork. + */ + total_size = scan_directory(DataDir, "global", true); + total_size += scan_directory(DataDir, "base", true); + total_size += scan_directory(DataDir, "pg_tblspc", true); + + /* + * Remember start time. Required to calculate the current speed in + * report_progress() + */ + scan_started = time(NULL); + /* Scan all files */ - scan_directory(DataDir, "global"); - scan_directory(DataDir, "base"); - scan_directory(DataDir, "pg_tblspc"); + scan_directory(DataDir, "global", false); + scan_directory(DataDir, "base", false); + scan_directory(DataDir, "pg_tblspc", false); + + /* + * Done. Move to next line in case progress information was shown. + * Otherwise we clutter the summary output. + */ + if (show_progress) + { + report_progress(true); + if (isatty(fileno(stderr))) + fprintf(stderr, "\n"); + } printf(_("Checksum scan completed\n")); printf(_("Data checksum version: %d\n"), ControlFile->data_checksum_version);