[PATCH v2] src/port/snprintf.c: Optimize the common base=10 case in fmtint

fmtint() turns an integer into a string for a given base, and to do this
it does a divide/modulo operation iteratively.
The only possible base values are 8, 10 and 16

On just about any CPU, divides are a pretty expensive operation, generally
10x to 20x or more expensive than adds or multiplies.

By special casing the base values, the compiler (gcc or other) can (and will)
replace the divide by a multiply with 0xcccccccccccccccd (for base 10) or bitops
for base 8 and 16, yielding a lot faster code.

I considered a switch statement, but since base 10 is the most common by far,
I implemented it as a series of if/else statements with a likely() marking the 
10 case.

Even though this only shows up in the database creation phase of pgbench and 
not so much
during the normal run time, the optimization is simple and high value enough 
that
in my opinion it's worth doing




diff --git a/src/port/snprintf.c b/src/port/snprintf.c
index 7c21429369..547a59d4a0 100644
--- a/src/port/snprintf.c
+++ b/src/port/snprintf.c
@@ -1076,11 +1076,31 @@ fmtint(long long value, char type, int forcesign, int 
leftjust,
        else
        {
                /* make integer string */
-               do
-               {
-                       convert[sizeof(convert) - (++vallen)] = cvt[uvalue % 
base];
-                       uvalue = uvalue / base;
-               } while (uvalue);
+
+               /*
+                * Special case each of the possible base values  (8, 10, 16) 
to avoid an
+                * expensive divide operation
+                * (the compiler will use a multiply, shift or boolean ops for 
this)
+                */
+               if (likely(base == 10)) {
+                       do
+                       {
+                               convert[sizeof(convert) - (++vallen)] = 
cvt[uvalue % 10];
+                               uvalue = uvalue / 10;
+                       } while (uvalue);
+               } else if (base == 16) {
+                       do
+                       {
+                               convert[sizeof(convert) - (++vallen)] = 
cvt[uvalue % 16];
+                               uvalue = uvalue / 16;
+                       } while (uvalue);
+               } else if (base == 8) {
+                       do
+                       {
+                               convert[sizeof(convert) - (++vallen)] = 
cvt[uvalue % 8];
+                               uvalue = uvalue / 8;
+                       } while (uvalue);
+               }
        }

        zeropad = Max(0, precision - vallen);


Reply via email to