Re: PSA: we lack TAP test coverage on NetBSD and OpenBSD

Fabien COELHO Sun, 20 Jan 2019 13:55:55 -0800


Hello Tom,

Here is a POC which defines an internal interface for a PRNG, and uses it
within pgbench, with several possible implementations which default to
rand48.


I seriously dislike this patch.  pgbench's random support is quite
overengineered already IMO, and this proposes to add a whole batch of
new code and new APIs to fix a very small bug.

My intention is rather to discuss postgres' PRNG, in passing. Full success on this point :-)

I must admit that I have a grudge against standard rand48:


I think this is nonsense, particularly the claim that anything in PG
cares about the lowest-order bits of random doubles.  I'm aware that
there are applications where that does matter, but people aren't doing
high-precision weather simulations in pgbench.

Sure. My point is not that it is an actual issue for pgbench, but as the same PRNG is used more or less everywhere in postgres, I think that it should be a good one rather than a known bad one.


Eg, about double:

  \set i debug(random(1, POWER(2,49)) % 2)

Always returns 1 because of the 48-bit precision, i.e. the output is never even.


  \set i debug(random(1, POWER(2,48)) % 2)

Returns 0 1 0 1 0 1 0 1 0 1 0 1 0 1 ... because it is an LCG.

  \set i debug(random(1, POWER(2,48)) % 4)

Cycles over (3 2 1 0)*

  \set i debug(random(1, power(2, 47)) % 4)

Cycles over (0 0 1 1 2 2 3 3)*, and so on.
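
The patterns above can be reproduced outside pgbench with a small model of the generator. This is only a sketch: it assumes the standard rand48 recurrence X' = (0x5DEECE66D * X + 0xB) mod 2^48, and it assumes that random(lo, hi) scales the 48-bit state as lo + floor((hi - lo + 1) * X / 2^48). The helper names (rand48_states, scaled, residues) and the seed are hypothetical and are not taken from the pgbench source.

```python
# Minimal model of the rand48 LCG; NOT PostgreSQL's actual code.
A = 0x5DEECE66D          # multiplier of the rand48 family
C = 0xB                  # increment
MASK = (1 << 48) - 1     # the state is kept modulo 2**48


def rand48_states(seed, n):
    """Yield n successive 48-bit states of the LCG."""
    x = seed & MASK
    for _ in range(n):
        x = (A * x + C) & MASK
        yield x


def scaled(lo, hi, x):
    """Assumed scaling: lo + floor((hi - lo + 1) * x / 2**48)."""
    return lo + int((hi - lo + 1) * (x / 2.0**48))


def residues(hi, modulus, seed=12345, n=16):
    """Residues of n successive draws from [1, hi], taken modulo `modulus`."""
    return [scaled(1, hi, x) % modulus for x in rand48_states(seed, n)]


if __name__ == "__main__":
    print(residues(2**49, 2))   # range wider than the state: only odd values
    print(residues(2**48, 2))   # lowest state bit alternates
    print(residues(2**48, 4))   # two lowest bits have period 4
    print(residues(2**47, 4))   # bits 1..2 of the state have period 8
```

The lowest k bits of a power-of-two-modulus LCG depend only on the lowest k bits of the previous state, so they repeat with a period of at most 2^k whatever the seed.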

BTW, did you look at the question of the range of zipfian?


Yep.

I confirmed here that as used in the test case, it's generating a range way smaller than the other ones: repeating the insertion snippet 1000x produces stats like this: [...]

I have no idea whether that indicates an actual bug, or just poor
choice of parameter in the test's call.  But the very small number
of distinct outputs is disheartening at least.

The Zipf distribution is highly skewed, somewhat close to an exponential. To make the probability decrease more slowly, the parameter must be closer to 1, e.g. 1.05 or so. However, as far as the test is concerned, I do not see this as a significant issue. I was rather planning to submit a documentation improvement to provide more precise hints about how the distribution behaves depending on the parameter, and possibly reduce the parameter used in the test in passing, but I see this as not very urgent.
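
To get a feel for how the parameter affects the spread of outputs, here is a rough sketch. It assumes the textbook definition P(k) proportional to k^(-s) over 1..n and does not reproduce pgbench's own sampler. The function names and the values of n and s are illustrative only, not those used in the test.

```python
def zipf_probabilities(n, s):
    """Return P(k) for k = 1..n, with P(k) proportional to k**(-s)."""
    weights = [k ** -s for k in range(1, n + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def head_mass(n, s, head=10):
    """Probability that a draw lands on one of the `head` smallest values."""
    return sum(zipf_probabilities(n, s)[:head])


if __name__ == "__main__":
    # Illustrative parameters: the closer s is to 1, the less mass
    # is concentrated on the first few values.
    for s in (1.05, 1.5, 2.5):
        print(f"s={s}: P(value <= 10) = {head_mass(1000, s):.3f}")
```

With a large parameter almost all of the probability sits on the first handful of values, which would be consistent with seeing very few distinct outputs over a thousand draws.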


--
Fabien.
