On 12/01/13 08:07, alex23 wrote:
> On 11 Jan, 13:34, Steven D'Aprano <steve
> +comp.lang.pyt...@pearwood.info> wrote:
>> Well, that's not really a task for unit testing. Unit tests, like most
>> tests, are well suited to deterministic tests, but not really to
>> probabilistic testing. As far as I know, there aren't really any good
>> frameworks for probabilistic testing, so you're stuck with inventing your
>> own. (Possibly on top of unittest.)
>
> One approach I've had success with is providing a seed to the RNG, so
> that the random results are deterministic.


My ex-boss once instructed me to do the same thing to test functions that generate random variates. I used a statistical approach instead.

There are often several ways of generating data that follow a particular distribution. If you use a given seed so that you get a deterministic sequence of uniform random variates, you will get deterministic outputs for a specific implementation, but if you change the implementation the tests are likely to fail. For example, to generate a negative exponential variate, either -ln(U)/lambda or -ln(1-U)/lambda will do the job correctly (if U is uniform on (0,1) then so is 1-U), yet tests pinned to the exact outputs of one implementation would fail with the other. So each time you changed the implementation you'd need to change the tests.
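To make that concrete, here is a minimal sketch (the function names, seed and rate are purely illustrative):

import math
import random

def exp_variate_u(rng, lambd):
    # -ln(U)/lambda; U is uniform on [0.0, 1.0), and an exact 0.0
    # (which would break the log) is vanishingly rare.
    return -math.log(rng.random()) / lambd

def exp_variate_1mu(rng, lambd):
    # -ln(1-U)/lambda; 1-U is uniform on (0.0, 1.0], so the log is
    # always defined. Equally correct, same distribution.
    return -math.log(1.0 - rng.random()) / lambd

r1 = exp_variate_u(random.Random(42), 2.0)
r2 = exp_variate_1mu(random.Random(42), 2.0)
print(r1, r2)  # same seed, same distribution, different values

A test that asserted the exact values produced by the first function would fail the moment you switched to the second, even though both are correct.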

I think my boss had in mind that I would write the code, seed the RNG, call the function a few times, and then use the generated values as the expected outputs in the test. That would not even have tested the original implementation: the test would only have detected whether the implementation had later changed, which is arguably worse than no test at all. If I'd gone to the trouble of manually calculating the expected outputs, so that I had valid tests for the original implementation, then the test would effectively have served only as a reminder to repeat the whole manual calculation for any changed implementation.

A reasonably general statistical approach is possible. Any hypothesis about the generated data that lends itself to statistical testing can be used to produce a sequence of p-values (one for each set of generated values), and that sequence can itself be checked (statistically) for uniformity. This effectively tests the whole distribution of the test statistic, so it is better than simply checking that tests on generated data pass, say, 95% of the time (for a chosen 5% Type I error rate).
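For illustration, here is one way that might look, assuming SciPy is available for the Kolmogorov-Smirnov test and using the hypothetical exponential generator from the sketch above (batch size, batch count and rate are arbitrary choices):

import math
import random
from scipy import stats

def exp_variate(rng, lambd):
    # The generator under test: the -ln(U)/lambda version from above.
    return -math.log(rng.random()) / lambd

def batch_pvalue(seed, lambd, n=1000):
    # One batch of variates -> one p-value from a KS test against the
    # exponential CDF (SciPy parametrises it as loc=0, scale=1/lambda).
    rng = random.Random(seed)
    sample = [exp_variate(rng, lambd) for _ in range(n)]
    return stats.kstest(sample, "expon", args=(0, 1.0 / lambd)).pvalue

# One p-value per independently seeded batch.
pvalues = [batch_pvalue(seed, lambd=2.0) for seed in range(200)]

# If the generator is correct, these p-values should themselves be
# uniform on (0, 1); a second KS test checks that, rather than just
# counting how many individual batches pass at the 5% level.
stat, p = stats.kstest(pvalues, "uniform")
print("uniformity p-value:", p)

Cheers.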

Duncan