Thanks Martin for doing this. On Thu, Aug 18, 2022 at 10:16 AM Martin Storsjö <mar...@martin.st> wrote:
> This avoids triggering overflows in the filters, and avoids stray > test failures in the approximate functions on x86; due to rounding > differences, one implementation might overflow while another one > doesn't. > > Signed-off-by: Martin Storsjö <mar...@martin.st> > --- > FWIW, this modification runs successfully with over 1000 different > seeds in checkasm. > --- > tests/checkasm/sw_scale.c | 16 +++++++++++++++- > 1 file changed, 15 insertions(+), 1 deletion(-) > > diff --git a/tests/checkasm/sw_scale.c b/tests/checkasm/sw_scale.c > index d72506ed86..ec06eafebe 100644 > --- a/tests/checkasm/sw_scale.c > +++ b/tests/checkasm/sw_scale.c > @@ -188,7 +188,6 @@ static void check_yuv2yuvX(int accurate) > uint8_t d_val = rnd(); > memset(dither, d_val, LARGEST_INPUT_SIZE); > randomize_buffers((uint8_t*)src_pixels, LARGEST_FILTER * > LARGEST_INPUT_SIZE * sizeof(int16_t)); > - randomize_buffers((uint8_t*)filter_coeff, LARGEST_FILTER * > sizeof(int16_t)); > ctx = sws_alloc_context(); > if (accurate) > ctx->flags |= SWS_ACCURATE_RND; > @@ -202,6 +201,21 @@ static void check_yuv2yuvX(int accurate) > if (dstW <= osi) > continue; > for (fsi = 0; fsi < FILTER_SIZES; ++fsi) { > + // Generate filter coefficients for the given filter size, > + // with some properties: > + // - The coefficients add up to the intended sum (4096, > 1<<12) > + // - The coefficients contain negative values > + // - The filter intermediates don't overflow for worst > case > + // inputs (all positive coefficients are coupled with > + // input_max and all negative coefficients with > input_min, > + // or vice versa). > + // Produce a filter with all coefficients set to > + // -((1<<12)/(filter_size-1)) except for one (randomly > chosen) > + // which is set to ((1<<13)-1). > + for (i = 0; i < filter_sizes[fsi]; ++i) > + filter_coeff[i] = -((1 << 12) / (filter_sizes[fsi] - > 1)); > + filter_coeff[rnd() % filter_sizes[fsi]] = (1 << 13) - 1; > + > src = av_malloc(sizeof(int16_t*) * filter_sizes[fsi]); > vFilterData = av_malloc((filter_sizes[fsi] + 2) * > sizeof(union VFilterData)); > memset(vFilterData, 0, (filter_sizes[fsi] + 2) * > sizeof(union VFilterData)); > -- > 2.25.1 > > _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".