Lynne:
> Dec 23, 2023, 00:53 by jamr...@gmail.com:
> 
>> On an Intel Core i7 12700k:
>>
>> decorrelate_ls_c: 814.3
>> decorrelate_ls_sse2: 165.8
>> decorrelate_ls_avx2: 101.3
>> decorrelate_sf_c: 1602.6
>> decorrelate_sf_sse4: 640.1
>> decorrelate_sf_avx2: 324.6
>> decorrelate_sm_c: 1564.8
>> decorrelate_sm_sse2: 379.3
>> decorrelate_sm_avx2: 203.3
>> decorrelate_sr_c: 785.3
>> decorrelate_sr_sse2: 176.3
>> decorrelate_sr_avx2: 99.8
>>
>> Signed-off-by: James Almer <jamr...@gmail.com>
>>
> 
> Even better on a Zen3:
> checkasm: all 8 tests passed
> decorrelate_ls_c: 111.1
> decorrelate_ls_sse2: 272.6
> decorrelate_ls_avx2: 94.1
> decorrelate_sf_c: 170.6
> decorrelate_sf_sse4: 400.1
> decorrelate_sf_avx2: 196.1
> decorrelate_sm_c: 187.6
> decorrelate_sm_sse2: 383.1
> decorrelate_sm_avx2: 179.1
> decorrelate_sr_c: 102.6
> decorrelate_sr_sse2: 272.6
> decorrelate_sr_avx2: 94.1
> 

The SSE2 version is worse than the C version? Does this happen for more
DSP code?
(For decorrelate_sf_c, the C version is still the best and the gain of
AVX2 over C is not good for the other three either.)

- Andreas

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".

Reply via email to