Re: [FFmpeg-devel] [PATCH 3/3] avfilter/vf_convolution: add X86 SIMD for filter_column()

2019-12-04 Thread
cmp iq, radq >>+jl .loopr_i >>+ >>+pxor m4, m4 >>+cvtsi2ss m4, sumd >>+mulss m4, m0 ; sum *= rdiv >>+addss m4, m1 ; sum += bias >>+addss m4, m5 ; sum += 0.5 >>+cvttps2dq m

Re: [FFmpeg-devel] [PATCH] avfilter/vf_convolution: add 16-column operation for filter_column() to prepare for x86 SIMD.

2019-12-01 Thread
for x86 SIMD. > 在 2019年12月2日,10:42,徐鋆 写道: > > I'm sorry not to reply in time. > > The performance of this C code is about 10% better than the existing C code. > > It will have a bigger improvement after X86 SIMD optimizations. 1. How to test? 1. 怎么测试的? 1. どうやってテストした

Re: [FFmpeg-devel] [PATCH] avfilter/vf_convolution: add 16-column operation for filter_column() to prepare for x86 SIMD.

2019-12-01 Thread
ink above, or email ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe". -- 敬颂钧安, 徐鋆 电子信息与电气工程学院 上海交通大学 邮箱:xuju...@sjtu.edu.cn 地址:上海市闵行区东川路800号 Yours sincerely, Xylem(Jun Xu) School of Electronic, Information and Electrical Engineering Shanghai Jiao Tong University Emai