On Tue, Oct 6, 2015 at 5:42 PM, Ronald S. Bultje <rsbul...@gmail.com> wrote: > +cglobal vp9_%1_%3_4x4_add_10, 3, 3, 0, dst, stride, block, eob [...] > + mova m0, [blockq+0*16+0] > + mova m4, [blockq+0*16+8] > + mova m1, [blockq+1*16+0] > + mova m5, [blockq+1*16+8] > + packssdw m0, m4 > + packssdw m1, m5 > + mova m2, [blockq+2*16+0] > + mova m4, [blockq+2*16+8] > + mova m3, [blockq+3*16+0] > + mova m5, [blockq+3*16+8] > + packssdw m2, m4 > + packssdw m3, m5
Use packssdw with a memory arg as the second operand. The mixing of MMX and SSE is quite ugly in general, but whatever works. _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org http://ffmpeg.org/mailman/listinfo/ffmpeg-devel