Le lauantaina 19. lokakuuta 2024, 13.55.46 EEST Rémi Denis-Courmont a écrit :
> Le perjantaina 11. lokakuuta 2024, 13.40.20 EEST u...@foxmail.com a écrit :
> > From: sunyuechi <sunyue...@iscas.ac.cn>
> > +.macro put_uni_pixels w, vlen, id
> > +\id\w\vlen:
> > +.if \w == 128 && \vlen == 128
> > +        li                t0, \w
> > +        vsetvli           zero, t0, e8, m8, ta, ma
> > +.else
> > +        vsetvlstatic8     \w, \vlen
> > +.endif
> > +1:
> > +        vle8.v            v0, (a2)
> > +        addi              a4, a4, -1
> > +        vse8.v            v0, (a0)
> > +        add               a2, a2, a3
> > +        add               a0, a0, a1
> > +        bnez              a4, 1b
> > +        ret
> 
> Up to 64-bit rows, you can use strided loads and stores here.

Or perhaps not if the vectors are not aligned but vectors should not be 
necessary here. This is especially true on the BPi whose memory bus is rather 
slow, so even scalar copy can saturate it.

-- 
Rémi Denis-Courmont
http://www.remlab.net/



_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".

Reply via email to