Re: [FFmpeg-devel] [PATCH 1/6] lavu/riscv: implement floating point clips

2024-07-25 Thread Rémi Denis-Courmont
Le 25 juillet 2024 23:25:15 GMT+03:00, "Rémi Denis-Courmont" a écrit : >Unlike x86, fmin/fmax are single instructions, not function calls. They >are much much faster than doing a comparison, then branching based on its >results. With this, audiodsp.vector_clipf gets almost twice as fast, and >a

[FFmpeg-devel] [PATCH 1/6] lavu/riscv: implement floating point clips

2024-07-25 Thread Rémi Denis-Courmont
Unlike x86, fmin/fmax are single instructions, not function calls. They are much much faster than doing a comparison, then branching based on its results. With this, audiodsp.vector_clipf gets almost twice as fast, and a properly unrollled version of it gets 4-5x faster, on SiFive-U74. This is only