From: Kaustubh Raste
Use mask buffer.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevcdsp_msa.c | 541 -
1 file changed, 312 insertions(+), 229 deletions(-)
diff --git a/libavcodec/mips/hevcdsp_msa.c b/libavcodec/mips/hevcdsp_msa.c
index b17127c.
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_macros_msa.h | 37 -
1 file changed, 37 deletions(-)
diff --git a/libavcodec/mips/hevc_macros_msa.h
b/libavcodec/mips/hevc_macros_msa.h
index 27c69ff..ea53812 100644
--- a/libavco
From: Kaustubh Raste
Use mask buffer.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevcdsp_msa.c | 1478 -
1 file changed, 870 insertions(+), 608 deletions(-)
diff --git a/libavcodec/mips/hevcdsp_msa.c b/libavcodec/mips/hevcdsp_msa.c
index 73cc3ea.
From: Kaustubh Raste
Use global mask buffer for appropriate mask load.
Use immediate unsigned saturation for clip to max saving one vector register.
Remove unused macro.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_uniw_msa.c | 1052 ++--
1 file cha
From: Kaustubh Raste
Use global mask buffer for appropriate mask load.
Remove unused macro and table.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_uni_msa.c | 1125 +++--
1 file changed, 566 insertions(+), 559 deletions(-)
diff --git a/libavcodec/m
From: Kaustubh Raste
Use global mask buffer for appropriate mask load.
Use immediate unsigned saturation for clip to max saving one vector register.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_biw_msa.c | 1396 +++--
1 file changed, 872 insertions(
From: Kaustubh Raste
Use global mask buffer for appropriate mask load.
Use immediate unsigned saturation for clip to max saving one vector register.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_bi_msa.c | 1140 +++---
1 file changed, 685 insertions(
From: Kaustubh Raste
Align the mask buffer to 64 bytes.
Load the specific destination bytes instead of MSA load and pack.
Remove unused macros and functions.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264qpel_msa.c | 1269
1 file changed, 751 in
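As a rough illustration of the "align the mask buffer to 64 bytes" note above, the sketch below declares a small constant table with 64-byte alignment using C11 `alignas`. The table name, its contents, and the helper functions are hypothetical and are not taken from the patch, which may well use a project-specific alignment macro instead.

```c
#include <assert.h>
#include <stdalign.h>
#include <stdint.h>

/* Hypothetical shuffle-mask table; the values are illustrative only.
 * alignas(64) asks the compiler to place the table on a 64-byte boundary,
 * so a row never straddles a typical 64-byte cache line. */
static const alignas(64) uint8_t example_mask_arr[16 * 2] = {
    /* row 0: mask for a wider block (assumed layout) */
    0, 1, 1, 2, 2, 3, 3, 4, 4, 5, 5, 6, 6, 7, 7, 8,
    /* row 1: mask for a narrower block (assumed layout) */
    0, 1, 1, 2, 2, 3, 3, 4, 16, 17, 17, 18, 18, 19, 19, 20
};

/* Returns non-zero when p sits on a 64-byte boundary. */
static inline int is_aligned_64(const void *p)
{
    return ((uintptr_t)p & 63u) == 0;
}

/* Picks one 16-byte row of the shared table. */
static inline const uint8_t *example_mask_row(unsigned row)
{
    assert(row < 2);
    return &example_mask_arr[16 * row];
}
```

Keeping every mask in one aligned table means each function only needs an offset into it rather than a private copy.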
From: Kaustubh Raste
Use global mask buffer for appropriate mask load.
Use immediate unsigned saturation for clip to max saving one vector register.
Remove unused macro.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_uniw_msa.c | 892 ++--
1 file cha
From: Kaustubh Raste
Use global mask buffer for appropriate mask load.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_uni_msa.c | 509 -
1 file changed, 274 insertions(+), 235 deletions(-)
diff --git a/libavcodec/mips/hevc_mc_uni_msa.c
b/libavc
From: Kaustubh Raste
Use global mask buffer for appropriate mask load.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_biw_msa.c | 587 -
1 file changed, 247 insertions(+), 340 deletions(-)
diff --git a/libavcodec/mips/hevc_mc_biw_msa.c
b/libavc
From: Kaustubh Raste
Use global mask buffer for appropriate mask load.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_bi_msa.c | 428 +++---
1 file changed, 210 insertions(+), 218 deletions(-)
diff --git a/libavcodec/mips/hevc_mc_bi_msa.c b/libavcod
From: Kaustubh Raste
Load the specific destination bytes instead of MSA load and pack.
Remove unused macros and functions.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264qpel_msa.c | 1274 ++--
1 file changed, 834 insertions(+), 440 deletions(-)
diff
From: Kaustubh Raste
Remove loops and unroll as block sizes are known.
Load the specific destination bytes instead of MSA load and pack.
Remove unused macro and functions.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264qpel_msa.c | 1002 +++-
1 file c
From: Kaustubh Raste
Use immediate unsigned saturation for clip to max saving one vector register.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_macros_msa.h | 9 +
libavcodec/mips/hevc_mc_uniw_msa.c | 1598 +---
2 files changed, 965 insertions(+),
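The "immediate unsigned saturation" note above can be pictured with a scalar model. A conventional clip to [0, 255] compares against both bounds, so a vector version has to keep the constant 255 in a register. If negatives are first clamped to zero, the upper bound can instead be applied by an unsigned saturate whose bit width is encoded in the instruction. The sketch below assumes that reading of the commit message; the function names are hypothetical and the code is plain C, not MSA intrinsics.

```c
#include <assert.h>
#include <stdint.h>

/* Scalar model of an unsigned saturate-to-N-bits step: anything above
 * 2^bits - 1 is pinned to that maximum. At a real call site the bit
 * width would be an immediate, so no "max value" operand is needed. */
static inline uint16_t sat_u16(uint16_t x, unsigned bits)
{
    assert(bits >= 1 && bits <= 16);
    uint16_t max = (uint16_t)((1u << bits) - 1u);
    return x > max ? max : x;
}

/* Reference clip: two comparisons, one of them against the constant 255. */
static inline int16_t clip_0_255_ref(int16_t x)
{
    if (x < 0)
        return 0;
    if (x > 255)
        return 255;
    return x;
}

/* Alternative: clamp negatives to zero, then saturate as unsigned to 8 bits. */
static inline int16_t clip_0_255_sat(int16_t x)
{
    uint16_t nonneg = (uint16_t)(x < 0 ? 0 : x);
    return (int16_t)sat_u16(nonneg, 8);
}
```

Both versions agree for every 16-bit input; the difference is only in which operands a vectorized form would need to keep live.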
From: Kaustubh Raste
Remove unused macro.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_uni_msa.c | 744 +
1 file changed, 499 insertions(+), 245 deletions(-)
diff --git a/libavcodec/mips/hevc_mc_uni_msa.c
b/libavcodec/mips/hevc_mc_uni_msa.c
i
From: Kaustubh Raste
Align the mask buffer.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_bi_msa.c | 940 --
1 file changed, 595 insertions(+), 345 deletions(-)
diff --git a/libavcodec/mips/hevc_mc_bi_msa.c b/libavcodec/mips/hevc_mc_bi_msa.c
in
From: Kaustubh Raste
Replace generic with block size specific function.
Load the specific destination bytes instead of MSA load and pack.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264chroma_msa.c | 438 +-
1 file changed, 238 insertions(+), 200 del
From: Kaustubh Raste
Remove loops and unroll as block sizes are known.
Load the specific destination bytes instead of MSA load and pack.
Remove unused macro and functions.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264qpel_msa.c | 781 ++--
1 file c
From: Kaustubh Raste
Use immediate unsigned saturation for clip to max saving one vector register.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_biw_msa.c | 706 ++-
libavutil/mips/generic_macros_msa.h | 35 ++
2 files changed, 489 insertions(+)
From: Kaustubh Raste
Replace generic with block size specific function.
Load the specific destination bytes instead of MSA load and pack.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264chroma_msa.c | 627 +-
1 file changed, 275 insertions(+), 352 del
From: Kaustubh Raste
Remove loops and unroll as block sizes are known.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264qpel_msa.c | 400
1 file changed, 240 insertions(+), 160 deletions(-)
diff --git a/libavcodec/mips/h264qpel_msa.c b/libavcodec
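A minimal sketch of what "remove loops and unroll as block sizes are known" can mean in practice: a generic routine takes the height at run time and loops, while a size-specific routine writes each row out explicitly. The function names and the 4x4 copy are hypothetical stand-ins chosen for brevity; the actual patch works on quarter-pel filters, not plain copies.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Generic version: the height is a runtime value, so a loop is required. */
static void copy_width4_generic(const uint8_t *src, ptrdiff_t src_stride,
                                uint8_t *dst, ptrdiff_t dst_stride,
                                int height)
{
    assert(height >= 0);
    for (int y = 0; y < height; y++) {
        memcpy(dst, src, 4);
        src += src_stride;
        dst += dst_stride;
    }
}

/* Size-specific version: the block is known to be 4x4, so the four rows
 * are written out directly with no loop counter or branch. */
static void copy_4x4(const uint8_t *src, ptrdiff_t src_stride,
                     uint8_t *dst, ptrdiff_t dst_stride)
{
    memcpy(dst,                  src,                  4);
    memcpy(dst + dst_stride,     src + src_stride,     4);
    memcpy(dst + 2 * dst_stride, src + 2 * src_stride, 4);
    memcpy(dst + 3 * dst_stride, src + 3 * src_stride, 4);
}
```

The trade-off is more code (one function per block size) in exchange for straight-line execution.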
From: Kaustubh Raste
Pack the data to half word before clipping.
Use immediate unsigned saturation for clip to max saving one vector register.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_uniw_msa.c | 441 ++--
1 file changed, 222 insertions(+), 21
From: Kaustubh Raste
Update macros to remove adds.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_uni_msa.c | 612 +++--
1 file changed, 323 insertions(+), 289 deletions(-)
diff --git a/libavcodec/mips/hevc_mc_uni_msa.c
b/libavcodec/mips/hevc_mc_un
From: Kaustubh Raste
Load the specific destination bytes instead of MSA load and pack.
Use immediate unsigned saturation for clip to max saving one vector register.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_bi_msa.c | 652 +-
1 file changed,
From: Kaustubh Raste
Remove loops and unroll as block sizes are known.
Removed unused functions.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264qpel_msa.c | 1013 +---
1 file changed, 747 insertions(+), 266 deletions(-)
diff --git a/libavcodec/mips/h
From: Kaustubh Raste
Replace generic with block size specific function.
Load the specific destination bytes instead of MSA load and pack.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264chroma_msa.c | 329 --
1 file changed, 169 insertions(+), 160 del
From: Kaustubh Raste
Load the specific bytes instead of MSA load.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_uni_msa.c | 245 +++--
1 file changed, 100 insertions(+), 145 deletions(-)
diff --git a/libavcodec/mips/hevc_mc_uni_msa.c
b/libavcodec/
From: Kaustubh Raste
Load the specific destination bytes instead of MSA load and pack.
Pack the data to half word before clipping.
Use immediate unsigned saturation for clip to max saving one vector register.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_macros_msa.h | 13 +-
libav
From: Kaustubh Raste
Remove loops and unroll as block sizes are known.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264qpel_msa.c | 1219 ++--
1 file changed, 802 insertions(+), 417 deletions(-)
diff --git a/libavcodec/mips/h264qpel_msa.c b/libavcodec
From: Kaustubh Raste
Replace generic with block size specific function.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264chroma_msa.c | 309 --
1 file changed, 166 insertions(+), 143 deletions(-)
diff --git a/libavcodec/mips/h264chroma_msa.c b/libavco
From: Kaustubh Raste
Replace generic with block size specific function.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264dsp_msa.c | 469 +++
libavutil/mips/generic_macros_msa.h | 4 +
2 files changed, 311 insertions(+), 162 deletions(-)
diff --g
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_lpf_sao_msa.c | 194
1 file changed, 132 insertions(+), 62 deletions(-)
diff --git a/libavcodec/mips/hevc_lpf_sao_msa.c
b/libavcodec/mips/hevc_lpf_sao_msa.c
index c192265..5b5537a
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264qpel_msa.c | 746
libavcodec/mips/hevc_mc_uniw_msa.c | 67
libavcodec/mips/hevcdsp_msa.c | 50 ---
3 files changed, 863 deletions(-)
diff --git a/libavcodec/mips/h
From: Kaustubh Raste
Remove loops and unroll as block sizes are known.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264qpel_msa.c | 441 +++-
1 file changed, 432 insertions(+), 9 deletions(-)
diff --git a/libavcodec/mips/h264qpel_msa.c b/libavcodec/m
From: Kaustubh Raste
Replace generic with block size specific function.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264chroma_msa.c | 237 ++
1 file changed, 112 insertions(+), 125 deletions(-)
diff --git a/libavcodec/mips/h264chroma_msa.c b/libavco
From: Kaustubh Raste
Replace generic with block size specific function.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264dsp_msa.c | 423 ++-
libavutil/mips/generic_macros_msa.h | 36 +++
2 files changed, 306 insertions(+), 153 deletions(-)
diff -
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264pred_msa.c | 215 +---
1 file changed, 92 insertions(+), 123 deletions(-)
diff --git a/libavcodec/mips/h264pred_msa.c b/libavcodec/mips/h264pred_msa.c
index c297aec..b9990c1 100644
-
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_lpf_sao_msa.c | 197
1 file changed, 135 insertions(+), 62 deletions(-)
diff --git a/libavcodec/mips/hevc_lpf_sao_msa.c
b/libavcodec/mips/hevc_lpf_sao_msa.c
index 39c647e..c192265
From: Kaustubh Raste
Replace generic with block size specific function.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264chroma_msa.c | 231 ++
1 file changed, 133 insertions(+), 98 deletions(-)
diff --git a/libavcodec/mips/h264chroma_msa.c b/libavcod
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264pred_msa.c | 318
1 file changed, 158 insertions(+), 160 deletions(-)
diff --git a/libavcodec/mips/h264pred_msa.c b/libavcodec/mips/h264pred_msa.c
index cddcd2e..6c7e756 100644
From: Kaustubh Raste
Load the specific destination bytes instead of MSA load and pack.
Pack the data to half word before clipping.
Use immediate unsigned saturation for clip to max saving one vector register.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_uniw_msa.c | 559
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevcdsp_msa.c | 168 +++--
1 file changed, 160 insertions(+), 8 deletions(-)
diff --git a/libavcodec/mips/hevcdsp_msa.c b/libavcodec/mips/hevcdsp_msa.c
index f2bc748..1a854b2 100644
---
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_lpf_sao_msa.c | 181
1 file changed, 122 insertions(+), 59 deletions(-)
diff --git a/libavcodec/mips/hevc_lpf_sao_msa.c
b/libavcodec/mips/hevc_lpf_sao_msa.c
index 3472d32..39c647e
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264dsp_msa.c | 274 +
1 file changed, 110 insertions(+), 164 deletions(-)
diff --git a/libavcodec/mips/h264dsp_msa.c b/libavcodec/mips/h264dsp_msa.c
index a17eacb..422703d 100644
--
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264qpel_msa.c | 284 +++-
1 file changed, 278 insertions(+), 6 deletions(-)
diff --git a/libavcodec/mips/h264qpel_msa.c b/libavcodec/mips/h264qpel_msa.c
index 05dffea..b7f6c3d 100644
--
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_lpf_sao_msa.c | 232 +---
1 file changed, 138 insertions(+), 94 deletions(-)
diff --git a/libavcodec/mips/hevc_lpf_sao_msa.c
b/libavcodec/mips/hevc_lpf_sao_msa.c
index 1d77432..3472d32
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_mc_uni_msa.c | 372 +
1 file changed, 133 insertions(+), 239 deletions(-)
diff --git a/libavcodec/mips/hevc_mc_uni_msa.c
b/libavcodec/mips/hevc_mc_uni_msa.c
index 754fbdb..cf22e7f
From: Kaustubh Raste
Preload data in band filter 0-8 for better pipeline parallelization.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_lpf_sao_msa.c | 174 ++-
libavutil/mips/generic_macros_msa.h | 1 +
2 files changed, 112 insertions(+), 63 delet
From: Kaustubh Raste
Remove loops and unroll as block sizes are known.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264qpel_msa.c | 81 +---
1 file changed, 75 insertions(+), 6 deletions(-)
diff --git a/libavcodec/mips/h264qpel_msa.c b/libavcodec/mi
From: Kaustubh Raste
Optimize luma intra case by reducing conditional cases.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264dsp_msa.c | 428 +
1 file changed, 138 insertions(+), 290 deletions(-)
diff --git a/libavcodec/mips/h264dsp_msa.c b/libav
From: Kaustubh Raste
Align the buffers. Remove redundant constant array.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_idct_msa.c | 255 ++-
1 file changed, 171 insertions(+), 84 deletions(-)
diff --git a/libavcodec/mips/hevc_idct_msa.c b/libavco
From: Kaustubh Raste
Separate the filter processing into all-strong, all-weak, and strong + weak cases.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/hevc_lpf_sao_msa.c | 750 ++--
1 file changed, 556 insertions(+), 194 deletions(-)
diff --git a/libavcodec/mi
From: Kaustubh Raste
Load the specific destination bytes instead of MSA load and pack.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264qpel_msa.c | 17 +-
libavcodec/mips/vp9_mc_msa.c | 759 ---
libavutil/mips/generic_macros_msa.h | 24 +-
From: Kaustubh Raste
Removed memset calls.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/vp9_idct_msa.c | 118
1 file changed, 70 insertions(+), 48 deletions(-)
diff --git a/libavcodec/mips/vp9_idct_msa.c b/libavcodec/mips/vp9_idct_msa.c
index 25e
From: Kaustubh Raste
Updated VP9_LPF_FILTER4_4W macro to process on 8 bit data.
Replaced VP9_LPF_FILTER4_8W with VP9_LPF_FILTER4_4W.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/vp9_lpf_msa.c | 94 ++---
1 file changed, 14 insertions(+), 80 deletions(
From: Kaustubh Raste
Replace memset call with msa stores.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264idct_msa.c | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/libavcodec/mips/h264idct_msa.c b/libavcodec/mips/h264idct_msa.c
index 861befe..1e1a5c8 100644
--
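To make the "replace memset call with msa stores" idea concrete, here is a portable sketch that clears a small coefficient block with two fixed-size 16-byte stores instead of a general-purpose memset call. The `vec8h` type is a hypothetical stand-in for a 128-bit vector register, and the 4x4 block of 16-bit coefficients is an assumption, since the snippet does not say which block the patch touches.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for a 128-bit vector of eight 16-bit lanes. */
typedef struct {
    int16_t lane[8];
} vec8h;

/* One fixed-size 16-byte store. A memcpy with a constant size is
 * typically lowered by the compiler to plain store instructions. */
static inline void store_vec8h(int16_t *dst, vec8h v)
{
    memcpy(dst, &v, sizeof v);
}

/* Zero a 4x4 block of 16-bit coefficients (32 bytes) with two stores. */
static inline void clear_block_4x4(int16_t *block)
{
    assert(block != NULL);
    const vec8h zero = { { 0 } };
    store_vec8h(block, zero);
    store_vec8h(block + 8, zero);
}
```

Because the size is fixed at compile time, there is no call overhead and no length dispatch inside the clearing routine.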
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264idct_msa.c | 66 +++-
1 file changed, 32 insertions(+), 34 deletions(-)
diff --git a/libavcodec/mips/h264idct_msa.c b/libavcodec/mips/h264idct_msa.c
index 81e09e9..861befe 100644
--
From: Kaustubh Raste
Removed memset call and improved performance.
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/h264idct_msa.c | 104 +++
libavutil/mips/generic_macros_msa.h | 18 ++
2 files changed, 74 insertions(+), 48 deletions(-)
diff --git
From: Kaustubh Raste
Reduced msa load-store code.
Removed inline asm of GP load-store for 64 bit.
Updated variable names in GP load-store macros for naming consistency.
Corrected macro descriptions.
Signed-off-by: Kaustubh Raste
---
libavutil/mips/generic_macros_msa.h | 629 ++---
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/Makefile        |  1 +
libavcodec/mips/videodsp_init.c | 51 +++
libavcodec/videodsp.c           |  2 ++
libavcodec/videodsp.h           |  1 +
4 files changed, 55 insertions(+)
From: Kaustubh Raste
Signed-off-by: Kaustubh Raste
---
libavcodec/mips/Makefile        |  1 +
libavcodec/mips/videodsp_mips.c | 42 +++
libavcodec/videodsp.c           |  2 ++
libavcodec/videodsp.h           |  1 +
4 files changed, 46 insertions(+)
62 matches