Hi Jiu Fu,

On Mon, Jul 13, 2020 at 07:50:28PM +0800, guojiufu wrote:
> For very small loops (< 6 insns), it would be fine to unroll 4
> times to run fast with less latency and better cache usage.
> - /* TODO: This is hardcoded to 10 right now. It can be refined, for
> - example we may want to unroll very small loops more times (4 perhaps).
> - We also should use a PARAM for this. */
> + /* TODO: Using hardcodes here, for tunable, PARAM(s) maybe refined. */

/* TODO: These are hardcoded values right now. We probably should use a
   PARAM here. */

Okay for trunk with that. Thanks!


Segher