Re: [PATCH 5/8] aarch64: Tweak the cost of elementwise stores

2021-08-04 Thread Richard Biener via Gcc-patches
On Tue, Aug 3, 2021 at 2:09 PM Richard Sandiford via Gcc-patches wrote: > > When the vectoriser scalarises a strided store, it counts one > scalar_store for each element plus one vec_to_scalar extraction > for each element. However, extracting element 0 is free on AArch64, > so it should have zer

[PATCH 5/8] aarch64: Tweak the cost of elementwise stores

2021-08-03 Thread Richard Sandiford via Gcc-patches
When the vectoriser scalarises a strided store, it counts one scalar_store for each element plus one vec_to_scalar extraction for each element. However, extracting element 0 is free on AArch64, so it should have zero cost. I don't have a testcase that requires this for existing -mtune options, bu