[RFC] Partial vectors for s390

Robin Dapp via Gcc-patches Wed, 20 Oct 2021 01:34:55 -0700

Hi,

I have been playing around with making Kewen's partial vector changesworkable with s390:

We have a vll instruction that can be passed the highest byte to load.The rather unfortunate consequence of this is that a length of zerocannot be specified. The partial vector framework, however, relies alot on the fact that a len_load can be made a NOP using a length of zero.

After confirming an additional zero-check before each vll is definitelytoo slow across SPEC and some discussion with Kewen we figured theeasiest way forward is to exclude loops with multiple VFs (despitegiving up vectorization possibilities). These are prone to len_loadswith zero while the regular induction variable check prevents them insingle-VF loops.


So, as a quick hack, I went with

diff --git a/gcc/tree-vect-loop.c b/gcc/tree-vect-loop.c
index 75f24e7c4f6..f79222daeb6 100644
--- a/gcc/tree-vect-loop.c
+++ b/gcc/tree-vect-loop.c
@@ -1170,6 +1170,9 @@ vect_verify_loop_lens (loop_vec_info loop_vinfo)
   if (LOOP_VINFO_LENS (loop_vinfo).is_empty ())
     return false;

+  if (LOOP_VINFO_LENS (loop_vinfo).length () > 1)
+    return false;
+

which could be made a hook, eventually. FWIW this is sufficient to makebootstrap, regtest and compiling the SPEC suites succeed. I'm unsurewhether we are guaranteed not to emit len_load with zero now. On top,I subtract 1 from the passed length in the expander, which, supposedly,is also not ideal.

There are some regressions that I haven't fully analyzed yet but whetherand when to actually enable this feature could be a backend decisionwith the necessary middle-end checks already in place.

Any ideas on how to properly check for the zero condition and excludethe cases that cause it? Kewen suggested enriching the len_load optabswith a separate parameter.


Regards
 Robin

[RFC] Partial vectors for s390

Reply via email to