Re: [patch, avr, pr71676 and pr71678] Issues with casesi expand

Pitchumani Sivanupandi Wed, 19 Oct 2016 22:59:04 -0700

On Wednesday 19 October 2016 07:51 PM, Georg-Johann Lay wrote:

On 17.10.2016 09:27, Pitchumani Sivanupandi wrote:
On Thursday 13 October 2016 08:42 PM, Georg-Johann Lay wrote:
On 13.10.2016 13:44, Pitchumani Sivanupandi wrote:
On Monday 26 September 2016 08:19 PM, Georg-Johann Lay wrote:
On 26.09.2016 15:19, Pitchumani Sivanupandi wrote:
Attached patch for PR71676 and PR71678.
PR71676 is for AVR target that generates wrong code when switchcase index is
more than 16 bits.
Switch case index of larger than SImode are checked for out ofrange before'casesi' expand. RTL expand of casesi gets index as SImode, butindex is
compared in HImode and ignores upper 16bits.
Attached patch changes the expansion for casesi to make the indexcomparison
in SImode and code generation accordingly.

PR71678 is ICE because below pattern in 'casesi' is not recognized.
(set (reg:HI 47)
     (minus:HI (subreg:HI (subreg:SI (reg:DI 44) 0) 0)
               (reg:HI 45)))

Fix of PR71676 avoids the above pattern as it changes the comparison
to SImode.
But this means that all comparisons are now performed in SImodewhich is agreat performance loss for most programs which will switch on16-bit values.
IMO we need a less intrusive (w.r.t. performance) approach.
Yes.
I tried to split 'casesi' into several based on case values so thatcompare is
done
in less expensive modes (i.e. QI or HI). In few cases it is notpossible
without
SImode subtract/ compare.
Pattern casesi will have index in SI mode. So, out of range checkswill be
expensive
as most common uses (in AVR) of case values will be in QI/HI mode.

e.g.
  if case values in QI range
    if upper three bytes index is set
      goto out_of_range

    offset = index - lower_bound (QImode)
    if offset > case_range       (QImode)
      goto out_of_range
    goto jump_table + offset

  else if case values in HI range
    if index[2,3] is set
      goto out_of_range

    offset = index - lower_bound (HImode)
    if offset > case_range       (HImode)
      goto out_of_range
    goto jump_table + offset
This modification will not work for the negative index values.Because code to
check
upper bytes of index will be expensive than the SImode subtract/compare.
So, I'm trying to update fix to have SImode subtract/ compare ifthe case
values include
negative integers. For, others will try to optimize as mentionedabove. Is that
approach OK?
But the above code will be executed at run time and add even moreoverhead,or am I missing something? If you conclude statically at expandtime fromthe case ranges then we might hit a similar problem as with theoriginal
subreg computation.
No. Lower bound and case range are const_int_operand, known atcompile time.
Yes, but if the range if form 10 to 90, say, then you still don't knowwhether HImode and QImode is appropriate or not which adds to codesize and register pressure.
As I mentioned earlier, I am working on a different approach whichwould revert your changes: The casesi is basically unaltered (exceptfor operand clean-ups and avoidance of clobbering subregs).
The ups of my approach are:

* The original value is known and whether is is QI or HI.

* The signedness is known which allows to use the maximum range of
  QI resp. HI depending on the sign.

* Also works on negative values.

* All is done at compile time, no need for extra code.

* No intermediate 32-bit values, no unnecessary increase of reg pressure.

* Optimization can be switched off by -fdisable if desired.

* Result can be seen in dumps.

The downsides are:

* Also needs some lines of code (~400).

* Makes assumptions on the anatomy of the code, i.e. extension
  precedes casesi.
First we should decide which route to follow as the changes areconflicting each other. I have not so much time to work on the stuffbut the results are promising. If you are interested in the changes,I can supply it but it's all still work in progress.


Ok. I'll put this patch on hold for now.
Please share if you have draft version of your fix is ready.

Thanks.

Regards,
Pitchumani

Re: [patch, avr, pr71676 and pr71678] Issues with casesi expand

Reply via email to