Re: [Patch] PR rtl-optimization/87763 - generate more bfi instructions on aarch64

Wilco Dijkstra Thu, 07 Feb 2019 10:14:03 -0800

Hi Steve,

>> After special cases you could do something like t = mask2 + (HWI_1U << 
>> shift);
>> return t == (t & -t) to check for a valid bfi.
>
> I am not sure I follow this logic and my attempts to use this did not
> work so I kept my original code.


It's similar to the initial code in aarch64_bitmask_imm, but rather than adding
the lowest bit to the value to verify it is a mask (val + (val & -val)), we use 
the
shift instead. If the shift is exactly right, it reaches the first set bit of 
the mask. 
Adding the low bit to a valid mask always results in zero or a single set bit. 
The standard idiom to check that is t == (t & -t).

>> +  "bfi\t%<GPI:w>0, %<GPI:w>1, 0, %P2"
>> 
>> This could emit a width that may be 32 too large in SImode if bit 31 is set
>> (there is another use of %P in aarch64.md which may have the same
>> issue).
>
> I am not sure why having bit 31 set would be a problem.  Sign
> extension?

Yes, if bit 31 is set, %P will emit 33 for a 32-bit constant which is obviously 
wrong.
Your patch avoids this for bfi by explicitly computing the correct value.

This looks good to me (and creates useful bfi's as expected), but I can't 
approve.

Wilco

Re: [Patch] PR rtl-optimization/87763 - generate more bfi instructions on aarch64

Reply via email to