Matthew Fortune <matthew.fort...@imgtec.com> writes:
> Richard Sandiford <rdsandif...@googlemail.com> writes:
>> Matthew Fortune <matthew.fort...@imgtec.com> writes:
>> > Hi Richard,
>> >
>> > As part of implementing the new O32 FPXX ABI I am making use of the
>> > HARD_REGNO_CALL_PART_CLOBBERED macro to allow odd-numbered
>> > floating-point registers to be considered as 'normally' callee-saved
>> > but call clobbered if they are being used to hold SImode or SFmode
>> > data. The macro is implemented as:
>> >
>> > /* Odd numbered single precision registers are not considered call
>> saved
>> >    for O32 FPXX as they will be clobbered when run on an FR=1 FPU.
>> */
>> > #define HARD_REGNO_CALL_PART_CLOBBERED(REGNO, MODE)
>> \
>> >   (TARGET_FLOATXX && ((MODE) == SFmode || (MODE) == SImode)
>> \
>> >    && FP_REG_P (REGNO) && (REGNO & 1))
>> 
>> Under these conditions the entire value is call-clobbered though.
>> It might be better to say that the odd-numbered registers are always
>> call-clobbered (which I think is more accurate) but force them to be
>> saved by functions that use them.  This is in some ways similar to the
>> way that interrupt handlers save call-clobbered registers.
>
> The problem here is that these registers form the second part of 64-bit 
> call saved registers and therefore have to be marked as call-saved for 
> 64-bit values regardless of their behaviour for 32-bit values.  I have 
> tried to simply mark them as call-clobbered and the effect was that 
> 64-bit values ended up being seen as call-clobbered.

Ah, OK.

> Approaching this problem from both sides leads to complications as O32
> FPXX has a register which behaves differently based on mode. From
> experimentation I believe this approach to be the neatest and use
> existing features of GCC.
>
>> Maybe some RA heuristics will need tweaking to reflect the extra cost
>> of these registers, but I imagine that's true either way.
>
> Perhaps. I am currently thinking/hoping that simply having them marked 
> as call clobbered will be enough.

FWIW, I have some patches queued for stage 1 that tell the
target-independent code which registers are saved by the current
function.  This makes things like interrupt functions less magical.
Maybe it would help with exposing the "call-clobbered and call-saved"
thing to RA.

> In terms of the HARD_REGNO_CALL_PART_CLOBBERED macro. I would say that 
> it is poorly named as it gives no indication to GCC internals as to 
> which part of the register is clobbered and which is not.

I don't think it was really supposed to matter.  (Not that I'm defending
the name. :-))

> When this macro 
> returns true GCC simply takes that to mean the whole register is 
> clobbered. I'd say that then makes my usage legitimate. If I had time 
> and will power I'd remove the 'PART' from this macro and switch over to 
> having it as the primary source of call-clobbered information in GCC 
> doing away with call_used_regs etc. Having the mode information is quite 
> valuable as we are seeing with SIMD ISA extensions that use this macro. 
> Supporting two sources of call-clobbered data is probably what lead us to 
> the current situation of broken optimisation passes.

Definitely agree with the last part.  But I think it'd be better to fix
it the other way: get rid of the existing HARD_REGNO_CALL_PART_CLOBBERED
uses in the generic code and replace things like call_used_regs with
mode-indexed HARD_REG_SETs of the call-clobbered registers.  You could
then add a cleaner target interface that says whether a given register
is call-clobbered in a given mode.  The default could use the existing
HARD_REGNO_CALL_PART_CLOBBERED, CALL_USED_REGISTERS and
CALL_REALLY_USED_REGISTERS (another horrible part of the interface).

Doing it that way wouldn't involve any changes to port-specific code
other than MIPS.  Other ports could use the default implementation
and switch to the new interface later.

I realise that might be more work than you were planning on,
but if you're having to change existing passes anyway then we
might as well fix it in a way that reduces the number of places
that need to check two things instead of one.

All IMO of course.  I don't maintain this part of GCC.

Thanks,
Richard

Reply via email to