On Fri, Jan 24, 2014 at 10:11:20PM +0100, Jakub Jelinek wrote:
> On Wed, Jan 01, 2014 at 03:30:04PM +0100, Jan Hubicka wrote:
> >     * config/i38/x86-tune.def: Disable X86_TUNE_ACCUMULATE_OUTGOING_ARGS
> >     for generic and recent AMD chips
> > Index: config/i386/x86-tune.def
> > ===================================================================
> > --- config/i386/x86-tune.def        (revision 206233)
> > +++ config/i386/x86-tune.def        (working copy)
> > @@ -143,7 +143,7 @@ DEF_TUNE (X86_TUNE_REASSOC_FP_TO_PARALLE
> >     regression on mgrid due to IRA limitation leading to unecessary
> >     use of the frame pointer in 32bit mode.  */
> >  DEF_TUNE (X86_TUNE_ACCUMULATE_OUTGOING_ARGS, "accumulate_outgoing_args",
> > -     m_PPRO | m_P4_NOCONA | m_BONNELL | m_SILVERMONT | m_AMD_MULTIPLE | 
> > m_GENERIC)
> > +     m_PPRO | m_P4_NOCONA | m_BONNELL | m_SILVERMONT | m_ATHLON_K8)
> >  
> >  /* X86_TUNE_PROLOGUE_USING_MOVE: Do not use push/pop in prologues that are
> >     considered on critical path.  */
> 
> Are you sure this is a good idea even for 32-bit code (i.e. shouldn't we
> have separate tunables for 32-bit and 64-bit code)?
> I admit I haven't performed trunk bootstraps/regtests for 3 days, am doing
> x86_64 and i686 bootstraps/regtests concurrently and it is yes,rtl checking,
> but am quite surprised that compared to 3 days ago the bootstrap time of
> i686-linux (all,obj-c++,go) went up from about 70 minutes or so to 140 
> minutes today,
> while the x86_64-linux (all,obj-c++,go,ada) remained basically the same
> around 2 hours.  This is on quad socket Quad-Core AMD Opteron(tm) Processor 
> 8354,
> perhaps it is just extremely undesirable there.

Most likely the big slowdown is var-tracking, at least stage2 insn-recog.o
(yes,rtl checking on i686-linux) took <= 23 minutes to compile and stage3
<= 62 minutes, with 45 minutes from that only spent on compiling
insn-recog.o and nothing else.

        Jakub

Reply via email to