On Wed, Aug 14, 2019 at 02:46:38PM +1000, Jordan Niethe wrote:
> On Tue, 2019-08-13 at 20:03 +1000, Paul Mackerras wrote:

[snip]
> > diff --git a/arch/powerpc/kvm/book3s_hv_rmhandlers.S
> > b/arch/powerpc/kvm/book3s_hv_rmhandlers.S
> > index 337e644..2e7e788 100644
> > --- a/arch/powerpc/kvm/book3s_hv_rmhandlers.S
> > +++ b/arch/powerpc/kvm/book3s_hv_rmhandlers.S
> > @@ -2831,29 +2831,39 @@ kvm_cede_prodded:
> >  kvm_cede_exit:
> >     ld      r9, HSTATE_KVM_VCPU(r13)
> >  #ifdef CONFIG_KVM_XICS
> > -   /* Abort if we still have a pending escalation */
> > +   /* are we using XIVE with single escalation? */
> > +   ld      r10, VCPU_XIVE_ESC_VADDR(r9)
> > +   cmpdi   r10, 0
> > +   beq     3f
> > +   li      r6, XIVE_ESB_SET_PQ_00
> Would it make sense to put the above instruction down into the 4: label
> instead? If we do not branch to 4, r6 is overwriten anyway. 

Right.

> I think that would save a load when we do not branch to 4. Also it

Well, li is a load immediate rather than a load ("load" would normally
imply a load from memory).  Load-immediate instructions are
essentially free since they can easily be executed in parallel with
other instructions and execute in a single cycle.

> would mean that you could use r5 everywhere instead of changing it to
> r6? 

Yes.  If I have to respin the patch for other reasons then I will
rearrange things as you suggest.  I don't think it's worth respinning
just for this change -- it won't reduce the total number of
instructions, and I strongly doubt there would be any measurable
performance difference.

> > +   /*
> > +    * If we still have a pending escalation, abort the cede,
> > +    * and we must set PQ to 10 rather than 00 so that we don't
> > +    * potentially end up with two entries for the escalation
> > +    * interrupt in the XIVE interrupt queue.  In that case
> > +    * we also don't want to set xive_esc_on to 1 here in
> > +    * case we race with xive_esc_irq().
> > +    */
> >     lbz     r5, VCPU_XIVE_ESC_ON(r9)
> >     cmpwi   r5, 0
> > -   beq     1f
> > +   beq     4f
> >     li      r0, 0
> >     stb     r0, VCPU_CEDED(r9)
> > -1: /* Enable XIVE escalation */
> > -   li      r5, XIVE_ESB_SET_PQ_00
> > +   li      r6, XIVE_ESB_SET_PQ_10
> > +   b       5f
> > +4: li      r0, 1
> > +   stb     r0, VCPU_XIVE_ESC_ON(r9)
> > +   /* make sure store to xive_esc_on is seen before xive_esc_irq
> > runs */
> > +   sync
> > +5: /* Enable XIVE escalation */
> >     mfmsr   r0
> >     andi.   r0, r0, MSR_DR          /* in real mode? */
> >     beq     1f
> > -   ld      r10, VCPU_XIVE_ESC_VADDR(r9)
> > -   cmpdi   r10, 0
> > -   beq     3f
> > -   ldx     r0, r10, r5
> > +   ldx     r0, r10, r6
> >     b       2f
> >  1: ld      r10, VCPU_XIVE_ESC_RADDR(r9)
> > -   cmpdi   r10, 0
> > -   beq     3f
> > -   ldcix   r0, r10, r5
> > +   ldcix   r0, r10, r6
> >  2: sync
> > -   li      r0, 1
> > -   stb     r0, VCPU_XIVE_ESC_ON(r9)
> >  #endif /* CONFIG_KVM_XICS */
> >  3: b       guest_exit_cont
> >  

Paul.

Reply via email to