On Wed, Jan 18, 2017 at 12:37:25PM -0200, Marcelo Tosatti wrote:
> On Wed, Jan 18, 2017 at 01:46:58PM +0100, Paolo Bonzini wrote:
> > 
> > 
> > On 18/01/2017 13:24, Marcelo Tosatti wrote:
> > > On Wed, Jan 18, 2017 at 10:17:38AM -0200, Marcelo Tosatti wrote:
> > >> On Tue, Jan 17, 2017 at 04:36:21PM +0100, Radim Krcmar wrote:
> > >>> 2017-01-17 09:30-0200, Marcelo Tosatti:
> > >>>> On Tue, Jan 17, 2017 at 09:03:27AM +0100, Miroslav Lichvar wrote:
> > >>>>> Users of the PTP_SYS_OFFSET ioctl assume that (ts[0]+ts[2])/2
> > >>>>> corresponds to ts[1], (ts[2]+ts[4])/2 corresponds to ts[3], and so on.
> > >>>>>
> > >>>>>                     ts[1]     ts[3]
> > >>>>> Host time    ---------+---------+........
> > >>>>>                       |         |
> > >>>>>                       |         |
> > >>>>> Guest time   ----+---------+---------+......
> > >>>>>                 ts[0]    ts[2]     ts[4]
> > >>>
> > >>> KVM PTP delay moves host ts[i] to be close to guest ts[i+1] and makes
> > >>> the offset very consistent, so the graph would look like:
> > >>>
> > >>>                         ts[1]     ts[3]
> > >>> Host time    -------------+---------+........
> > >>>                           |         |
> > >>>                           |         |
> > >>> Guest time   ----+---------+---------+......
> > >>>                 ts[0]    ts[2]     ts[4]
> > >>>
> > >>> which doesn't sound good if users assume that the host reading is in the
> > >>> middle -- the guest time would be ahead of the host time.
> > >>
> > >> Test case: run a guest and a loop sending SIGUSR1 to vcpu0 (emulating
> > >> intense interrupts). Results follow:
> > >>
> > >> Without TSC delta calculation:
> > >> =============================
> > >>
> > >> #* PHC0                          0   3   377     2    -99ns[ +206ns] +/-  116ns
> > >> #* PHC0                          0   3   377     8   +202ns[ +249ns] +/-  111ns
> > >> #* PHC0                          0   3   377     8   -213ns[ +683ns] +/-   88ns
> > >> #* PHC0                          0   3   377     6    +77ns[ +319ns] +/-   56ns
> > >> #* PHC0                          0   3   377     4   -771ns[-1029ns] +/-   93ns
> > >> #* PHC0                          0   3   377    10    -49ns[  -58ns] +/-  121ns
> > >> #* PHC0                          0   3   377     9   +562ns[ +703ns] +/-  107ns
> > >> #* PHC0                          0   3   377     6     -2ns[   -3ns] +/-   94ns
> > >> #* PHC0                          0   3   377     4   +451ns[ +494ns] +/-  138ns
> > >> #* PHC0                          0   3   377    11    -67ns[  -74ns] +/-  113ns
> > >> #* PHC0                          0   3   377     8   +244ns[ +264ns] +/-  119ns
> > >> #* PHC0                          0   3   377     7   -696ns[ -890ns] +/-   89ns
> > >> #* PHC0                          0   3   377     4   +468ns[ +560ns] +/-  110ns
> > >> #* PHC0                          0   3   377    11   -310ns[ -430ns] +/-   72ns
> > >> #* PHC0                          0   3   377     9   +189ns[ +298ns] +/-   54ns
> > >> #* PHC0                          0   3   377     7   +594ns[ +473ns] +/-   96ns
> > >> #* PHC0                          0   3   377     5   +151ns[ +280ns] +/-   71ns
> > >> #* PHC0                          0   3   377    10   -590ns[ -696ns] +/-   94ns
> > >> #* PHC0                          0   3   377     8   +415ns[ +526ns] +/-   74ns
> > >> #* PHC0                          0   3   377     6  +1381ns[+1469ns] +/-  101ns
> > >> #* PHC0                          0   3   377     4   +571ns[+1304ns] +/-   54ns
> > >> #* PHC0                          0   3   377     8     -5ns[  +71ns] +/-  139ns
> > >> #* PHC0                          0   3   377     7   -247ns[ -502ns] +/-   69ns
> > >> #* PHC0                          0   3   377     5   -283ns[ +879ns] +/-   73ns
> > >> #* PHC0                          0   3   377     3   +148ns[ -109ns] +/-   61ns
> > >>
> > >> With TSC delta calculation:
> > >> ============================
> > >>
> > >> #* PHC0                          0   3   377     7   +379ns[ +432ns] +/-   53ns
> > >> #* PHC0                          0   3   377     9   +106ns[ +420ns] +/-   42ns
> > >> #* PHC0                          0   3   377     7    -58ns[ -136ns] +/-   62ns
> > >> #* PHC0                          0   3   377    12    +93ns[  -38ns] +/-   64ns
> > >> #* PHC0                          0   3   377     8    +84ns[ +107ns] +/-   69ns
> > >> #* PHC0                          0   3   377     3    -76ns[ -103ns] +/-   52ns
> > >> #* PHC0                          0   3   377     7    +52ns[  +63ns] +/-   50ns
> > >> #* PHC0                          0   3   377    11    +29ns[  +31ns] +/-   70ns
> > >> #* PHC0                          0   3   377     7    -47ns[  -56ns] +/-   42ns
> > >> #* PHC0                          0   3   377    10    -35ns[  -42ns] +/-   33ns
> > >> #* PHC0                          0   3   377     7    -32ns[  -34ns] +/-   42ns
> > >> #* PHC0                          0   3   377    11   -172ns[ -173ns] +/-  118ns
> > >> #* PHC0                          0   3   377     6    +65ns[  +76ns] +/-   23ns
> > >> #* PHC0                          0   3   377     9    +18ns[  +23ns] +/-   37ns
> > >> #* PHC0                          0   3   377     6    +41ns[  -60ns] +/-   30ns
> > >> #* PHC0                          0   3   377    10    +39ns[ +183ns] +/-   42ns
> > >> #* PHC0                          0   3   377     6    +50ns[ +102ns] +/-   86ns
> > >> #* PHC0                          0   3   377    11    +50ns[  +75ns] +/-   52ns
> > >> #* PHC0                          0   3   377     6    +50ns[ +116ns] +/-  100ns
> > >> #* PHC0                          0   3   377    10    +46ns[  +65ns] +/-   79ns
> > >> #* PHC0                          0   3   377     7    -38ns[  -51ns] +/-   29ns
> > >> #* PHC0                          0   3   377    10    -11ns[  -12ns] +/-   32ns
> > >> #* PHC0                          0   3   377     7    -31ns[  -32ns] +/-   99ns
> > >> #* PHC0                          0   3   377    10   +222ns[ +238ns] +/-   58ns
> > >> #* PHC0                          0   3   377     6   +185ns[ +207ns] +/-   39ns
> > >> #* PHC0                          0   3   377    10   -392ns[ -394ns] +/-  118ns
> > >> #* PHC0                          0   3   377     6     -9ns[  -50ns] +/-   35ns
> > >> #* PHC0                          0   3   377    10   -346ns[ -355ns] +/-  111ns
> > >>
> > >>
> > >> Do you still want to drop it in favour of simplicity?
> > > 
> > > This is the output of "chronyc sources". See section "Time sources"
> > > of https://chrony.tuxfamily.org/doc/2.4/chronyc.html.
> > 
> > It's just that it's not obvious why you get better results with biased
> > host timestamps.  What makes the biased host timestamp more precise?
> > 
> > I'd rather use PTP_SYS_OFFSET_PRECISE instead, but unfortunately chrony
> > does not support it---but I would still prefer you to support
> > PTP_SYS_OFFSET_PRECISE as well.
> 
> A single TSC read could be used to implement the PRECISE ioctl, but if
> a timer interrupt takes place on either the host or the guest, and that
> timer interrupt "adds" the TSC delta to xtime.nsec/xtime.sec, then that
> single TSC read cannot be used.
> 
> So you would have to stop timer interrupts (in guest and host) for the
> duration of the PRECISE ioctl in the guest to avoid that situation,
> which seems a bit overkill to me.
> 
> Any other ideas?

Could have a hypercall that disables host timer interrupts for 
a specified amount of time... But that does not scale with multiple VMs.
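
For reference, the midpoint assumption quoted at the top of the thread
can be sketched roughly as below. This is only an illustration in Python,
not chrony's or the kernel's actual code; the function name
midpoint_offsets and the sample values are made up. It assumes ts[]
alternates guest (system) and host (PHC) readings as in the diagrams
above, in nanoseconds.

```python
def midpoint_offsets(ts):
    """Return (offset_ns, delay_ns) for each host reading in ts.

    ts is assumed to alternate guest, host, guest, host, ..., guest,
    as in the PTP_SYS_OFFSET diagrams quoted above. Each host reading
    ts[i] is compared against the midpoint of the guest readings
    ts[i-1] and ts[i+1] that surround it.
    """
    if len(ts) < 3 or len(ts) % 2 == 0:
        raise ValueError("expected an odd number of timestamps, at least 3")
    results = []
    for i in range(1, len(ts), 2):
        before, host, after = ts[i - 1], ts[i], ts[i + 1]
        midpoint = (before + after) / 2
        # offset: host reading minus the assumed guest time at that instant
        # delay: width of the guest window around the host reading
        results.append((host - midpoint, after - before))
    return results


# Hypothetical samples: clocks agree, host read lands in the middle.
centered = midpoint_offsets([0, 50, 100, 150, 200])

# Hypothetical samples: clocks still agree, but each host read is taken
# just before the following guest read (the biased case in the thread).
biased = midpoint_offsets([0, 95, 100, 195, 200])
```

With these made-up numbers the centered case gives a zero offset, while
the biased case reports a constant 45 ns offset even though the two
clocks agree. That is the consistent-but-shifted result the second
diagram describes.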
