On Mon, Jul 31, 2017 at 11:49 AM, Neal Cardwell <ncardw...@google.com> wrote: > On Sun, Jul 30, 2017 at 11:29 PM, maowenan <maowe...@huawei.com> wrote: >> [Mao Wenan]please refer to the attachment, test.pkt is packetdrill script. >> In test.pcap, packet number 17 is the TLP probe, packet number 218 is the >> retransmission packet because client don't send data packet to server. >> From the capture time, there are about 6 seconds the retransmission >> packet can be sent, and this time can be added more as long as client >> send data packet continually. >> I have reproduced this issue in Linux 4.13-rc3, 3.10, 4.1. Please check the >> timing >> When you use test.pkt to reproduce in your environment. > > Thank you for your very nice packetdrill test case illustrating this > problem! And thanks for verifying that the problem shows up in those > kernel versions. > > We are able to run the script in our environment and both verify that > the bug is the one we hypothesized, and verify our proposed patch > fixes it (the RTO for the TLP happens 221ms after the TLP, instead of > ~5 secs later). We will send out our proposed patches ASAP.
The timer patches are upstream for review for the "net" branch: https://patchwork.ozlabs.org/patch/796057/ https://patchwork.ozlabs.org/patch/796058/ https://patchwork.ozlabs.org/patch/796059/ Again, thank you for reporting this and providing a packetdrill script to reproduce this! neal