On Fri, Sep 15, 2017 at 1:03 AM, Oleksandr Natalenko <oleksa...@natalenko.name> wrote:
> Hi.
>
> I've applied your test patch but it doesn't fix the issue for me since the
> warning is still there.
>
> Were you able to reproduce it?
Hi,

Thanks for testing that. That is a very useful data point.

I was able to cook up a packetdrill test that could put the connection
in CA_Disorder with retransmitted packets out, but not in CA_Open. So
we do not yet have a test case to reproduce this.

We do not see this warning on our fleet at Google. One significant
difference I see between our environment and yours is that it seems
you run with FACK enabled:

  net.ipv4.tcp_fack = 1

Note that FACK was disabled by default (since it was replaced by RACK)
between kernel v4.10 and v4.11. And this is exactly the time when this
bug started manifesting itself for you and some others, but not our
fleet. So my new working hypothesis would be that this warning is due
to a behavior that only shows up in kernels >=4.11 when FACK is
enabled.

Would you be able to disable FACK ("sysctl net.ipv4.tcp_fack=0" at
boot, or net.ipv4.tcp_fack=0 in /etc/sysctl.conf, or equivalent),
reboot, and test the kernel for a few days to see if the warning still
pops up?

thanks,
neal

[ps: apologies for the previous, mis-formatted post...]
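
In case it helps confirm that the setting actually took effect after the reboot, here is a minimal sketch that reads the value back through procfs. It assumes the sysctl is exposed at /proc/sys/net/ipv4/tcp_fack, which is the usual procfs mapping for net.ipv4.tcp_fack. The helper names read_fack and describe_fack are made up for illustration and are not part of any existing tool.

```python
from pathlib import Path
from typing import Optional

# Assumed procfs location of the net.ipv4.tcp_fack sysctl.
FACK_SYSCTL = Path("/proc/sys/net/ipv4/tcp_fack")


def read_fack(path: Path = FACK_SYSCTL) -> Optional[int]:
    """Return the sysctl value as an int, or None if the file is absent."""
    try:
        return int(path.read_text().strip())
    except FileNotFoundError:
        return None


def describe_fack(value: Optional[int]) -> str:
    """Turn the raw sysctl value into a short human-readable status."""
    if value is None:
        return "tcp_fack sysctl not present"
    if value == 0:
        return "FACK disabled"
    return "FACK enabled"


if __name__ == "__main__":
    print(describe_fack(read_fack()))
```

Running "sysctl net.ipv4.tcp_fack" by hand reports the same value; the script is only a convenience if you want to log the setting alongside the test run.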