Hi Ihar,

On 4/3/24 4:37 PM, Ihar Hrachyshka wrote:
On Wed, Apr 3, 2024 at 4:15 PM Brian Haley via discuss <ovs-discuss@openvswitch.org <mailto:ovs-discuss@openvswitch.org>> wrote:

    Hi,

    I recently have been seeing issues in a large environment where the
    listen backlog of ovsdb-server, both NB and SB, is getting over-flowed,
    for example:

    17842 times the listen queue of a socket overflowed
    17842 SYNs to LISTEN sockets dropped

    There is more on NB than SB, but I was surprised to see any. I can only
    guess at the moment it is happening when the leader changes and
    hundreds
    of nodes try and reconnect.

    Looking at their sockets I can see the backlog is only set to 10:

    $ ss -ltm | grep 664
    LISTEN 0      10 0.0.0.0:6641 <http://0.0.0.0:6641>        0.0.0.0:*
    LISTEN 0      10 0.0.0.0:6642 <http://0.0.0.0:6642>        0.0.0.0:*

    Digging into the code, there is only two places where listen() is
    called, one being inet_open_passive():

          /* Listen. */
          if (style == SOCK_STREAM && listen(fd, 10) < 0) {
              error = sock_errno();
              VLOG_ERR("%s: listen: %s", target, sock_strerror(error));
              goto error;
          }

    There is no way to config around this to even test if increasing would
    help in a running environment.

    So my question is two-fold:

    1) Should this be increased? 128, 256, 1024? I can send a patch.

    2) Should this be configurable?

    Has anyone else seen this?


Yes, I've seen it, though I was focusing on AF_UNIX behavior. You may want to check this series: https://patchwork.ozlabs.org/project/openvswitch/list/?series=382739&state=%2A&archive=both <https://patchwork.ozlabs.org/project/openvswitch/list/?series=382739&state=%2A&archive=both>

Specifically, it includes a patch to bump the value for INET sockets to 64: https://patchwork.ozlabs.org/project/openvswitch/patch/20231118010703.4154866-2-ihrac...@redhat.com/ <https://patchwork.ozlabs.org/project/openvswitch/patch/20231118010703.4154866-2-ihrac...@redhat.com/>

The series includes a number of other fixes related to unixctl socket handling; I am planning to revive this series in next few months, but perhaps we could untangle this bump limit patch from the series and merge it independently.

As to the question of potential configurability, I have nothing to contribute.

Thanks for the quick response Ihar, I had actually missed the python code as I only looked in *.c. I can see if bumping to 64 helps, I just don't know how quickly it will be deployed and tested.

Regarding merging it independently, it seems from looking at the series it might be something that could/should be backported in it's entirety. If not I can try and break-out that one patch for previous versions.

And regarding configurability I can only say it would have been good to have it to debug this easily, as opposed to re-building things.

-Brian
_______________________________________________
discuss mailing list
disc...@openvswitch.org
https://mail.openvswitch.org/mailman/listinfo/ovs-discuss

Reply via email to