https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=246660

            Bug ID: 246660
           Summary: Sporadic LACP Lagg Flap
           Product: Base System
           Version: 12.1-STABLE
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Some People
          Priority: ---
         Component: kern
          Assignee: b...@freebsd.org
          Reporter: nones...@longcount.org

On FreeBSD 12-STABLE r354698 amd64, with LACP laggs on top of Solarflare sfxge
NICs (7k and 8k series cards) attached to Arista MLAGs (7050s or 7150s
running EOS 4.0.19 and newer), the FreeBSD side sporadically stops setting
the LACP Distributing flag, causing the switch to detect a link flap.
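
For anyone comparing captures: the flag in question is one bit of the
actor state octet carried in each LACPDU (IEEE 802.1AX). Below is a minimal
sketch for decoding that octet by hand. The helper name decode_lacp_state
is hypothetical and not part of FreeBSD; only the bit positions come from
the standard.

```sh
#!/bin/sh
# Decode a LACP actor/partner state octet into its flag names.
# decode_lacp_state is a hypothetical helper used for illustration only.
decode_lacp_state() {
    state=$(( $1 ))     # accepts decimal or 0x-prefixed hex
    out=""
    # bit:name pairs as defined for the LACPDU state field
    for pair in 1:ACTIVITY 2:TIMEOUT 4:AGGREGATION 8:SYNC \
                16:COLLECTING 32:DISTRIBUTING 64:DEFAULTED 128:EXPIRED; do
        bit=${pair%%:*}
        name=${pair#*:}
        if [ $(( state & bit )) -ne 0 ]; then
            out="${out:+$out,}$name"
        fi
    done
    printf '%s\n' "$out"
}
```

With this, a state octet of 0x3d decodes with DISTRIBUTING set, while 0x1d
is the same state with the Distributing bit cleared.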


The setup is as follows: Dell R630s and R640s and Supermicro X9DRT Ivy Bridge
boards, each with one Solarflare SFN7122F or SFN8522 running Solarflare
firmware 7.1 and 7.4.4. Each server is set up as a router, with pf used to
NAT and filter traffic. There is one LACP lagg made up of the two ports going
to two upstream Arista 7050 or 7150 switches in an MLAG setup. The lagg
carries anywhere from 5 to 50 VLANs at a time.

Now the complicated part. This issue happens at what appear to be random
times, both on routers we have set up and left to "burn in" over a weekend
with little or no traffic and on routers that are performing moderate
amounts of work.
This issue did not happen on 10.3-STABLE amd64.


Sysctls
=============================

#security.bsd.see_other_uids=0
net.inet.tcp.mssdflt=1460
net.inet.tcp.minmss=536
net.inet.tcp.rfc6675_pipe=1
net.inet.tcp.syncache.rexmtlimit=0  # (default 3)
net.inet.tcp.per_cpu_timers=1
net.inet.ip.fastforwarding=1
#kern.random.harvest.mask=65887
kern.random.harvest.mask=65537
kern.random.sys.harvest.ethernet=0
kern.random.sys.harvest.point_to_point=0
kern.random.sys.harvest.interrupt=0
hw.intr_storm_threshold=10000
kern.ipc.maxsockbuf=16777216
# socket buffers
net.inet.tcp.recvspace=4194304
net.inet.tcp.sendspace=2097152
net.inet.tcp.sendbuf_max=16777216
net.inet.tcp.recvbuf_max=16777216
net.inet.tcp.sendbuf_auto=1
net.inet.tcp.recvbuf_auto=1
net.inet.tcp.sendbuf_inc=16384
net.inet.tcp.recvbuf_inc=524288
net.inet.tcp.cc.algorithm=htcp
net.inet.ip.intr_queue_maxlen=2048
net.route.netisr_maxqlen=2048
# Do not send IP redirects (enable fastforwarding path)
net.inet.ip.redirect=0
net.inet6.ip6.redirect=0

===loader.conf===
ipmi_load="YES"
boot_multicons="YES"
boot_serial="YES"
console="comconsole,vidconsole"
net.inet.tcp.tso="0"
autoboot_delay="5"
hw.mfi.mrsas_enable="1"
hw.usb.no_pf="1"        # Disable USB packet filtering
hw.usb.no_shutdown_wait="1"
hw.vga.textmode="1"     # Text mode
machdep.hyperthreading_allowed="0"
geom_mirror_load="YES"
kern.ipc.nmbclusters="1000000"
net.isr.maxqlimit="1000000"
kern.ipc.nmbjumbop=524288
net.isr.bindthreads="0"
net.isr.maxthreads="-1"
net.link.ifqmaxlen="2048"
net.pf.source_nodes_hashsize="1048576"
net.isr.defaultqlimit="2048"
net.inet.tcp.syncache.hashsize="1024"
net.inet.tcp.syncache.bucketlimit="100"
net.inet.tcp.tcbhashsize="65536"
vm.pmap.pti=0
hw.ibrs_disable=1


===LAGG Config===

ifconfig lagg0 laggproto lacp lagghash l2,l3 laggport sfxge0 laggport sfxge1
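
One way to timestamp a flap from the FreeBSD side is to poll the per-port
flags that ifconfig prints for the lagg. This is only a sketch: the function
name ports_not_distributing and the laggwatch log tag are made up for
illustration, and it assumes the usual "laggport: <if> flags=...<...>" lines
in the ifconfig output.

```sh
#!/bin/sh
# Read `ifconfig laggN` output on stdin and print every laggport whose
# flag list lacks DISTRIBUTING. Hypothetical helper, for illustration only.
ports_not_distributing() {
    awk '$1 == "laggport:" && $0 !~ /DISTRIBUTING/ { print $2 }'
}

# Example polling loop (commented out so the file can be sourced safely):
# while :; do
#     bad=$(ifconfig lagg0 | ports_not_distributing)
#     if [ -n "$bad" ]; then
#         logger -t laggwatch "not distributing: $bad"
#     fi
#     sleep 1
# done
```

The logged timestamps can then be lined up against the switch-side flap
events and the packet captures.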


===sfxge tunings===
kenv hw.sfxge.${NIC0_ID}.max_rss_channels=7
kenv hw.sfxge.${NIC1_ID}.max_rss_channels=7
kenv hw.sfxge.tx_ring=2048
kenv hw.sfxge.rx_ring=4096
kenv hw.sfxge.tx_dpl_get_non_tcp_max=4096
kenv hw.sfxge.tx_dpl_put_max=2048
# This turns off AIM
sysctl dev.sfxge.${NIC0_ID}.int_mod=0
sysctl dev.sfxge.${NIC1_ID}.int_mod=0


PCAPs available on request.
