Thanks for your patch, but it does not work for us. We have found a way to fix it and are running the tests now. We will see what the results show.
From: Huanglili (lee)
Sent: October 18, 2016 20:52
To: Huanglili (lee); 'discuss@openvswitch.org'
Cc: 'gaoxiaoqiu 00307203'; caihe
Subject: [vxlan]openvswitch-2.5.0 coredump issue

Hi all,

I hit a core dump when I bring a vxlan port down with ifconfig while vxlan packets are being sent. As we understand it, when the vxlan port is brought down the UDP socket is freed, but vxlan_xmit_one still uses that socket to send, so it ends up reading through a NULL pointer. How should we avoid this problem? We also see it in openvswitch-2.6.0.

The core dump output is:

[437952.039489] BUG: unable to handle kernel NULL pointer dereference at 0000000000000020
[437952.047784] IP: [<ffffffffa05e5407>] vxlan_xmit_one.constprop.50+0x47/0x1210 [openvswitch]
[437952.056503] PGD d418fb067 PUD ee31f4067 PMD 0
[437952.061259] Thread overran stack, or stack corrupted
[437952.066472] Oops: 0000 [#1] SMP
[437952.074100] collected_len = 1015954, LOG_BUF_LEN_LOCAL = 1048576
[437952.099532] kbox: no notify die func register. no need to notify
[437952.105786] do nothing after die!
[437952.109352] Modules linked in: sch_netem ip_set_hash_net ip_set_hash_ipport rfile(OVE) vport_vxlan(O) bridge dm_service_time dm_multipath iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi 8021q garp stp mrp llc bonding nat(O) bum(O) ip_set nfnetlink vfat fat isofs ext4 jbd2 xfs dev_connlimit(O) ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter openvswitch(O) nf_defrag_ipv6 gre kboxdriver(O) kbox(O) signo_catch(O) pmcint(O) ipmi_devintf ipmi_si ipmi_msghandler ixgbe(O) coretemp intel_rapl crc32_pclmul crc32c_intel ghash_clmulni_intel vxlan aesni_intel ip6_udp_tunnel igb tg3 lrw udp_tunnel gf128mul glue_helper ablk_helper i2c_algo_bit cryptd ses dca i2c_i801 sb_edac enclosure edac_core sg iTCO_wdt iTCO_vendor_support i2c_core pcspkr ptp pps_core lpc_ich mfd_core shpchp mei_me mei
[437952.182915] acpi_power_meter nf_conntrack_ipv4 nf_defrag_ipv4 vhost_net(O) tun(O) vhost(O) macvtap macvlan vfio_pci ip_tables ext3 mbcache jbd sd_mod kvm_intel(O) kvm(O) lpfc irqbypass ahci crc_t10dif libahci crct10dif_generic crct10dif_pclmul libata scsi_transport_fc scsi_tgt megaraid_sas crct10dif_common dm_mod vfio_iommu_type1 vfio xt_sctp nf_conntrack_proto_sctp nf_nat_proto_sctp nf_nat nf_conntrack sctp libcrc32c [last unloaded: rmem]
[437952.222291] CPU: 4 PID: 12988 Comm: handler903 Tainted: G OE ----V------- 3.10.0-327.28.3.41.x86_64 #1
[437952.233147] Hardware name: Huawei RH2288H V3/BC11HGSA0, BIOS 1.69 10/31/2015
[437952.240440] task: ffff88003f348000 ti: ffff880f09534000 task.ti: ffff880f09534000
[437952.248335] RIP: 0010:[<ffffffffa05e5407>] [<ffffffffa05e5407>] vxlan_xmit_one.constprop.50+0x47/0x1210 [openvswitch]
[437952.259479] RSP: 0018:ffff880f09537728 EFLAGS: 00010246
[437952.265043] RAX: 0000000000000000 RBX: ffff881802e3e000 RCX: 0000000000002000
[437952.272594] RDX: ffff8810185bf488 RSI: ffff881802e3e000 RDI: ffff880c1e603600
[437952.280099] RBP: ffff880f09537888 R08: ffff880f88b27708 R09: 0000000000000000
[437952.287659] R10: ffffffff8151c007 R11: ffffea0030798080 R12: ffff880f8ba68d00
[437952.295158] R13: 000000000000ffbe R14: ffff880c1e603600 R15: ffff881810c9c858
[437952.302714] FS: 00007f119c7f8700(0000) GS:ffff88103ff00000(0000) knlGS:0000000000000000
[437952.315104] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[437952.321099] CR2: 0000000000000020 CR3: 0000000fdfa2a000 CR4: 00000000001427e0
[437952.328644] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[437952.336189] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[437952.343733] Stack:
[437952.345996] 0000000000000000 0000000000000000 0000000000000000 0000000000000000
[437952.353866] 0000000000000000 0000000000000000 0000000000000000 0000000000000000
[437952.361740] 0000000000000000 8913000000000000 aa037909b0037909 0000000000000000
[437952.369624] Call Trace:
[437952.372337] [<ffffffffa05d60a7>] ? internal_dev_recv+0x167/0x180 [openvswitch]
[437952.380062] [<ffffffff8151c007>] ? kfree_skbmem+0x37/0x90
[437952.385811] [<ffffffffa05e6625>] rpl_vxlan_xmit+0x55/0x80 [openvswitch]
[437952.392761] [<ffffffffa05d5ad4>] ovs_vport_send+0x44/0xb0 [openvswitch]
[437952.399723] [<ffffffffa05c62a5>] do_output+0x65/0x180 [openvswitch]
[437952.406329] [<ffffffffa05c70dc>] do_execute_actions+0x10c/0x860 [openvswitch]
[437952.413960] [<ffffffffa05c7870>] ovs_execute_actions+0x40/0x130 [openvswitch]
[437952.421597] [<ffffffffa05cbb59>] ovs_packet_cmd_execute+0x2c9/0x2f0 [openvswitch]
[437952.429587] [<ffffffff8155f31d>] genl_family_rcv_msg+0x1cd/0x400
[437952.435927] [<ffffffff8122b110>] ? ep_scan_ready_list.isra.9+0x1f0/0x1f0
[437952.442968] [<ffffffff8155f550>] ? genl_family_rcv_msg+0x400/0x400
[437952.449488] [<ffffffff8155f5e1>] genl_rcv_msg+0x91/0xd0
[437952.455054] [<ffffffff8155d549>] netlink_rcv_skb+0xa9/0xc0
[437952.460878] [<ffffffff8155da78>] genl_rcv+0x28/0x40
[437952.466096] [<ffffffff8155ceba>] netlink_unicast+0x16a/0x210
[437952.472097] [<ffffffff8155d277>] netlink_sendmsg+0x317/0x430
[437952.478095] [<ffffffff8155b3c2>] ? netlink_recvmsg+0x212/0x490
[437952.484272] [<ffffffff81514fd0>] sock_sendmsg+0xb0/0xf0
[437952.489838] [<ffffffff8151569f>] ? sock_recvmsg+0xbf/0x100
[437952.495662] [<ffffffff810b8f56>] ? try_to_wake_up+0x1b6/0x300
[437952.501742] [<ffffffff81515409>] ___sys_sendmsg+0x3a9/0x3c0
[437952.507654] [<ffffffff81198772>] ? do_numa_page+0x162/0x1f0
[437952.513564] [<ffffffff811999d7>] ? handle_mm_fault+0x607/0xf50
[437952.519737] [<ffffffff8122b253>] ? ep_poll+0x123/0x360
[437952.525191] [<ffffffff815162f1>] __sys_sendmsg+0x51/0x90
[437952.530843] [<ffffffff81516342>] SyS_sendmsg+0x12/0x20
[437952.536322] [<ffffffff81649609>] system_call_fastpath+0x16/0x1b
[437952.542580] Code: 48 81 ec 38 01 00 00 8b 8e 40 09 00 00 65 48 8b 04 25 28 00 00 00 48 89 84 24 30 01 00 00 31 c0 48 8b 86 e0 08 00 00 48 8b 40 30 <4c> 8b 78 20 48 8b 47 38 48 85 c0 45 0f b7 47 10 0f 84 b3 0f 00
[437952.563291] RIP [<ffffffffa05e5407>] vxlan_xmit_one.constprop.50+0x47/0x1210 [openvswitch]
[437952.572078] RSP <ffff880f09537728>
[437952.575822] CR2: 0000000000000020
[437952.579911] ---[ end trace 28d6b1058663a9c7 ]---
[437952.665116] Kernel panic - not syncing: Fatal exception in interrupt
[437952.767620] die even has been record!