** Changed in: neutron Status: Fix Released => In Progress -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to neutron. https://bugs.launchpad.net/bugs/2101840
Title: ovs jobs randomly failing with guest vms not getting ip from dhcp Status in neutron: In Progress Bug description: Creating a new one then https://bugs.launchpad.net/neutron/+bug/2045549 as we seeing this more often now so might be related to recent cleanups for eventlet in master Example failures:- - https://6680d19461a57172eaa9-71ec100da363a5820f9bef22bce9fb5d.ssl.cf5.rackcdn.com/periodic/opendev.org/openstack/neutron/master/neutron-ovs-tempest-with-os-ken-master/30c29c2/testr_results.html - https://fc891d7e1f0b0d1ecc9c-5406e7d03d51190d7ee83b25b2acffe7.ssl.cf2.rackcdn.com/periodic/opendev.org/openstack/neutron/master/neutron-ovs-tempest-with-os-ken-master/3d07efc/testr_results.html - https://e6aae367021d38c9d4db-1eee5d6654f19bfbb50f61b5642c3fcd.ssl.cf2.rackcdn.com/periodic/opendev.org/openstack/neutron/master/neutron-ovs-tempest-with-oslo-master/5db5fe8/testr_results.html - https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_682/periodic/opendev.org/openstack/neutron/master/neutron-ovs-tempest-fips/6822cad/testr_results.html - https://b2b9d08c5b85917c3495-18cd7f889ae1379763e7ec0a1ec70a8d.ssl.cf2.rackcdn.com/periodic/opendev.org/openstack/neutron/master/neutron-ovs-tempest-with-os-ken-master/ae8c73a/testr_results.html - https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_c6b/periodic/opendev.org/openstack/neutron/master/neutron-ovs-tempest-plugin-iptables_hybrid-nftables/c6b70a9/testr_results.html Checking the last failure ^ Test fails to SSH via floating ip:- 2025-03-09 02:52:17,690 87771 WARNING [tempest.lib.common.ssh] Failed to establish authenticated ssh connection to cirros@172.24.5.180 ([Errno None] Unable to connect to port 22 on 172.24.5.180). Number attempts: 19. Retry after 20 seconds. 2025-03-09 02:52:41,242 87771 WARNING [tempest.lib.common.ssh] Failed to establish authenticated ssh connection to cirros@172.24.5.180 ([Errno None] Unable to connect to port 22 on 172.24.5.180). Number attempts: 20. Retry after 21 seconds. 2025-03-09 02:53:05,817 87771 ERROR [tempest.lib.common.ssh] Failed to establish authenticated ssh connection to cirros@172.24.5.180 after 20 attempts. Proxy client: no proxy client 2025-03-09 02:53:05.817 87771 ERROR tempest.lib.common.ssh Traceback (most recent call last): 2025-03-09 02:53:05.817 87771 ERROR tempest.lib.common.ssh File "/opt/stack/tempest/tempest/lib/common/ssh.py", line 136, in _get_ssh_connection 2025-03-09 02:53:05.817 87771 ERROR tempest.lib.common.ssh ssh.connect(self.host, port=self.port, username=self.username, 2025-03-09 02:53:05.817 87771 ERROR tempest.lib.common.ssh File "/opt/stack/tempest/.tox/tempest/lib/python3.12/site-packages/paramiko/client.py", line 409, in connect 2025-03-09 02:53:05.817 87771 ERROR tempest.lib.common.ssh raise NoValidConnectionsError(errors) 2025-03-09 02:53:05.817 87771 ERROR tempest.lib.common.ssh paramiko.ssh_exception.NoValidConnectionsError: [Errno None] Unable to connect to port 22 on 172.24.5.180 From vm console log it didn't got IP from dhcp:- ### /etc/init.d/sshd start Top of dropbear init script Starting dropbear sshd: failed to get instance-id of datasource mkdir: can't create directory '/etc/dropbear': No such file or directory WARN: generating key of type rsa failed! WARN: generating key of type ecdsa failed! FAIL ### ifconfig -a eth0 Link encap:Ethernet HWaddr FA:16:3E:04:97:83 inet6 addr: fe80::f816:3eff:fe04:9783/64 Scope:Link UP BROADCAST RUNNING MULTICAST MTU:1380 Metric:1 RX packets:117 errors:0 dropped:0 overruns:0 frame:0 TX packets:14 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:5654 (5.5 KiB) TX bytes:1892 (1.8 KiB) From dhcp agent logs:- Mar 09 02:48:00.817622 np0040117219 dnsmasq-dhcp[88108]: DHCPDISCOVER(tap6d8b1e0a-4b) fa:16:3e:04:97:83 no address available Mar 09 02:49:00.865746 np0040117219 dnsmasq-dhcp[88108]: DHCPDISCOVER(tap6d8b1e0a-4b) fa:16:3e:04:97:83 no address available Mar 09 02:50:00.930199 np0040117219 dnsmasq-dhcp[88108]: DHCPDISCOVER(tap6d8b1e0a-4b) fa:16:3e:04:97:83 no address available Mar 09 02:50:16.221733 np0040117219 neutron-dhcp-agent[64177]: DEBUG neutron.agent.dhcp.agent [-] Pending events to be processed: 233 {{(pid=64177) _process_resource_update /opt/stack/neutron/neutron/agent/dhcp/agent.py:597}} Mar 09 02:50:16.222351 np0040117219 neutron-dhcp-agent[64177]: DEBUG neutron.agent.dhcp.agent [-] neutron.agent.dhcp.agent.DhcpAgentWithStateReport method _port_create called with arguments (admin_state_up=True, allowed_address_pairs=[], binding:host_id=, binding:profile=, binding:vif_details=, binding:vif_type=unbound, binding:vnic_type=normal, created_at=2025-03-09T02:47:49Z, description=, device_id=93c28b39-3e90-4d4c-8a27-04d04a4224e3, device_owner=, dns_assignment=[fqdn=host-10-1-0-5.openstackgate.local., hostname=host-10-1-0-5, ip_address=10.1.0.5], dns_domain=, dns_name=, extra_dhcp_opts=[], fixed_ips=[ip_address=10.1.0.5, subnet_id=251d60e3-4355-45a3-b0f7-9b99794ad793], id=fdeeaa61-d821-4c28-9a5f-3fa54466002c, ip_allocation=immediate, mac_address=fa:16:3e:04:97:83, name=, network=admin_state_up=True, availability_zone_hints=[], availability_zones=['nova'], created_at=2025-03-09T02:46:12Z, description=, dns_domain=, id=89dcf835-6757-4b37-a434-a790066e5142, ipv4_address_scope= None, ipv6_address_scope=None, l2_adjacency=True, mtu=1380, name=tempest-AttachInterfacesTestJSON-1573214142-network, port_security_enabled=True, project_id=fa0d2441acf841a18af24edd7bc278f4, provider:network_type=vxlan, provider:physical_network=None, provider:segmentation_id=5, qinq=None, qos_policy_id=None, revision_number=2, router:external=False, shared=False, standard_attr_id=63, status=ACTIVE, subnets=['251d60e3-4355-45a3-b0f7-9b99794ad793'], tags=[], tenant_id=fa0d2441acf841a18af24edd7bc278f4, updated_at=2025-03-09T02:46:12Z, vlan_transparent=None, network_id=89dcf835-6757-4b37-a434-a790066e5142, port_security_enabled=True, project_id=fa0d2441acf841a18af24edd7bc278f4, propagate_uplink_status=True, qos_network_policy_id=None, qos_policy_id=None, resource_request=None, revision_number=1, security_groups=['7da7083c-7d1f-4714-ad8e-7092b95ed3f5'], standard_attr_id=701, status=DOWN, tags=[], tenant_id=fa0d2441acf841a18af24edd7bc278f4, trusted=None, updated_at=2025-03-09T02:47:50Z,) {} {{(pid=64177) wrapper /opt/stack/data/venv/lib/python3.12/site-packages/oslo_log/helpers.py:65}} Mar 09 02:50:16.222894 np0040117219 neutron-dhcp-agent[64177]: INFO neutron.agent.dhcp.agent [-] Trigger reload_allocations for port admin_state_up=True, allowed_address_pairs=[], binding:host_id=, binding:profile=, binding:vif_details=, binding:vif_type=unbound, binding:vnic_type=normal, created_at=2025-03-09T02:47:49Z, description=, device_id=93c28b39-3e90-4d4c-8a27-04d04a4224e3, device_owner=, dns_assignment=[fqdn=host-10-1-0-5.openstackgate.local., hostname=host-10-1-0-5, ip_address=10.1.0.5], dns_domain=, dns_name=, extra_dhcp_opts=[], fixed_ips=[ip_address=10.1.0.5, subnet_id=251d60e3-4355-45a3-b0f7-9b99794ad793], id=fdeeaa61-d821-4c28-9a5f-3fa54466002c, ip_allocation=immediate, mac_address=fa:16:3e:04:97:83, name=, network=admin_state_up=True, availability_zone_hints=[], availability_zones=['nova'], created_at=2025-03-09T02:46:12Z, description=, dns_domain=, id=89dcf835-6757-4b37-a434-a790066e5142, ipv4_address_scope=None, ipv6_address_scope=None, l2_adjacency=True, mtu=1380 , name=tempest-AttachInterfacesTestJSON-1573214142-network, port_security_enabled=True, project_id=fa0d2441acf841a18af24edd7bc278f4, provider:network_type=vxlan, provider:physical_network=None, provider:segmentation_id=5, qinq=None, qos_policy_id=None, revision_number=2, router:external=False, shared=False, standard_attr_id=63, status=ACTIVE, subnets=['251d60e3-4355-45a3-b0f7-9b99794ad793'], tags=[], tenant_id=fa0d2441acf841a18af24edd7bc278f4, updated_at=2025-03-09T02:46:12Z, vlan_transparent=None, network_id=89dcf835-6757-4b37-a434-a790066e5142, port_security_enabled=True, project_id=fa0d2441acf841a18af24edd7bc278f4, propagate_uplink_status=True, qos_network_policy_id=None, qos_policy_id=None, resource_request=None, revision_number=1, security_groups=['7da7083c-7d1f-4714-ad8e-7092b95ed3f5'], standard_attr_id=701, status=DOWN, tags=[], tenant_id=fa0d2441acf841a18af24edd7bc278f4, trusted=None, updated_at=2025-03-09T02:47:50Z on network 89dcf835-6757-4b37-a434-a790066e5142 Mar 09 02:50:16.223469 np0040117219 neutron-dhcp-agent[64177]: DEBUG neutron.agent.dhcp.agent [-] Calling driver for network: 89dcf835-6757-4b37-a434-a790066e5142/seg=None action: reload_allocations {{(pid=64177) _call_driver /opt/stack/neutron/neutron/agent/dhcp/agent.py:233}} Mar 09 02:50:16.224636 np0040117219 neutron-dhcp-agent[64177]: DEBUG neutron.agent.linux.dhcp [-] Building host file: /opt/stack/data/neutron/dhcp/89dcf835-6757-4b37-a434-a790066e5142/host {{(pid=64177) _output_hosts_file /opt/stack/neutron/neutron/agent/linux/dhcp.py:944}} Mar 09 02:50:16.225296 np0040117219 neutron-dhcp-agent[64177]: DEBUG neutron.agent.linux.dhcp [-] Done building host file /opt/stack/data/neutron/dhcp/89dcf835-6757-4b37-a434-a790066e5142/host {{(pid=64177) _output_hosts_file /opt/stack/neutron/neutron/agent/linux/dhcp.py:985}} Mar 09 02:50:16.738386 np0040117219 dnsmasq[88108]: read /opt/stack/data/neutron/dhcp/89dcf835-6757-4b37-a434-a790066e5142/addn_hosts - 4 names Seeing high number of events pending, seems that causing such issues? May be related to recent changes related to eventlet cleanup like https://review.opendev.org/c/openstack/neutron/+/942393 To manage notifications about this bug go to: https://bugs.launchpad.net/neutron/+bug/2101840/+subscriptions -- Mailing list: https://launchpad.net/~yahoo-eng-team Post to : yahoo-eng-team@lists.launchpad.net Unsubscribe : https://launchpad.net/~yahoo-eng-team More help : https://help.launchpad.net/ListHelp