In my OpenStack cluster (Havana), some instances occasionally cannot be
pinged after I reboot many instances simultaneously.
When I run "nova console-log ...", I see this line:
"*cloud-init-nonet waiting 120 seconds for a network device.*"
To fix this, I must run "*service neutron-plugin-openvswitch-agent restart*"
on the compute nodes that host these instances.
After restarting the agent, I can ping these instances normally.


Therefore, I built another OpenStack cluster (Havana) to test, but the
problem persists. I tested with both VLAN and GRE, and with both soft and
hard reboots, and the problem is the same.
# for server_id in `nova list | grep ACTIVE | awk '{print $2}'`; do echo $server_id; nova reboot --hard $server_id; done
or
# for server_id in `nova list | grep ACTIVE | awk '{print $2}'`; do echo $server_id; nova reboot $server_id; done

Check console-log:
# for server_id in `nova list | grep ACTIVE | awk '{print $2}'`; do echo $server_id; nova console-log $server_id | grep "waiting 120 seconds"; done

cea03292-5c76-45dd-af35-b9d80b82ac5d
37b02ca8-98fc-40c7-a700-f7942d134b65
a92051b1-3a1d-4f68-ba77-4d093e916ce4
*cloud-init-nonet waiting 120 seconds for a network device.*
def8f657-572a-4431-902a-c983d0aad7a9
*cloud-init-nonet waiting 120 seconds for a network device.*

And I cannot ping 2 instances: a92051b1-3a1d-4f68-ba77-4d093e916ce4 and
def8f657-572a-4431-902a-c983d0aad7a9
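
To make the check above easier to read, the loop could print only the IDs of the instances whose console log contains the timeout message. This is just a rough sketch: the function names list_active_ids and list_stuck_ids are made up for illustration, and it assumes the usual "nova list" table layout where the instance ID is the second whitespace-separated field.

```shell
# Sketch only: list_active_ids and list_stuck_ids are hypothetical helper
# names, not part of the nova client.

list_active_ids() {
    # In the "nova list" table, field 1 is the leading "|" and field 2 is
    # the instance ID.
    nova list | grep ACTIVE | awk '{print $2}'
}

list_stuck_ids() {
    # Print an instance ID only if its console log shows the
    # cloud-init-nonet timeout message.
    for server_id in $(list_active_ids); do
        if nova console-log "$server_id" | grep -q "waiting 120 seconds"; then
            echo "$server_id"
        fi
    done
}
```

After sourcing these functions, running "list_stuck_ids" prints one affected instance ID per line, which can then be matched to the compute nodes where the agent needs a restart.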

This bug has existed in neutron-agent for quite a long time, but it still has not been fixed.

Can you suggest a quick fix for this?
Thank you very much!

Best Regards,