Re: [Openstack-operators] MessagingTimeout in block live-migration due to long image fetch operation

Matt Riedemann Fri, 01 Dec 2017 15:09:07 -0800

On 11/28/2017 9:13 AM, Gustavo Randich wrote:

(running Mitaka)
When doing block live-migration, if the image / backing file is notpresent at destination host, sometimes pre-live migration fails after 60seconds as shown below. Retrying the migration to the same destinationhost succeeds.
It seems that an rpc_response_timeout of 60 seconds is not enough forthis scenario, in which fetching the image involves 90 seconds. We don'tlike to increase rpc_response_timeout to say, 120 seconds, only forthis reason ('cause in other kind of errors we prefer to fail fast).
Given that migrations are usually long, shouldn't this operation beunder the scope of a configurable timeout such aslive_migration_progress_timeout or live_migration_completion_timeoutwhich overrides the default rpc timeout?

I think we've talked about adding a config option or somehow doing rpctimeouts differently for operations that we know are prone to timeouts,so I don't think people would be against a config option for this. Iknow there is at least one place in nova where we specify an rpcresponse timeout which is not the default.


--

Thanks,

Matt

_______________________________________________
OpenStack-operators mailing list
OpenStack-operators@lists.openstack.org
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators

Re: [Openstack-operators] MessagingTimeout in block live-migration due to long image fetch operation

Reply via email to