[openstack-dev] [heat][sahara][magnum][tripleo] Scaling nested stack validation

Zane Bitter Wed, 23 Nov 2016 15:00:20 -0800

We discussed $SUBJECT at the summit as one of the main performance problems that people are running into when trying to create very large autoscaling groups, as projects like Sahara, Magnum, TripleO, OpenShift are wont to do. Of course, as we all know, validation happens synchronously, so it's prone to causing RPC timeouts that mean a hard failure of the parent stack.


First the good news - I just committed this patch:


https://review.openstack.org/#/c/400961/

which should mean from now on that resources with identical definitions will not all be validated, and instead we'll just validate one representative one. In theory this should mean that autoscaling groups should now validate in constant rather than linear time. If anyone from one of the affected projects is able to confirm this, then I'd be happy to backport the patch to stable/newton. It really is very simple.
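
A rough sketch of the idea, for illustration only (this is not the code from the patch; validate_unique and the fingerprinting via canonical JSON are assumptions made up for this example):

```python
import json


def validate_unique(definitions, validate):
    """Call validate() once per distinct definition; return the call count.

    Hypothetical helper: identical definitions are detected by comparing
    their canonical JSON form, which is an assumption of this sketch.
    """
    seen = set()
    calls = 0
    for defn in definitions:
        # Sorted-key JSON gives a hashable fingerprint of the definition.
        key = json.dumps(defn, sort_keys=True)
        if key in seen:
            continue  # an identical definition was already validated
        seen.add(key)
        validate(defn)
        calls += 1
    return calls


# 1000 identical members, as in a large autoscaling group.
group = [{"type": "OS::Nova::Server",
          "properties": {"flavor": "m1.small"}}] * 1000
calls_made = validate_unique(group, lambda defn: None)
print(calls_made)
```

With identical members the number of validate() calls stays at one no matter how large the group is, which is where the constant-time expectation comes from.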

The bad news here is for users of ResourceGroups with %index% substitution (*cough*TripleO*cough*) - this makes each resource definition unique, so it won't benefit from this fix. (Adding this to my mental list of reasons why index substitution is bad.)
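
To make the problem concrete, here is a small sketch of why substitution defeats any comparison of definitions. The substitute_index helper is hypothetical and only mimics the effect of %index% substitution; it is not how ResourceGroup implements it:

```python
import json


def substitute_index(value, index, token="%index%"):
    """Recursively replace the token with the member's index (illustrative)."""
    if isinstance(value, str):
        return value.replace(token, str(index))
    if isinstance(value, dict):
        return {k: substitute_index(v, index, token) for k, v in value.items()}
    if isinstance(value, list):
        return [substitute_index(v, index, token) for v in value]
    return value


template_def = {"type": "OS::Nova::Server",
                "properties": {"name": "node-%index%"}}

# Every member gets its own index baked into the definition.
members = [substitute_index(template_def, i) for i in range(5)]

# Count how many distinct definitions remain after substitution.
distinct = {json.dumps(m, sort_keys=True) for m in members}
print(len(distinct))
```

Every member ends up with a different definition, so there is no representative that can stand in for the rest.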

I also investigated another issue, which is that since the fix for https://bugs.launchpad.net/heat/+bug/1388140 landed (in Kilo) I believe we are validating nested stacks multiple times (specifically, m times, where m is the stack's depth in the tree):


  root                     child                    grandchild

  create
   -> validate ----------> validate --------------> validate
   -> Resource.create ===> create
                            -> validate ----------> validate
                            -> Resource.create ===> create
                                                     -> validate
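
The same counting can be written out as a toy model. This assumes a simple linear chain of nested stacks where every create validates the stack itself plus everything below it; count_validations is a made-up name, not anything in Heat:

```python
from collections import Counter


def count_validations(chain):
    """Count how often each stack in a root-to-leaf chain gets validated.

    Toy model: creating chain[i] validates chain[i] and, recursively,
    every stack nested beneath it.
    """
    counts = Counter()
    for i in range(len(chain)):
        for name in chain[i:]:
            counts[name] += 1
    return dict(counts)


counts = count_validations(["root", "child", "grandchild"])
print(counts)
```

In this model a stack at depth m is validated m times, matching the columns of the diagram above.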

The only good news here is that ResourceGroup is smart enough to make sure that it generates a nested stack with at most 1 resource to validate when validate() is called. (However, when the nested stack is created, and thus validated, it is of course full-sized.) Autoscaling groups make no such allowances, but the patch above should actually have the same effect. (We can't get rid of the special case for ResourceGroup though, because of index substitution.)

An obvious fix would be to disable validation - or, more specifically, validation of _resources_ - on create/update for stacks that have a non-null owner_id (i.e. nested stacks), so that we had something like:


  root                     child                    grandchild

  create
   -> validate ----------> validate --------------> validate
   -> Resource.create ===> create
                            -> Resource.create ===> create

That would eliminate the duplication/triplication/multiplication of validation. It would also mean that we'd cut out the expensive part of ResourceGroup validation with index substitution, leaving only the cheap part.
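
As a sketch of what that check might look like: the Stack class below is a hypothetical stand-in, not Heat's, and it simplifies by skipping validation entirely for nested stacks rather than only the validation of resources.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Stack:
    """Hypothetical stand-in for a stack; not Heat's actual class."""
    name: str
    owner_id: Optional[str] = None  # non-null means this is a nested stack
    children: List["Stack"] = field(default_factory=list)


def validate(stack, log):
    """Validate a stack and, recursively, everything nested in it."""
    log.append(stack.name)
    for child in stack.children:
        validate(child, log)


def create(stack, log):
    """Create a stack, validating only when it is the root of the tree."""
    if stack.owner_id is None:
        validate(stack, log)
    # Nested stacks were already covered by the root's recursive validate.
    for child in stack.children:
        create(child, log)


grandchild = Stack("grandchild", owner_id="child-id")
child = Stack("child", owner_id="root-id", children=[grandchild])
root = Stack("root", children=[child])

log = []
create(root, log)
print(log)
```

Each stack shows up in the log exactly once, regardless of how deep it sits in the tree.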

One downside is that in the ResourceGroup/index substitution case we'd be creating resources whose definitions hadn't _ever_ been validated. I _think_ that's safe, in the sense that you'd just hear about errors later, as opposed to everything falling over in a heap, but it's difficult to be certain. Hearing about problems late is also not ideal (since it may cause otherwise-healthy siblings to be cancelled), but I would guess that heavy users like TripleO developers would say that it's worth the tradeoff.

However, one other thing about this bothers me. The part of validation that we're keeping:


   -> validate ----------> validate --------------> validate

involves loading all of the nested stacks in memory at once (i.e. the thing we were not supposed to be doing any more in Kilo, in favour of farming nested stacks out over RPC.) As we discovered when we found out we were doing the same thing with outputs[1], this is a bit like hanging out a giant "Kick Me" sign for the OOM Killer.

That's mitigated quite a lot by my patch though... we'll load the whole autoscaling group stack in memory, but if its members are themselves nested stacks we'll load only one of them. So the scaling tendencies will hopefully be dominated by the complexity of your templates more than the size of your deployment. ResourceGroup is in a better position, because its nested stack will actually have only one member, so the size shouldn't affect memory consumption at all during validation.


Some options:
1) Chalk it up to an acceptable tradeoff
2) Add a single-member special case for autoscaling group validation
3) Farm out the nested validation over RPC
4) Both (2) & (3)
5) Some totally different arrangement of how nested stacks are validated

Discuss.

cheers,
Zane.

[1] https://review.openstack.org/#/c/383839/
