[openstack-dev] [nova] placement / resource providers ocata summit session recaps

Matt Riedemann Wed, 02 Nov 2016 10:57:07 -0700

We had three design summit sessions related to the new placement serviceand resource providers work. Since they are all more or less related,I'm going to recap them in a single email.


----

The first session was a retrospective on the placement service work thathappened in the Newton release. The full etherpad is here:


https://etherpad.openstack.org/p/ocata-nova-summit-placement-retrospective

We first talked about what went well, of which there were many things:

- There is a better shared understanding of the design and goals amongmore people in Nova.

- Computes in Newton are reporting their RAM/DISK/CPU inventory and usage.
- We have CI jobs.

- Jay did a nice job of using consistent terminology when discussingresource providers and the end goal for Newton so we could stay focused.- Hangouts helped the team get unstuck at times when we were grindingtoward feature freeze.- The placement API has a clean WSGI design and REST interface thatothers are able to build onto easily.


We then talked about what didn't go so well, which included:

- Confusion around division of labor and when different chunks can beworked in parallel, and by whom.- There was too much time spent on making the specs perfect and weneeded to just start writing and reviewing code. This was especiallyevident when the client side (resource tracker) pieces started gettingwritten that used the placement REST API and required changes to the API.- At times there were key discussions/decisions that were not properlydocumented/communicated back to the wider team.- There was a breakdown in communication at or after the midcycle aboutthe separate placement DB which led to a revert late in the cycle.

- General burnout and frustration.

- Traps of working on long patch series with little review feedbackearly in the series or low-latency on reviews leading to wasted time.

From those discussions, we listed what we should keep doing or dodifferently:

- Write specs with so less low-level detail, but if there is that levelof detail, make sure to amend the spec later if there are changes onceimplemented.

- Use Hangouts when we get stuck.

- Document/communicate decisions/agreements/changes in direction in themailing list.

- Encourage people to pair up for redundancy.

- Encourage early PoCs before building a long and potentially off themark patch series.

There was also some general discussion about not moving specs to'implemented' until the spec is updated after the code is all approved.I was personally not sold on what was proposed for this, since Iconsider amending specs is like writing documentation and CI tests - ifyou don't -2 the last change in the series to complete the blueprint,people have little incentive to actually do it and once their code ismerged it's very hard to get them to do the ancillary tasks. I'm open tofurther discussing this idea though in case I missed the point.


----

The next session was about the quantitative side of resource providers.The full etherpad is here:


https://etherpad.openstack.org/p/ocata-nova-summit-resource-providers-quantitative

There were quite a few things in the etherpad and we didn't get to allof them, so this is a recap of what we did talk about.


- Custom resource classes

The code for this is moving along and being reviewed. There will benamespaces on the standard resource classes that nova provides. Theresource tracker will create inventory/allocation records for the Ironicnodes. The Ironic inventory records will use the node.resource_classvalue as the custom resource class.

We still need to figure out what to do about mapping a single flavor tomultiple node classes, but it might just be done with extra_specs. Therewill be upgrade impacts for this, however, if not properly mapped andthe scheduler starts using the placement service.


- Microversions

Chris Dent has a patch up to add microversion support to the placementAPI and it's being reviewed.


- Nested resource providers

Jay has been working on code for this and has a design in mind. Jay andEd did some whiteboarding in the hall and sorted out their differenceson the design and have agreement on the way forward (which is Jay'snesting/tree model).


- Documenting the placement REST API

We didn't get into this at the summit, but in side discussions it's aTODO and right now we'll most likely handle this like we do for thecompute api-ref.


- Top priorities for Ocata

1. The scheduler calling the placement API to get a list of resourceproviders. There are some specs and WIP code up that Sylvain is workingon. Note that this is not going to involve the caching scheduler fornow, we'll worry about that later.

2. Start handling shared storage. We need the resource tracker and/or anexternal script to create the resource provider / aggregate mapping andinventory/allocation records against shared DISK_GB inventories. Theaggregates mapping modeling work in the placement API is underway.


- What's required when upgrading to Ocata

1. The placement service is required to upgrade to Ocata. You'll breakin Ocata if you don't have this because the scheduler will be using theplacement service for scheduling decisions. The idea is to stand up theplacement service in Newton, get the resource provider (compute node)data populated and then upgrade.

TODO: We need to be more clear about this in the release notes andupgrade docs.

2. The aforementioned mapping of Ironic flavors to multiple noderesource classes. This is still a TBD though.


----

The final resource providers session focused on qualitative aspects,which are the traits on a given resource provider. The full sessionetherpad is here:


https://etherpad.openstack.org/p/ocata-nova-summit-resource-providers-qualitative

The majority of the session was mostly talking about the proposed traitsREST API and different use cases, along with some clarification on rulesaround traits:


- They can't be negative.

- Preferred/required traits will be part of the request spec, not taggedon a trait itself. How this is worked into the request spec is TBD.- Image metadata / flavor extra specs will need to be handled at somepoint but it's not a top priority right now.

- There will be no ACLs on traits.
- The traits APIs will be admin-only for now.

The direction for Ocata is to:

- Spend less time on the spec and start working on some proof of conceptcode, especially on the client side to help shape the needs of the REST API.- Create a spec for namespaces on custom traits which will mirror how wehandle namespaces for custom resource classes.

- Move the os-traits library under the Compute program wrt governance.

--

Thanks,

Matt Riedemann


__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

[openstack-dev] [nova] placement / resource providers ocata summit session recaps

Reply via email to