Re: [openstack-dev] [tc] [all] TC Report 18-26

Zane Bitter Fri, 06 Jul 2018 09:59:19 -0700

On 02/07/18 19:13, Jay Pipes wrote:

Also note that when I've said that *OpenStack* should have a smallermission and scope, that doesn't mean that higher-level servicesaren't necessary or wanted.
Thank you for saying this, and could I please ask you to repeat thisdisclaimer whenever you talk about a smaller scope for OpenStack.
Yes. I shall shout it from the highest mountains. [1]


Thanks. Appreciate it :)

[1] I live in Florida, though, which has no mountains. But, when Ivisit, say, North Carolina, I shall certainly shout it from theirmountains.


That's where I live, so I'll keep an eye out for you if I hear shouting.

Because for those of us working on higher-level services it feels likethere has been a non-stop chorus (both inside and outside the project)of people wanting to redefine OpenStack as something that doesn'tinclude us.
I've said in the past (on Twitter, can't find the link right now, butit's out there somewhere) something to the effect of "at some point,someone just needs to come out and say that OpenStack is, at its core,Nova, Neutron, Keystone, Glance and Cinder".

https://twitter.com/jaypipes/status/875377520224460800 for anyone whowas curious.

Interestingly, that and my equally off-the-cuff replyhttps://twitter.com/zerobanana/status/875559517731381249 are actuallypretty close to the minimal descriptions of the two broad camps we weretalking about in the technical vision etherpad. (Noting for the recordthat cdent disputes that views can be distilled into two camps.)

Perhaps this is what you were recollecting. I would use a differentphrase nowadays to describe what I was thinking with the above.

I don't think I was recalling anything in particular that *you* hadsaid. Complaining about the non-core projects (presumably on the logicthat if we kicked them out of OpenStack all their developers wouldinstead go to work on radically simplifying the remaining projectsinstead?</sarcasm>) was a widespread popular pastime for at leastroughly the 4 years from 2013-2016.

I would say instead "Nova, Neutron, Cinder, Keystone and Glance [2] area definitive lower level of an OpenStack deployment. They represent aset of required integrated services that supply the most basicinfrastructure for datacenter resource management when deployingOpenStack."
Note the difference in wording. Instead of saying "OpenStack is X", I'msaying "These particular services represent a specific layer of anOpenStack deployment".

OK great. So this is wrong :) and I will attempt to explain why I thinkthat in a second. But first I want to acknowledge what is attractiveabout this viewpoint (even to me). This is a genuinely usefulobservation that leads to a real insight.

The insight, I think, is the same one we all just agreed on in anotherpart of the thread: OpenStack is the only open source projectconcentrating on the gap between a rack full of unconfigured equipmentand somewhere that you could, say, install Kubernetes. We write the bitwhere the rubber meets the road, and if we don't get it done there'snobody else to do it! There's an almost infinite variety of differentapplications and they'll all need different parts of the higher layers,but ultimately they'll all need to be reified in a physical data centerand when they do, we'll be there: that's the core of what we're building.

It's honestly only the tiniest of leaps from seeing that idea asattractive, useful, and genuinely insightful to seeing it as correct,and I don't really blame anybody who made that leap.

I'm going to gloss over the fact that we punted the actual process ofsetting up the data center to a bunch of what turned out to bevendor-specific installer projects that you suggest should be punted outof OpenStack altogether, because that isn't the biggest problem I havewith this view.

Back in the '70s there was this idea about AI: even a 2 year old humancan e.g. recognise images with a high degree of accuracy, but doing e.g.calculus is extremely hard in comparison and takes years of training.But computers can already do calculus! Ergo, we've solved the hardestpart already and building the rest out of that will be trivial, AGI isjust around the corner, &c. &c. (I believe I cribbed this explanationfrom an outdated memory of Marvin Minsky's 1982 paper "Why People ThinkComputers Can't" - specifically the section "Could a Computer HaveCommon Sense?" - so that's a better source if you actually want to learnsomething about AI.) The popularity of this idea arguably helped createdthe AI bubble, and the inevitable collision with the reality of itsfundamental wrongness led to the AI Winter. Because in fact just becauseyou can build logic out of many layers of heuristics (as human brainsdo), it absolutely does not follow that it's trivial to build otherthings that also require many layers of heuristics once you have somebasic logic building blocks. (This is my conclusion, not Minsky's, andprobably more influenced by reading summaries of Kahneman. But sufficeto say the AI technology of the present, which is showing more promise,is called Deep Learning because it consists literally of many layers ofheuristics. It's also still considerably worse at it than any 2 year oldhuman.)

I see the problem with the OpenStack-as-layers model as being analogous.(I don't think there's going to be a full-on OpenStack Winter, but we'vecertainly hit the Trough of Disillusionment.) With Nova, Neutron,Cinder, Keystone and Glance you can build a pretty good VPS hostingservice. But it's a mistake to think that cloud is something you get bylayering stuff on top of VPS hosting. It's comparatively easy to build aVPS on top of a cloud, just like teaching a child arithmetic. But it'senormously difficult to build a cloud on top of VPS (it would involve alot of wasteful layers of abstraction, similar to building artificialneurons in software).

Speaking of abstraction, let's try to pull this back to somethingconcrete. Kubernetes is event-driven at a very fundamental level: when apod dies, k8s gets a notification immediately and that prompts it toreschedule the workload. In contrast, Nova/Cinder/&c. is a black hole.You can't even build a sane dashboard for your VPS - let alonecloud-style orchestration - over it, because they have to spend alltheir time polling the API to find out if anything happened. There's anentire separate project (Masakari) that ~nobody has installed, basicallydedicated to spelunking in the compute node without Nova's knowledge totry to surface this information. I am definitely not disrespecting theMasakari team, who are doing something that desperately needs doing inthe only way that's really open to them, but that's an embarrassinglybad architecture for OpenStack as a whole.

So yeah, it's sometimes helpful to think about the fact that there's agroup of components that own the low level interaction with outsidesystems (hardware, or IdM in the case of Keystone), and that almostevery application will end up touching those directly or indirectly,while each using different subsets of the other functionality... *but*only in the awareness that those things also need to be built from theground up to occupy a space in a larger puzzle.

When folks say stuff like these projects represent a "definitive lowerlevel of an OpenStack deployment" they invite the listener to ignore thebigger picture; to imagine that if those lower level services just takecare of their own needs then everything else can just build on top.That's a mistake, unless you believe (and I know *you* don't believethis Jay) that OpenStack needs only to provide enough building blocks tobuild VPS hosting out of, because support for all of those higher-levelthings doesn't just fall out like that. You have to consciously work at it.

Imagine for a moment that, knowing everything we know now, we haddesigned OpenStack around a system of event sources and sinks that'sreliable in the face of network partitions &c., with componentsconnecting into it to provide services to the user and to each other.That's what Kubernetes did. That's the key to its success. We need to doenable something similar, because OpenStack is still necessary for allof the reasons above and more.

In particular, I think one place where OpenStack provides value is thatwe are less opinionated and can allow application developers to choosehow the event sources and sinks are connected together. That means thatusers can e.g. customise their own failovers in 'userspace' rather thanthe more one-size-fits-all approach of handling everything automaticallyinside k8s. This is theoretically the advantage of having separateprojects instead of a monolithic design, and one reason why I don'tthink that destroying all of the boundaries between projects is the wayforward for OpenStack. (I do still think it'd be a great thing for thecompute node, which is entirely internal to OpenStack and definitelydoes not benefit from fragmentation.)

Nowadays, I would further add something to the effect of "Depending onthe particular use cases and workloads the OpenStack deployer wishes topromote, an additional layer of services provides workload orchestrationand workflow management capabilities. This layer of services includeHeat, Mistral, Tacker, Service Function Chaining, Murano, etc".

That makes sense, but the key point I want to make is that you can't(usefully) provide the porcelain unless the plumbing for it is in place.Right now information only flows one way - we have drains connected tothe porcelain but no running water. Application developers are fetchingwater in buckets and heating it over an open fire medieval-style, whilethere are still (some) people who go around saying 'we have too muchporcelain, we should just concentrate on making better drains'. Somebodydare me to stretch this metaphor even further.

Does that provide you with some closure on this feeling of "non-stopchorus" of exclusion that you mentioned above?


I'm never letting this go ;)

The reason I haven't dropped this discussion is because I really wantto know if _all_ of those people were actually talking about somethingelse (e.g. a smaller scope for Nova), or if it's just you. Because youand I are in complete agreement that Nova has grown a lot of obscurecapabilities that make it fiendishly difficult to maintain, and thatin many cases might never have been requested if we'd had higher-leveltools that could meet the same use cases by composing simpler operations.
IMHO some of the contributing factors to that were:
* The aforementioned hostility from some quarters to the existence ofhigher-level projects in OpenStack.* The ongoing hostility of operators to deploying any projects outsideof Keystone/Nova/Glance/Neutron/Cinder (*still* seen playing out inthe Barbican vs. Castellan debate, where we can't even correct one ofOpenStack's original sins and bake in a secret store - something k8smanaged from day one - because people don't want to install anotherReST API even over a backend that they'll already have to installanyway).* The illegibility of public Nova interfaces to potential higher-leveltools.
I would like to point something else out here. Something that may not bepleasant to confront.
Heat's competition (for resources and mindshare) is Kubernetes, plainand simple.

For resources, that's undoubtedly true. For mindshare, that seems a bitlike saying "Horses' competition for mindshare is cars". I mean, yes,but the _competition_ part was over a while back, cars won, and horsesnow fulfil a niche role.

That's actually OK by me. When we first started Heat, it was a projectto make *OpenStack* resources orchestratable. Once it was up andrunning, a bunch of people came to us (at the Havana summit in Portlandin early 2013) and said that we needed to build a software orchestrationsystem. Personally, I was pretty reluctant at first. Eventually theyconvinced me. But in retrospect while they were right about the factthat we needed better ways to deploy software via Heat than to bake itinto the image and pass minimal configuration in the user_data (andHeat's Software Deployments delivered those improvements), the thingthey really needed was Kubernetes. Once that existed those folks meltedaway from the Heat community, and that's not a terrible outcome. TurningHeat into k8s would be hard and distracting; it's better to integratethe two together so people can get the all functionality they need fromthe projects in the best position to provide it.

There are still plenty of folks who need to do orchestration across allof their virtual infrastructure, and Heat is here to meet their needs.The project was always about trying to make OpenStack better and moreconsumable for a certain audience, and users tell us it has succeeded atthat.

Heat's competition is not other OpenStack projects.

In practical terms, Heat's competition is Horizon, shell scripts andapathy. Not necessarily in that order. Arguably Ansible as well, butmostly because we don't have any real integration with it, so people aresometimes forced to pick one or the other when they need both.

Nova's competition is not Kubernetes (despite various people continuingto say that it is).
Nova is not an orchestration system. Never was and (as long as I'mkicking and screaming) never will be.
Nova's primary competition is:

* Stand-alone Ironic
* oVirt and stand-alone virsh callers
* Parts of VMWare vCenter [3]
* MaaS in some respects

Do you see KubeVirt or Kata or Virtlet or RancherVM ending up on thislist at any point? Because a lot of people* do. And Nova is absolutelycompeting for resources with those projects. Having your VM provisioningthing embedded in the user's orchestration system of choice is a seriouscompetitive advantage. (BTW I'm currently trying to save your baconhere:http://lists.openstack.org/pipermail/openstack-dev/2018-June/131183.html)


* https://news.ycombinator.com/item?id=17013779

* The *compute provisioning* parts of EC2, Azure, and GCP

I agree this is true in practice, but would like to note that thecompute provisioning parts of those services are tied in to the rest ofthe cloud in ways that Nova is not tied in to the rest of OpenStack, andthat is a *major* missed opportunity because it largely limits ourmarket to the subset of people who need only the compute provisioning bits.

We chose to add features to Nova to compete with vCenter/oVirt, and notto add features the would have enabled OpenStack as a whole to competewith more than just the compute provisioning subset of EC2/Azure/GCP.Meanwhile, the other projects in OpenStack were working on building theother parts of an AWS/Azure/GCP competitor. And our vague one-sentencemission statement allowed us all to maintain the delusion that we wereall working on the same thing and pulling in the same direction, when intruth we haven't been at all.

We can decide that we want to be one, or the other, or both. But if wedon't all decide together then a lot of us are going to continue wastingour time working at cross-purposes.

This is why there is a Kubernetes OpenStack cloud provider plugin [4].
This plugin uses Nova [5] (which can potentially use Ironic), Cinder,Keystone and Neutron to deploy kubelets to act as nodes in a Kubernetescluster and load balancer objects to act as the proxies that k8s itselfuses when deploying Pods and Services.
Heat's architecture, template language and object constructs are indirect competition with Kubernetes' API and architecture, with theprimary difference being a VM-centric [6] vs. a container-centric objectmodel.

Mmmm, I wouldn't really call Heat VM-centric. It'sinfrastructure-centric with a sideline in managing software, where K8sis software-centric. Here's a blog post I wrote from back when peoplethought Heat's competition was Puppet(!):


https://www.zerobanana.com/archive/2014/05/08#heat-configuration-management

It's aged pretty well except for the fact that k8s largely owns the'Software Orchestration' space now. (Although, really, k8s itselfdoesn't do 'orchestration' as such. It just starts everything up, andthe application does its own co-ordination using etcd. Helm does doorchestration in the traditional sense AIUI.)

Heat's template language is similar to Helm's chart template YAMLstructure [7], and with Heat's evolution to the "convergence model",Heat's architecture actually got closer to Kubernetes' architecture:that of continually attempting to converge an observed state with adesired state.

It's important to note that that model change never happened, and likelynever will. More specifically, the set of changes labelled 'convergence'can be grouped into three different buckets, only one of which exists:

1) Feed all resource actions into a task queue for workers to pick up,enabling Heat to scale out arbitrarily (limited only by the centralisedDB); and allow users to update stacks without waiting for previousoperations to complete. This absolutely happened, has been the defaultsince Newton, is used even by TripleO since Queens, and is workinggreat. This is what most people mean when they refer to 'convergence'now (for a while we used to call it 'phase 1').2) Update resources by comparing their observed state to the desiredstate and making an incremental change to converge them, then repeat.This can itself be divided into several different implementation phases,and the first one (comparing to observed rather than last-recordedstate) actually sort-of exists as an experimental option. That said,this is probably never going to be completed for a number of reasons:lack of developers/reviewers; the need to write new resourceimplementations, thus throwing away years worth of corner-case fixes andresulting stability; and an inability to get events in an efficient(i.e. filtered at source) and reliable way.3) Doing this constantly all the time ('continuous convergence') evenwhen a stack update is not in progress. We agreed not to ever do thisbecause I argued that the user needed control over the process - it'senough that Heat could recognise that something had changed and fix itduring a stack update (bucket #2), after that it's better to let theapplication decide when to run a stack update, either on a timer or inresponse to events (probably via Mistral in both cases). Maybe if wecould (efficiently) get event notifications for everything it might be adifferent story, but there's no way we can justify constant polling ofevery resource in every stack whether the user needs it or not.

So, what is Heat to do?

One thing we need to do is to integrate with other ways of deployingsoftware (especially Kubernetes and Ansible), to build better bridgesbetween the infrastructure and the software running on it.

The challenging part is authenticating to those other systems.Unfortunately Keystone has never become a standard way of authenticatingservices outside the immediate OpenStack ecosystem. One option I want toexplore more is just having the user put credentials for those othersystems into Barbican. That's not an especially elegant solution, and itrequires operators to actually install Barbican, but it least it'ssomething and Heat wouldn't have to store the user's credentials itself.We're working on adding support for creating stacks in remote OpenStackcloud using this method, so that should help provide a model we can reuse.

The hype and marketing machine is never-ending, I'm afraid. [8]
I'm not sure there's actually anything that can be done about this.Perhaps it is a fait accomplis that Kubernetes/Helm will/has becomesynonymous with "orchestration of things". Perhaps not. I'm not anoracle, unfortunately.

Me neither. There are even folks who think that the Zun model ofcontainer deployment is going to take over the world:


https://medium.com/@steve.yegge/honestly-i-cant-stand-k8s-48c9a600e405

Who knows? He was right about Javascript. We're going to find out.

Maybe the only thing that Heat can do to fend off the coming doom is tomake a case that Heat's performance, reliability, feature set orintegration with OpenStack's other services make it a better candidatefor orchestrating virtual machine or baremetal workloads on an OpenStackdeployment than Kubernetes is.
Sorry to be the bearer of bad news,

I assure you that, contrary to popular opinion, I have not been livingunder a rock ;)


cheers,
Zane.

__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

Re: [openstack-dev] [tc] [all] TC Report 18-26

Reply via email to