Re: [Openstack-operators] Lets talk capacity monitoring

Mathieu Gagné Thu, 15 Jan 2015 09:29:20 -0800

On 2015-01-15 11:43 AM, Jesse Keating wrote:

We have a need to better manage the various openstack capacities across
our numerous clouds. We want to be able to detect when capacity of one
system or another is approaching the point where it would be a good idea
to arrange to increase that capacity. Be it volume space, VCPU
capability, object storage space, etc...


What systems are you folks using to monitor and react to such things?


Thanks for bringing up the subject Jesse.

I believe you are not the only one facing this challenge because I am too.

I added the subject to the midcycle ops meetup (Capacityplanning/monitoring) which I hope to be able to attend:

https://etherpad.openstack.org/p/PHL-ops-meetup

We are using host aggregates and have a complex combination of them.(imaging a venn diagram)


What we do is retrieving all:
- hypervisor stats
- host aggregates

From there, we compute resource usage (vcpus, ram, disk) in any givenhost aggregate.

This part is very challenging as we have to partially reimplementnova-scheduler logic to determine if a given hypervisor has differentresource allocation ratios based on host aggregate attributes.

The result in a table with resource usage percentage (and absolutenumbers) for each host aggregates (and combinations).

Unfortunately, I can't share yet this first tool as my coworker verytightly integrated it to our internal monitoring tool and wouldn't workoutside it. No promise but I'll try to find time to extract it and shareit with you guys.

We also coded a very primitive tool which takes a flavor name andcompute available "slots" on each hypervisors (regardless of hostaggregate memberships):


https://gist.github.com/mgagne/bc54c3434a119246a88d

This tool is not actively used in our monitoring due to mentionedlimitation as we would again have to partially reimplementnova-scheduler logic to determine if a given flavor can (or not) bespawn on a given hypervisor and filter it out from the output if itcan't accept the flavor. Furthermore, it does not take into accountresource allocation ratios based on host aggregates.

Hopefully, other people will join in and share their tools so we can allimprove our OpenStack operations experience.


--
Mathieu

_______________________________________________
OpenStack-operators mailing list
OpenStack-operators@lists.openstack.org
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators

Re: [Openstack-operators] Lets talk capacity monitoring

Reply via email to