[Cloud] Networking incident today in CloudVPS (ferm update)

2019-09-30 Thread Arturo Borrero Gonzalez
Hi,

Today, 2019-09-30, we ran an operation on all CloudVPS virtual machines to
update ferm and fix a bug [0]. Ferm is a firewalling utility.

The fleet-wide operation resulted in ferm being installed on every VM, even on
those VMs not requiring it. This caused a network outage for most of the
virtual machines and projects that were not previously configured to use ferm.
Many Toolforge tools (webservices, grid jobs, etc.) stopped working, database
connections were lost, the proxy reported bad gateway errors, etc.

To resolve the issue, we quickly removed ferm from every VM and ran puppet agent
to install it only on the VMs that had ferm in their puppet manifests.
As soon as we did this, everything went back to normal.
The incident lasted about 1h.
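
For readers who want to picture the recovery, here is a minimal sketch of the
two steps as set operations. It is only an illustration, not the tooling we
actually ran: the function name, the VM names and the `declares_ferm` input are
all hypothetical.

```python
def simulate_recovery(vms, declares_ferm):
    """Model the recovery: remove ferm everywhere, then let puppet reinstall it.

    vms: iterable of all VM names in the fleet (hypothetical input).
    declares_ferm: set of VM names whose puppet manifests include ferm
                   (hypothetical input).
    Returns the set of VMs that end up with ferm installed.
    """
    fleet = set(vms)
    installed = set(fleet)  # state during the outage: ferm on every VM
    installed -= fleet      # step 1: remove ferm from every VM
    # step 2: a puppet agent run installs ferm only where it is declared
    installed |= fleet & set(declares_ferm)
    return installed


if __name__ == "__main__":
    print(sorted(simulate_recovery(["vm-a", "vm-b", "vm-c"], {"vm-b"})))
```

In this model the end state depends only on the manifests, which is why
removing ferm everywhere first was safe.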

Please get in touch if you see any issues or have any questions about this
incident.

regards.

[0] https://phabricator.wikimedia.org/T153468
-- 
Arturo Borrero Gonzalez
SRE / Wikimedia Cloud Services
Wikimedia Foundation

___
Wikimedia Cloud Services mailing list
Cloud@lists.wikimedia.org (formerly lab...@lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/cloud

[Cloud] [Cloud-announce] Cloud VPS users, please claim your projects

2019-09-30 Thread Andrew Bogott
Every year or so the Cloud Services team tries to identify and clean up 
unused projects and VMs.  We do this via an opt-in process: anyone can 
mark a project as 'in use,' and that project will be preserved for 
another year.


I've created a wiki page that lists all existing projects, here:

https://wikitech.wikimedia.org/wiki/News/Cloud_VPS_2019_Purge

If you are a VPS user, please visit that page and mark any projects that 
you use as {{Used}}.  Note that it's not necessary for you to be a 
project admin to mark something -- if you know that you're currently 
using a resource and want to keep using it, go ahead and mark it 
accordingly.  If you /are/ a project admin, please take a moment to mark 
which VMs are or aren't used in your projects.


When December arrives, I will shut down unused projects and begin the 
process of reclaiming their resources.


If you think you use a VPS project but aren't sure which, I encourage 
you to poke around on https://tools.wmflabs.org/openstack-browser/ to 
see what looks familiar.  Worst case, just email 
cloud@lists.wikimedia.org with a description of your use case and we'll 
sort it out there.


If you only use Toolforge, you are free to ignore this task.

Thank you!

-Andrew and WMCS team

___
Wikimedia Cloud Services announce mailing list
cloud-annou...@lists.wikimedia.org (formerly labs-annou...@lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/cloud-announce