Re: [Hosting] Downtime for VMs on gprod5 7/18

2017-07-19 Thread Justin Dugger
Move completed, all VMs restarted successfully.

Users of mysql-vip2 may have seen a brief outage believed to be due to
a saturated network switch. We've move gprod5 to a separate switch to
alleviate this. Apologies!
--
Justin Dugger
Senior Systems Administrator
OSU Open Source Lab


On Thu, Jul 13, 2017 at 3:03 PM, Justin Dugger  wrote:
> At 10am on July 18th, all VMs running on gprod5 will be shut down.
>
> As part of the OSL's ongoing rack rotation datacenter efficiency
> project, VMs hosted on gprod5 will be offline for the duration of the
> move:
>
> bro-try.osuosl.org
> darcs.osuosl.org
> jenkins.inkscape.org
>
> Based on past experience, I expect the outage to last about an hour.
>
> Note this is the rescheduling of a move originally planned for 6/28.
>
> --
> Justin Dugger
> Senior Systems Administrator
> OSU Open Source Lab
___
Hosting mailing list
host...@osuosl.org
https://lists.osuosl.org/mailman/listinfo/hosting


Re: jenkins emergency restart notice - Tues 00:00 UTC

2017-07-19 Thread Mike Drob
Have the changes resulted in any noticeable improvements? Is there a JIRA
interested folks can follow along on?

On Tue, Jul 18, 2017 at 6:40 PM, Chris Lambertus  wrote:

> We’re trying a suggested downgrade of the credentials binding plugin which
> may be related to this. No guarantees, though. I will be restarting again
> tonight at 0100 UTC 19 July.
>
> -Chris
>
>
>
>
> > On Jul 18, 2017, at 9:00 AM, Chris Lambertus  wrote:
> >
> > This appears to be a bug in one of the new versions of either jenkins or
> a plugin. We are still looking into the issue. The limit was raised
> yesterday by a factor of 4.
> >
> >
> >
> >
> >> On Jul 18, 2017, at 4:08 AM, Geoff Macartney <
> geoff.macart...@cloudsoft.io> wrote:
> >>
> >> We're seeing them in Apache Brooklyn builds too, e.g.
> >> https://builds.apache.org/job/brooklyn-server-master/664.
> >>
> >> regards
> >> Geoff
> >>
> >>
> >>
> >> On Tue, 18 Jul 2017 at 12:03 Ismael Juma  wrote:
> >>
> >>> Hi,
> >>>
> >>> Thanks for the heads up. Was this done? We are still seeing "too many
> open
> >>> files" issues in Kafka builds. One example of many:
> >>>
> >>> https://builds.apache.org/job/kafka-trunk-jdk7/
> lastCompletedBuild/console
> >>>
> >>> Ismael
> >>>
> >>> On Mon, Jul 17, 2017 at 2:53 PM, Chris Lambertus 
> wrote:
> >>>
> 
>  Jenkins will be restarted at midnight tonight UTC to address problems
>  related to ‘too many open files’ and to attempt to correct a problem
> with
>  some windows agents. I will be stopping new builds so that the queue
> can
>  empty over the next 2 hours.
> 
>  -Chris
> 
> 
> >>>
> >
>
>


Re: jenkins emergency restart notice - Tues 00:00 UTC

2017-07-19 Thread Dominik Psenner
I filed https://issues.apache.org/jira/browse/INFRA-14628 which is probably
related to this since the exception message says "Too many open files".

2017-07-19 17:01 GMT+02:00 Mike Drob :

> Have the changes resulted in any noticeable improvements? Is there a JIRA
> interested folks can follow along on?
>
> On Tue, Jul 18, 2017 at 6:40 PM, Chris Lambertus  wrote:
>
> > We’re trying a suggested downgrade of the credentials binding plugin
> which
> > may be related to this. No guarantees, though. I will be restarting again
> > tonight at 0100 UTC 19 July.
> >
> > -Chris
> >
> >
> >
> >
> > > On Jul 18, 2017, at 9:00 AM, Chris Lambertus  wrote:
> > >
> > > This appears to be a bug in one of the new versions of either jenkins
> or
> > a plugin. We are still looking into the issue. The limit was raised
> > yesterday by a factor of 4.
> > >
> > >
> > >
> > >
> > >> On Jul 18, 2017, at 4:08 AM, Geoff Macartney <
> > geoff.macart...@cloudsoft.io> wrote:
> > >>
> > >> We're seeing them in Apache Brooklyn builds too, e.g.
> > >> https://builds.apache.org/job/brooklyn-server-master/664.
> > >>
> > >> regards
> > >> Geoff
> > >>
> > >>
> > >>
> > >> On Tue, 18 Jul 2017 at 12:03 Ismael Juma  wrote:
> > >>
> > >>> Hi,
> > >>>
> > >>> Thanks for the heads up. Was this done? We are still seeing "too many
> > open
> > >>> files" issues in Kafka builds. One example of many:
> > >>>
> > >>> https://builds.apache.org/job/kafka-trunk-jdk7/
> > lastCompletedBuild/console
> > >>>
> > >>> Ismael
> > >>>
> > >>> On Mon, Jul 17, 2017 at 2:53 PM, Chris Lambertus 
> > wrote:
> > >>>
> > 
> >  Jenkins will be restarted at midnight tonight UTC to address
> problems
> >  related to ‘too many open files’ and to attempt to correct a problem
> > with
> >  some windows agents. I will be stopping new builds so that the queue
> > can
> >  empty over the next 2 hours.
> > 
> >  -Chris
> > 
> > 
> > >>>
> > >
> >
> >
>



-- 
Dominik Psenner


Re: jenkins emergency restart notice - Tues 00:00 UTC

2017-07-19 Thread Geoff Macartney
Apache Brooklyn builds seem to be ok now, thanks.

On Wed, 19 Jul 2017, 16:09 Dominik Psenner,  wrote:

> I filed https://issues.apache.org/jira/browse/INFRA-14628 which is
> probably
> related to this since the exception message says "Too many open files".
>
> 2017-07-19 17:01 GMT+02:00 Mike Drob :
>
> > Have the changes resulted in any noticeable improvements? Is there a JIRA
> > interested folks can follow along on?
> >
> > On Tue, Jul 18, 2017 at 6:40 PM, Chris Lambertus  wrote:
> >
> > > We’re trying a suggested downgrade of the credentials binding plugin
> > which
> > > may be related to this. No guarantees, though. I will be restarting
> again
> > > tonight at 0100 UTC 19 July.
> > >
> > > -Chris
> > >
> > >
> > >
> > >
> > > > On Jul 18, 2017, at 9:00 AM, Chris Lambertus  wrote:
> > > >
> > > > This appears to be a bug in one of the new versions of either jenkins
> > or
> > > a plugin. We are still looking into the issue. The limit was raised
> > > yesterday by a factor of 4.
> > > >
> > > >
> > > >
> > > >
> > > >> On Jul 18, 2017, at 4:08 AM, Geoff Macartney <
> > > geoff.macart...@cloudsoft.io> wrote:
> > > >>
> > > >> We're seeing them in Apache Brooklyn builds too, e.g.
> > > >> https://builds.apache.org/job/brooklyn-server-master/664.
> > > >>
> > > >> regards
> > > >> Geoff
> > > >>
> > > >>
> > > >>
> > > >> On Tue, 18 Jul 2017 at 12:03 Ismael Juma  wrote:
> > > >>
> > > >>> Hi,
> > > >>>
> > > >>> Thanks for the heads up. Was this done? We are still seeing "too
> many
> > > open
> > > >>> files" issues in Kafka builds. One example of many:
> > > >>>
> > > >>> https://builds.apache.org/job/kafka-trunk-jdk7/
> > > lastCompletedBuild/console
> > > >>>
> > > >>> Ismael
> > > >>>
> > > >>> On Mon, Jul 17, 2017 at 2:53 PM, Chris Lambertus 
> > > wrote:
> > > >>>
> > > 
> > >  Jenkins will be restarted at midnight tonight UTC to address
> > problems
> > >  related to ‘too many open files’ and to attempt to correct a
> problem
> > > with
> > >  some windows agents. I will be stopping new builds so that the
> queue
> > > can
> > >  empty over the next 2 hours.
> > > 
> > >  -Chris
> > > 
> > > 
> > > >>>
> > > >
> > >
> > >
> >
>
>
>
> --
> Dominik Psenner
>


Re: jenkins emergency restart notice - Tues 00:00 UTC

2017-07-19 Thread Chris Lambertus
I just commented on that ticket. Looks like this was primarily due to the 
switch to systemd startup in ubuntu 16.04 and the need to update 
DefaultLimitNOFILE in /etc/systemd/system.conf (grr) to affect processes 
started via systemctl. Thanks Joan Touzet for pointing this out.

-Chris



> On Jul 19, 2017, at 8:47 AM, Geoff Macartney  
> wrote:
> 
> Apache Brooklyn builds seem to be ok now, thanks.
> 
> On Wed, 19 Jul 2017, 16:09 Dominik Psenner,  wrote:
> 
>> I filed https://issues.apache.org/jira/browse/INFRA-14628 which is
>> probably
>> related to this since the exception message says "Too many open files".
>> 
>> 2017-07-19 17:01 GMT+02:00 Mike Drob :
>> 
>>> Have the changes resulted in any noticeable improvements? Is there a JIRA
>>> interested folks can follow along on?
>>> 
>>> On Tue, Jul 18, 2017 at 6:40 PM, Chris Lambertus  wrote:
>>> 
 We’re trying a suggested downgrade of the credentials binding plugin
>>> which
 may be related to this. No guarantees, though. I will be restarting
>> again
 tonight at 0100 UTC 19 July.
 
 -Chris
 
 
 
 
> On Jul 18, 2017, at 9:00 AM, Chris Lambertus  wrote:
> 
> This appears to be a bug in one of the new versions of either jenkins
>>> or
 a plugin. We are still looking into the issue. The limit was raised
 yesterday by a factor of 4.
> 
> 
> 
> 
>> On Jul 18, 2017, at 4:08 AM, Geoff Macartney <
 geoff.macart...@cloudsoft.io> wrote:
>> 
>> We're seeing them in Apache Brooklyn builds too, e.g.
>> https://builds.apache.org/job/brooklyn-server-master/664.
>> 
>> regards
>> Geoff
>> 
>> 
>> 
>> On Tue, 18 Jul 2017 at 12:03 Ismael Juma  wrote:
>> 
>>> Hi,
>>> 
>>> Thanks for the heads up. Was this done? We are still seeing "too
>> many
 open
>>> files" issues in Kafka builds. One example of many:
>>> 
>>> https://builds.apache.org/job/kafka-trunk-jdk7/
 lastCompletedBuild/console
>>> 
>>> Ismael
>>> 
>>> On Mon, Jul 17, 2017 at 2:53 PM, Chris Lambertus 
 wrote:
>>> 
 
 Jenkins will be restarted at midnight tonight UTC to address
>>> problems
 related to ‘too many open files’ and to attempt to correct a
>> problem
 with
 some windows agents. I will be stopping new builds so that the
>> queue
 can
 empty over the next 2 hours.
 
 -Chris
 
 
>>> 
> 
 
 
>>> 
>> 
>> 
>> 
>> --
>> Dominik Psenner
>> 



signature.asc
Description: Message signed with OpenPGP