i'll take a look at this today.

On Mon, Sep 8, 2014 at 1:13 AM, Josh Rosen <rosenvi...@gmail.com> wrote:

> Yeah, I think https://github.com/apache/spark/pull/2315 should have fixed
> the MiMa issue.  We're still seeing some intermittent failures due to
> DriverSuite and SparkSubmitSuite tests failing, so I'd appreciate any help
> in diagnosing that issue.
>
> On Sun, Sep 7, 2014 at 10:08 PM, Prashant Sharma <scrapco...@gmail.com>
> wrote:
>
>> Looks like this is already taken care of?
>>
>> Prashant Sharma
>>
>>
>>
>> On Mon, Sep 8, 2014 at 4:37 AM, Josh Rosen <rosenvi...@gmail.com> wrote:
>>
>>> Does anyone know why some of the MiMa tests have started failing?
>>>
>>> See
>>> https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/19948/consoleFull
>>> for an example.
>>>
>>> On September 6, 2014 at 12:48:27 PM, Josh Rosen (rosenvi...@gmail.com)
>>> wrote:
>>>
>>> It looks like Jenkins is up and running, but there seems to be a delay
>>> in responding to requests to re-test patches.  It seems like Jenkins is
>>> promptly testing new PRs, or new commits as they’re added to existing PRs,
>>> but taking a very long time to respond to requests to re-test PRs.
>>>
>>> I’m going to continue monitoring this today.  I’m considering creating
>>> my own fork of the Jenkins pull request builder plugin so that we can add
>>> extra logging in order to diagnose what’s causing this lag.
>>>
>>> - Josh
>>> On September 5, 2014 at 5:49:32 PM, Josh Rosen (rosenvi...@gmail.com)
>>> wrote:
>>>
>>> We have successfully purged Jenkins’ build queue.  If you want a PR to
>>> be re-tested, please ask Jenkins again.
>>>
>>> On September 5, 2014 at 5:36:30 PM, shane knapp (skn...@berkeley.edu)
>>> wrote:
>>>
>>> yeah, it was a problem w/the PRB's OAuth key. josh rosen added a new key,
>>> and magique!
>>>
>>> we're about to clear the queue of all builds as most aren't
>>> wanted/needed.
>>>
>>>
>>> On Fri, Sep 5, 2014 at 5:33 PM, Nicholas Chammas <
>>> nicholas.cham...@gmail.com
>>> > wrote:
>>>
>>> > Looks like Jenkins is back!
>>> >
>>> > lol The poor guy has like a million builds
>>> > <
>>> https://amplab.cs.berkeley.edu/jenkins/view/Pull%20Request%20Builders/job/SparkPullRequestBuilder/
>>> >
>>> > to catch up on.
>>> >
>>> >
>>> > On Fri, Sep 5, 2014 at 4:15 PM, Nicholas Chammas <
>>> > nicholas.cham...@gmail.com> wrote:
>>> >
>>> >> How's it going?
>>> >>
>>> >> It looks like during the last build
>>> >> <
>>> https://amplab.cs.berkeley.edu/jenkins/view/Pull%20Request%20Builders/job/SparkPullRequestBuilder/lastBuild/console
>>> >
>>> >> from about 30 min ago Jenkins was still having trouble fetching from
>>> >> GitHub. It also looks like not all requests for testing are triggering
>>> >> builds.
>>> >>
>>> >>
>>> >> On Fri, Sep 5, 2014 at 1:23 PM, shane knapp <skn...@berkeley.edu>
>>> wrote:
>>> >>
>>> >>> it's looking like everything except the pull request builders is
>>> >>> working. i'm going to be working on getting this resolved today.
>>> >>>
>>> >>>
>>> >>> On Fri, Sep 5, 2014 at 8:18 AM, Nicholas Chammas <
>>> >>> nicholas.cham...@gmail.com> wrote:
>>> >>>
>>> >>>> Hmm, looks like at least some builds
>>> >>>> <
>>> https://amplab.cs.berkeley.edu/jenkins/view/Pull%20Request%20Builders/job/SparkPullRequestBuilder/19804/consoleFull
>>> >
>>> >>>> are working now, though this last one was from ~5 hours ago.
>>> >>>>
>>> >>>>
>>> >>>> On Fri, Sep 5, 2014 at 1:02 AM, shane knapp <skn...@berkeley.edu>
>>> >>>> wrote:
>>> >>>>
>>> >>>>> yep. that's exactly the behavior i saw earlier, and will be figuring
>>> >>>>> out first thing tomorrow morning. i bet it's an environment issue on the
>>> >>>>> slaves.
>>> >>>>>
>>> >>>>>
>>> >>>>> On Thu, Sep 4, 2014 at 7:10 PM, Nicholas Chammas <
>>> >>>>> nicholas.cham...@gmail.com> wrote:
>>> >>>>>
>>> >>>>>> Looks like during the last build
>>> >>>>>> <
>>> https://amplab.cs.berkeley.edu/jenkins/view/Pull%20Request%20Builders/job/SparkPullRequestBuilder/19797/console
>>> >
>>> >>>>>> Jenkins was unable to execute a git fetch?
>>> >>>>>>
>>> >>>>>>
>>> >>>>>> On Thu, Sep 4, 2014 at 7:58 PM, shane knapp <skn...@berkeley.edu>
>>> >>>>>> wrote:
>>> >>>>>>
>>> >>>>>>> i'm going to restart jenkins and see if that fixes things.
>>> >>>>>>>
>>> >>>>>>>
>>> >>>>>>> On Thu, Sep 4, 2014 at 4:56 PM, shane knapp <skn...@berkeley.edu
>>> >
>>> >>>>>>> wrote:
>>> >>>>>>>
>>> >>>>>>>> looking
>>> >>>>>>>>
>>> >>>>>>>>
>>> >>>>>>>> On Thu, Sep 4, 2014 at 4:21 PM, Nicholas Chammas <
>>> >>>>>>>> nicholas.cham...@gmail.com> wrote:
>>> >>>>>>>>
>>> >>>>>>>>> It appears that our main man is having trouble
>>> >>>>>>>>> <
>>> https://amplab.cs.berkeley.edu/jenkins/view/Pull%20Request%20Builders/job/SparkPullRequestBuilder/
>>> >
>>> >>>>>>>>> hearing new requests
>>> >>>>>>>>> <
>>> https://github.com/apache/spark/pull/2277#issuecomment-54549106>.
>>> >>>>>>>>>
>>> >>>>>>>>> Do we need some smelling salts?
>>> >>>>>>>>>
>>> >>>>>>>>>
>>> >>>>>>>>> On Thu, Sep 4, 2014 at 5:49 PM, shane knapp <
>>> skn...@berkeley.edu>
>>> >>>>>>>>> wrote:
>>> >>>>>>>>>
>>> >>>>>>>>>> i'd ping the Jenkinmensch... the master was completely offline,
>>> >>>>>>>>>> so any new jobs wouldn't have reached it. any jobs that were
>>> >>>>>>>>>> queued when power was lost probably started up, but jobs that
>>> >>>>>>>>>> were running would fail.
>>> >>>>>>>>>>
>>> >>>>>>>>>>
>>> >>>>>>>>>> On Thu, Sep 4, 2014 at 2:45 PM, Nicholas Chammas <
>>> >>>>>>>>>> nicholas.cham...@gmail.com
>>> >>>>>>>>>> > wrote:
>>> >>>>>>>>>>
>>> >>>>>>>>>> > Woohoo! Thanks Shane.
>>> >>>>>>>>>> >
>>> >>>>>>>>>> > Do you know if queued PR builds will automatically be picked up? Or do we
>>> >>>>>>>>>> > have to ping the Jenkinmensch manually from each PR?
>>> >>>>>>>>>> >
>>> >>>>>>>>>> > Nick
>>> >>>>>>>>>> >
>>> >>>>>>>>>> >
>>> >>>>>>>>>> > On Thu, Sep 4, 2014 at 5:37 PM, shane knapp <
>>> >>>>>>>>>> skn...@berkeley.edu> wrote:
>>> >>>>>>>>>> >
>>> >>>>>>>>>> >> AND WE'RE UP!
>>> >>>>>>>>>> >>
>>> >>>>>>>>>> >> sorry that this took so long... i'll send out a more detailed explanation
>>> >>>>>>>>>> >> of what happened soon.
>>> >>>>>>>>>> >>
>>> >>>>>>>>>> >> now, off to back up jenkins.
>>> >>>>>>>>>> >>
>>> >>>>>>>>>> >> shane
>>> >>>>>>>>>> >>
>>> >>>>>>>>>> >>
>>> >>>>>>>>>> >> On Thu, Sep 4, 2014 at 1:27 PM, shane knapp <
>>> >>>>>>>>>> skn...@berkeley.edu> wrote:
>>> >>>>>>>>>> >>
>>> >>>>>>>>>> >> > it's a faulty power switch on the firewall, which has been swapped out.
>>> >>>>>>>>>> >> > we're about to reboot and be good to go.
>>> >>>>>>>>>> >> >
>>> >>>>>>>>>> >> >
>>> >>>>>>>>>> >> > On Thu, Sep 4, 2014 at 1:19 PM, shane knapp <
>>> >>>>>>>>>> skn...@berkeley.edu>
>>> >>>>>>>>>> >> wrote:
>>> >>>>>>>>>> >> >
>>> >>>>>>>>>> >> >> looks like some hardware failed, and we're swapping in a replacement. i
>>> >>>>>>>>>> >> >> don't have more specific information yet -- including *what* failed, as our
>>> >>>>>>>>>> >> >> sysadmin is super busy ATM. the root cause was an incorrect circuit being
>>> >>>>>>>>>> >> >> switched off during building maintenance.
>>> >>>>>>>>>> >> >>
>>> >>>>>>>>>> >> >> on a side note, this incident will be accelerating our plan to move the
>>> >>>>>>>>>> >> >> entire jenkins infrastructure into a managed datacenter environment. this
>>> >>>>>>>>>> >> >> will be our major push over the next couple of weeks. more details about
>>> >>>>>>>>>> >> >> this, also, as soon as i get them.
>>> >>>>>>>>>> >> >>
>>> >>>>>>>>>> >> >> i'm very sorry about the downtime, we'll get everything up and running
>>> >>>>>>>>>> >> >> ASAP.
>>> >>>>>>>>>> >> >>
>>> >>>>>>>>>> >> >>
>>> >>>>>>>>>> >> >> On Thu, Sep 4, 2014 at 12:27 PM, shane knapp <
>>> >>>>>>>>>> skn...@berkeley.edu>
>>> >>>>>>>>>> >> wrote:
>>> >>>>>>>>>> >> >>
>>> >>>>>>>>>> >> >>> looks like a power outage in soda hall. more updates as they happen.
>>> >>>>>>>>>> >> >>>
>>> >>>>>>>>>> >> >>>
>>> >>>>>>>>>> >> >>> On Thu, Sep 4, 2014 at 12:25 PM, shane knapp <
>>> >>>>>>>>>> skn...@berkeley.edu>
>>> >>>>>>>>>> >> >>> wrote:
>>> >>>>>>>>>> >> >>>
>>> >>>>>>>>>> >> >>>> i am trying to get things up and running, but it looks like either the
>>> >>>>>>>>>> >> >>>> firewall gateway or jenkins server itself is down. i'll update as soon as
>>> >>>>>>>>>> >> >>>> i know more.
>>> >>>>>>>>>> >> >>>>
>>> >>>>>>>>>> >> >>>
>>> >>>>>>>>>> >> >>>
>>> >>>>>>>>>> >> >>
>>> >>>>>>>>>> >> >
>>> >>>>>>>>>> >>
>>> >>>>>>>>>> >
>>> >>>>>>>>>> >
>>> >>>>>>>>>>
>>> >>>>>>>>>
>>> >>>>>>>>>
>>> >>>>>>>>
>>> >>>>>>>
>>> >>>>>>
>>> >>>>>
>>> >>>>
>>> >>>
>>> >>
>>> >
>>>
>>
>>
>
