I can make a commitment to help getting cells passing.  Basically, I'd like to 
do whatever I can to make sure we can have a useful gate on cells.  
Unfortunately I'm going to be mostly offline for the next 10 days or so, 
however. :)

I thought there was a sec group patch up for cells, but I've not fully reviewed 
it.  

The generic "cannot communicate with cell 'child'" almost sounds like some 
other basic issue.... I'll see if I can take a peak during my layovers tonight.

On Jul 13, 2013, at 8:28 AM, Sean Dague <s...@dague.net> wrote:

> On 07/13/2013 10:50 AM, Dan Smith wrote:
>>> Currently cells can even get past devstack exercises, which
>>> are very
>>> minor sanity checks for the environment (nothing tricky).
>> 
>> I thought that the plan was to deprecate the devstack exercises and
>> just use tempest. Is that not the case? I'd bet that the devstack
>> exercises are just not even on anyone's radar. Since the excellent work
>> you QA folks did to harden those tests before grizzly, I expect most
>> people take them for granted now :)
>> 
>> Digging into the logs just a bit, I see what looks like early failures
>> related to missing security group issues in the cells manager log. I
>> know there are some specific requirements in how things have to be set
>> up for cells, so I think it's likely that we'll need to do some
>> tweaking of configs to get all of this right.
>> 
>> We enabled the test knowing that it wasn't going to pass for a while,
>> and it's only been running for less than 24 hours. In the same way that
>> the grenade job had (until recently) been failing on everything, the
>> point of enabling the cells test now is so that we can start iterating
>> on fixes so that we can hopefully have some amount of regular test
>> coverage before havana.
> 
> Like I said, as long as someone is going to work on it, I'm happy. :) I just 
> don't want this to be an enable the tests and hope magically fairies come to 
> fix them issue. That's what we did on full neutron tests, and it's been 
> bouncing around like that for a while.
> 
> We are planning on disabling the devstack exercises, it wasn't so much that, 
> it's that it looks like there is fundamental lack of functioning nova on 
> devstack for cells right now. The security groups stack trace is just a side 
> effect of cells falling over in a really low level way (this is what's before 
> and after the trace).
> 
> 2013-07-13 00:12:18.605 ERROR nova.cells.scheduler 
> [req-dcbb868c-98a7-4d65-94b3-e1234c50e623 demo demo] Couldn't communicate 
> with cell 'child'
> ....
> 2013-07-13 00:12:18.606 ERROR nova.cells.scheduler 
> [req-dcbb868c-98a7-4d65-94b3-e1234c50e623 demo demo] Couldn't communicate 
> with any cells
> 
> Again, mostly I want to know that we've got a blueprint or bug that's high 
> priority and someone's working on it. It did take a while to get grenade 
> there (we're 2 bugs away from being able to do it repeatably in the gate), 
> but during that time we did have people working on it. It just takes a while 
> to get to the bottom of these issues some times, so I want people to have a 
> realistic expectation on how quickly we'll go from running upstream to gating.
> 
>    -Sean
> 
> -- 
> Sean Dague
> http://dague.net
> 
> _______________________________________________
> OpenStack-dev mailing list
> OpenStack-dev@lists.openstack.org
> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

_______________________________________________
OpenStack-dev mailing list
OpenStack-dev@lists.openstack.org
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

Reply via email to