Re: Bootstrap question

Anthony Molinaro Sun, 18 Jul 2010 12:02:36 -0700

Yeah, I tried all that already and it didn't seem to work, no new nodes
will bootstrap, which makes me think there's some saved state somewhere,
preventing a new node from bootstrapping.  I think maybe the Location
sstables?  Is it safe to nuke those on all hosts and restart everything?
(I just don't want to lose actual data).


Thanks for the ideas,

-Anthony

On Sun, Jul 18, 2010 at 08:09:45PM +0300, shimi wrote:
> If I have problems with never ending bootstraping I do the following. I try
> each one if it doesn't help I try the next. It might not be the right thing
> to do but it worked for me.
> 
> 1. Restart the bootstraping node
> 2. If I see streaming 0/xxxx I restart the node and all the streaming nodes
> 3. Restart all the nodes
> 4. If there is data in the bootstraing node I delete it before I restart.
> 
> Good luck
> Shimi
> 
> On Sun, Jul 18, 2010 at 12:21 AM, Anthony Molinaro <
> antho...@alumni.caltech.edu> wrote:
> 
> > So still waiting for any sort of answer on this one.  The cluster still
> > refuses to do anything when I bring up new nodes.  I shut down all the
> > new nodes and am waiting.  I'm guessing that maybe the old nodes have
> > some state which needs to get cleared out?  Is there anything I can do
> > at this point?  Are there alternate strategies for bootstrapping I can
> > try?  (For instance can I just scp all the sstables to all the new
> > nodes and do a repair, would that actually work?).
> >
> > Anyone seen this sort of issue?  All this is with 0.6.3 so I assume
> > eventually others will see this issue.
> >
> > -Anthony
> >
> > On Thu, Jul 15, 2010 at 10:45:08PM -0700, Anthony Molinaro wrote:
> > > Okay, so things were pretty messed up.  I shut down all the new nodes,
> > > then the old nodes started doing the half the ring is down garbage which
> > > pretty much requires a full restart of everything.  So I had to shut
> > > everything down, then bring the seed back, then the rest of the nodes,
> > > so they finally all agreed on the ring again.
> > >
> > > Then I started one of the new nodes, and have been watching the logs, so
> > > far 2 hours since the "Bootstrapping" message appeared in the new
> > > log and nothing has happened.  No anticompaction messages anywhere,
> > there's
> > > one node compacting, but its on the other end of the ring, so no where
> > near
> > > that new node.  I'm wondering if it will ever get data at this point.
> > >
> > > Is there something else I should try?  The only thing I can think of
> > > is deleting the system directory on the new node, and restarting, so
> > > I'll try that and see if it does anything.
> > >
> > > -Anthony
> > >
> > > On Thu, Jul 15, 2010 at 03:43:49PM -0500, Jonathan Ellis wrote:
> > > > On Thu, Jul 15, 2010 at 3:28 PM, Anthony Molinaro
> > > > <antho...@alumni.caltech.edu> wrote:
> > > > > Is the fact that 2 new nodes are in the range messing it up?
> > > >
> > > > Probably.
> > > >
> > > > >  And if so
> > > > > how do I recover (I'm thinking, shutdown new nodes 2,3,4,5, the
> > bringing
> > > > > up nodes 2,4, waiting for them to finish, then bringing up 3,5?).
> > > >
> > > > Yes.
> > > >
> > > > You might have to restart the old nodes too to clear out the confusion.
> > > >
> > > > --
> > > > Jonathan Ellis
> > > > Project Chair, Apache Cassandra
> > > > co-founder of Riptano, the source for professional Cassandra support
> > > > http://riptano.com
> > >
> > > --
> > > ------------------------------------------------------------------------
> > > Anthony Molinaro                           <antho...@alumni.caltech.edu>
> >
> > --
> > ------------------------------------------------------------------------
> > Anthony Molinaro                           <antho...@alumni.caltech.edu>
> >

-- 
------------------------------------------------------------------------
Anthony Molinaro                           <antho...@alumni.caltech.edu>

Re: Bootstrap question

Reply via email to