Re: strange gossip messages after node reboot with different ip

Piavlo Tue, 08 May 2012 00:13:02 -0700

On 05/01/2012 04:16 AM, aaron morton wrote:

Gossip information about a node can stay in the cluster for up to 3days. How long has this been going on for ?

This has been going for over a week already without any signs of slowdown, all nodes that have changed ip popup as UP/DEAD endlessly.

Any ideas?


Thanks

I'm unsure if this is expected behaviour. But it sounds like Gossip iskicking out the phantom node correctly.
Can you use nodetool gossipinfo on the nodes to capture some artefactswhile it is still running?
How come the old ip 10.63.14.214 still popup as UP and then declaredas DEAD again, an so on and on?
I think this is gossip bouncing information about the node around.Once it has been observed as dead for 3 days it should be purged.
Another question, if node is recognised as new (due to ip change) butwith same token - will other nodes stream the hinted handoffs to it?
Hints are stored against the token, not the end point address. When anode comes up the process is reversed and the end point is mapped toit's (new) token.
And is there way to tell cassandra also use names and if ip changesbut node name is the same and resolves to the new ip then the clustertreat it as old node?
Not that I am aware of. It's designed to handle IP addresses changing.AFAIK the log messages are not indicative of a fault. Instead theyindicate something odd happening with Gossip that is being correctlyhandled.
Hope that helps.
-----------------
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 1/05/2012, at 3:09 AM, Piavlo wrote:
Hi,

We have a cassandra cluster in ec2.
If i stop a node and start it - as a result the node ip changes. Thenode is recognised as NEW node and is declared as replacing theprevious node with same token.(But this is the same node of course)
In this specific case the node ip before stop/start was 10.63.14.214and new ip is 10.54.81.14.And even that the cluster and node seems to be working fine for morethan a day after the stop/start of this node, I see the followingloop of messages ~ once every minute.
INFO [GossipStage:1] 2012-04-30 14:18:57,089 Gossiper.java (line 838)Node /10.63.14.214 is now part of the clusterINFO [GossipStage:1] 2012-04-30 14:18:57,089 Gossiper.java (line 804)InetAddress /10.63.14.214 is now UPINFO [GossipStage:1] 2012-04-30 14:18:57,090 StorageService.java(line 1017) Nodes /10.63.14.214 and cassa1a.internal/10.54.81.14 havethe same token 0. Ignoring /10.63.14.214INFO [GossipTasks:1] 2012-04-30 14:19:11,834 Gossiper.java (line 818)InetAddress /10.63.14.214 is now dead.INFO [GossipTasks:1] 2012-04-30 14:19:27,896 Gossiper.java (line 632)FatClient /10.63.14.214 has been silent for 30000ms, removing from gossipINFO [GossipStage:1] 2012-04-30 14:20:30,803 Gossiper.java (line 838)Node /10.63.14.214 is now part of the cluster
...
How come the old ip 10.63.14.214 still popup as UP and then declaredas DEAD again, an so on and on?I know since this is ec2 other node with same ip can come UP, buti've verified and there is no such node and it certainly does not runcassandra :)
I stop/started another node and observe similar behaviour.
This is version 1.0.8
Another question, if node is recognised as new (due to ip change) butwith same token - will other nodes stream the hinted handoffs to it?And is there way to tell cassandra also use names and if ip changesbut node name is the same and resolves to the new ip then the clustertreat it as old node?
Thanks
Alex

Re: strange gossip messages after node reboot with different ip

Reply via email to