After more than 9 hours, I restarted the node and reran the join command (data, cache and commitlog were not erased), and this time the node reached the Normal state in less than a second:

nodetool -h localhost ring
Address     DC   Rack  Status  State   Load       Owns   Token
                                                         155962751505430129087380028406227096917
10.0.0.61   DC1  RAC1  Up      Normal  72.28 GB   8.33%  0
10.0.0.62   DC1  RAC1  Up      Normal  72.18 GB   8.33%  14178431955039102644307275309657008810
10.0.1.61   DC2  RAC1  Up      Normal  71.88 GB   8.33%  28356863910078205288614550619314017621
10.0.1.62   DC2  RAC1  Up      Normal  107.41 GB  8.33%  42535295865117307932921825928971026432
10.0.0.63   DC1  RAC1  Up      Normal  72.21 GB   8.33%  56713727820156410577229101238628035242
10.0.0.64   DC1  RAC1  Up      Normal  71.91 GB   8.33%  70892159775195513221536376548285044053
10.0.1.63   DC2  RAC1  Up      Normal  72.81 GB   8.33%  85070591730234615865843651857942052864
10.0.1.70   DC2  RAC1  Up      Normal  104.76 KB  7.05%  97059533058214537253124691979661801455
10.0.1.64   DC2  RAC1  Up      Normal  71.23 GB   1.29%  99249023685273718510150927167599061674
10.0.0.65   DC1  RAC1  Up      Normal  71.68 GB   8.33%  113427455640312821154458202477256070485
10.0.0.66   DC1  RAC1  Up      Normal  72.12 GB   8.33%  127605887595351923798765477786913079296
10.0.1.65   DC2  RAC1  Up      Normal  71.37 GB   8.33%  141784319550391026443072753096570088106
10.0.1.66   DC2  RAC1  Up      Normal  70.94 GB   8.33%  155962751505430129087380028406227096917

Do you see the distribution? Isn't it weird? I have done this many times, and each time the first join raises an error about not being able to join the seed nodes. It seems that the longer I wait, the higher the percentage of data the new node owns...
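
For anyone who wants to check the Owns column by hand, here is a minimal Python sketch. It assumes the cluster uses RandomPartitioner with a token space of 2**127 (the evenly spaced tokens in the ring output are consistent with that, but it is an assumption), and that each node owns the range from the previous token up to its own. The ownership() function and the ring dictionary are names made up for this illustration; they are not part of Cassandra or nodetool.

```python
from fractions import Fraction

# Assumption: RandomPartitioner, whose tokens live in [0, 2**127).
RING_SIZE = 2 ** 127

# Address -> token, copied from the nodetool ring output in this message.
ring = {
    "10.0.0.61": 0,
    "10.0.0.62": 14178431955039102644307275309657008810,
    "10.0.1.61": 28356863910078205288614550619314017621,
    "10.0.1.62": 42535295865117307932921825928971026432,
    "10.0.0.63": 56713727820156410577229101238628035242,
    "10.0.0.64": 70892159775195513221536376548285044053,
    "10.0.1.63": 85070591730234615865843651857942052864,
    "10.0.1.70": 97059533058214537253124691979661801455,
    "10.0.1.64": 99249023685273718510150927167599061674,
    "10.0.0.65": 113427455640312821154458202477256070485,
    "10.0.0.66": 127605887595351923798765477786913079296,
    "10.0.1.65": 141784319550391026443072753096570088106,
    "10.0.1.66": 155962751505430129087380028406227096917,
}


def ownership(tokens):
    """Map each token to the fraction of the ring between the previous token and it."""
    ordered = sorted(tokens)
    owned = {}
    for i, token in enumerate(ordered):
        prev = ordered[i - 1]  # for i == 0 this wraps around to the last token
        # A lone token owns the whole ring, hence the "or RING_SIZE".
        span = (token - prev) % RING_SIZE or RING_SIZE
        owned[token] = Fraction(span, RING_SIZE)
    return owned


owned = ownership(ring.values())
for address in ("10.0.1.70", "10.0.1.64"):
    print(address, f"{float(owned[ring[address]]) * 100:.2f}%")
```

Under these assumptions the new node 10.0.1.70 and its neighbour 10.0.1.64 together cover what used to be 10.0.1.64's 8.33% range, so the figures shown follow from where the new token landed inside that range. The sketch does not say why that particular token was picked.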



On 3/10/12 9:49 PM, Cyril Scetbon wrote:
Hi all,

When I try the join command, I get the following error:
Exception in thread "main" java.lang.RuntimeException: No other nodes seen! Unable to bootstrap.If you intended to start a single-node cluster, you should make sure your broadcast_address (or listen_address) is listed as a seed. Otherwise, you need to determine why the seed being contacted has no knowledge of the rest of the cluster. Usually, this can be solved by giving all nodes the same seed list.
    at org.apache.cassandra.dht.BootStrapper.getBootstrapSource(BootStrapper.java:168)
    at org.apache.cassandra.dht.BootStrapper.getBalancedToken(BootStrapper.java:150)

and I see the messages below in the seed's log:
==> /var/log/cassandra/output.log <==
INFO 19:55:39,554 InetAddress /10.0.1.70 is now dead.

==> /var/log/cassandra/system.log <==
INFO [GossipStage:1] 2012-03-10 19:55:39,554 Gossiper.java (line 818) InetAddress /10.0.1.70 is now dead.

However, the node I am trying to add does find the topology of the cluster:
loaded network topology /10.0.1.62:[Ljava.lang.String;@53533ff4, /10.0.1.63:[Ljava.lang.String;@7e8bd6e8, /10.0.1.61:[Ljava.lang.String;@eca5a40, /10.0.0.66:[Ljava.lang.String;@6eb2756, /10.0.0.65:[Ljava.lang.String;@386382a6, /10.0.0.64:[Ljava.lang.String;@d167d36, /10.0.0.61:[Ljava.lang.String;@3f766af0, /10.0.0.62:[Ljava.lang.String;@7766d31c, /10.0.0.63:[Ljava.lang.String;@7fae1081, /10.0.1.70:[Ljava.lang.String;@7cbdc6a3, /10.0.1.65:[Ljava.lang.String;@f3c3750, /10.0.1.64:[Ljava.lang.String;@3a536d00, /10.0.1.66:[Ljava.lang.String;@3b1938ea

The node says it is in joining mode but does not receive any token:

Mode: JOINING
Not sending any streams.
Not receiving any streams.
Pool Name                    Active   Pending      Completed
Commands                        n/a         0             12
Responses                       n/a         0           5490

Any ideas?



--
Cyril SCETBON
