I had a one-node Cassandra 2.1.3 cluster, where the output of nodetool
status looked like this:

Datacenter: datacenter1
=======================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
--  Address       Load       Tokens  Owns    Host ID                               Rack
UN  172.31.20.10  12.94 MB   256     ?       f803cae9-3f12-40c9-b681-caf4829b6bc6  rack1


Then I added another host to the cluster, and according to the logs it did
some bootstrapping:

INFO  [main] 2015-04-23 06:25:41,955 StorageService.java:1008 - JOINING: schema complete, ready to bootstrap
INFO  [main] 2015-04-23 06:25:41,955 StorageService.java:1008 - JOINING: calculation complete, ready to bootstrap
INFO  [main] 2015-04-23 06:25:41,956 StorageService.java:1008 - JOINING: getting bootstrap token
INFO  [main] 2015-04-23 06:26:11,999 StorageService.java:1008 - JOINING: Starting to bootstrap...
INFO  [main] 2015-04-23 06:26:12,159 StreamResultFuture.java:86 - [Stream #a2d70110-e981-11e4-90fe-03a9e0dac111] Executing streaming plan for Bootstrap
INFO  [main] 2015-04-23 06:26:13,225 StorageService.java:1037 - Bootstrap completed! for the tokens [-6649489682159922872,


But when I ran nodetool status after the new node had joined the cluster,
it looked like this:

Datacenter: datacenter1
=======================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
--  Address        Load       Tokens  Owns    Host ID                               Rack
UN  172.31.21.108  3.78 MB    256     ?       0fc3f5ac-c414-4340-b072-7d9959a28209  rack1
UN  172.31.20.10   15.45 MB   256     ?       f803cae9-3f12-40c9-b681-caf4829b6bc6  rack1


I was expecting the load on my original node to drop to about 6.5 MB while
the new node picked up about 6.5 MB, so that the two would be balanced.
Instead, the disk usage on my original node increased by about 2.5 MB, and
the new node picked up only 3.78 MB. Why didn't I get a balanced load? Why
did the load on my original node go up when I added another node? I didn't
write any data during the bootstrap. All of my keyspaces that hold a lot of
data have replication factor 1, so I think (and hope) the original node's
data wasn't simply replicated to the new node. Thanks!
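
For reference, here is the arithmetic behind my expectation as a small
Python sketch. The variable names are my own, and the even split between
the two nodes is only my assumption about how the data would be divided;
the load figures are copied from the nodetool status output above.

```python
# Loads reported by nodetool status, in MB (copied from the output above).
old_before = 12.94   # original node, before the new node joined
old_after = 15.45    # original node, after the new node joined
new_after = 3.78     # new node, after bootstrap

# Assumption: with replication factor 1 and 256 tokens on each node,
# the existing data would be split roughly evenly between the two nodes.
expected_per_node = old_before / 2

# What was actually observed.
growth_on_old = old_after - old_before
total_after = old_after + new_after

print(f"expected per node:       {expected_per_node:.2f} MB")
print(f"growth on original node: {growth_on_old:.2f} MB")
print(f"total load after join:   {total_after:.2f} MB")
```

So the total reported load went up after the join instead of staying near
the original 12.94 MB, which is the part I don't understand.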
