I also have this problem. My data on nodes grows to roughly 30GB. After a restart only 5GB remains. Is a factor 6 common for Cassandra?
2012/1/18 aaron morton <aa...@thelastpickle.com> > Good idea Jeremiah, are you using compression Michael ? > > Scanning through the CF stats this jumps out… > > Column Family: Attractions**** > SSTable count: 3**** > Space used (live): 27542876685**** > Space used (total): 1213220387 > Thats 25Gb of live data but only 1.3GB total. > > Otherwise want to see if a restart fixes it :) Would be interesting to > know if it's wrong from the start or drifts during streaming or compaction. > > Cheers > > ----------------- > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 18/01/2012, at 12:04 PM, Jeremiah Jordan wrote: > > There were some nodetool ring load reporting issues with early version of > 1.0.X don't remember when they were fixed, but that could be your issue. > Are you using compressed column families, a lot of the issues were with > those. > Might update to 1.0.7. > > -Jeremiah > > On 01/16/2012 04:04 AM, Michael Vaknine wrote: > > Hi,**** > > ** ** > > I have a 4 nodes cluster 1.0.3 version**** > > ** ** > > This is what I get when I run nodetool ring**** > > ** ** > > Address DC Rack Status State Load > Owns Token**** > > > 127605887595351923798765477786913079296**** > > 10.8.193.87 datacenter1 rack1 Up Normal 46.47 GB > 25.00% 0**** > > 10.5.7.76 datacenter1 rack1 Up Normal 48.01 GB > 25.00% 42535295865117307932921825928971026432**** > > 10.8.189.197 datacenter1 rack1 Up Normal 53.7 GB > 25.00% 85070591730234615865843651857942052864**** > > 10.5.3.17 datacenter1 rack1 Up Normal 43.49 GB > 25.00% 127605887595351923798765477786913079296**** > > ** ** > > I have finished running repair on all 4 nodes.**** > > ** ** > > I have less then 10 GB on the /var/lib/cassandra/data/ folders**** > > ** ** > > My question is Why nodetool reports almost 50 GB on each node?**** > > ** ** > > Thanks**** > > Michael**** > > >