Re: nodetool-compact duplicated data files again and again

2010-03-18 Thread Sylvain Lebresne
I believe this is caused by two things (and sorry if I go into too much details): 1) there is http://wiki.apache.org/cassandra/FAQ#i_deleted_what_gives. That is, Cassandra has to wait GCGraceSeconds before really remove physically deleted columns. And by default, this is 10 days. For "normal" colu

nodetool-compact duplicated data files again and again

2010-03-17 Thread Weijun Li
I'm testing the ExpiringColumn patch in 0.6-beta2, inserted 26GB data with TTL, after columns have expired I use get_slice to verify that no columns can be retrieved. When I run "nodetool compact" I think all data should be gone. But the problem is: 1) After the first nodetool-comact, Cassandra du