you are seeing this http://wiki.apache.org/cassandra/FAQ#range_ghosts

Lots of client API's and CQL 3 hide this from you now. 

Cheers
 
-----------------
Aaron Morton
Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 17/07/2013, at 1:52 PM, 杨辉强 <huiqiangy...@yunrang.com> wrote:

> Thanks, But Michael's answer confuse me more. 
> 
> I use list cf; in cassandra-cli. It seems lots of rows have been deleted, but 
> keys exist.
> 
> After the deletion, why the key still exists? It seems useless.
> 
> RowKey: 3030303031306365633862356437636365303861303433343137656531306435
> -------------------
> RowKey: 3030303031316333616336366531613636373735396363323037396331613230
> -------------------
> RowKey: 3030303031316333616336366531613637303964616364363630663865313433
> -------------------
> RowKey: 3030303031323934613637303239323563633133303238626330646666626335
> -------------------
> RowKey: 3030303031323934613637303239323566303733303638373138366334323436
> -------------------
> RowKey: 3030303031333838333139303930633664643364613331316664363134656639
> -------------------
> RowKey: 3030303031336265343639303630613938376333366230363439316336333230
> -------------------
> RowKey: 3030303031336365653735376465616334633932333363363832653130363733
> -------------------
> RowKey: 3030303031343632343261363966376464656235373266663761633233353065
> 
> 
> ----- 原始邮件 -----
> 发件人: "Michael Theroux" <mthero...@yahoo.com>
> 收件人: user@cassandra.apache.org
> 发送时间: 星期二, 2013年 7 月 16日 下午 10:23:32
> 主题: Re: Deletion use more space.
> 
> The only time information is removed from the filesystem is during 
> compaction.  Compaction can remove tombstones after gc_grace_seconds, which, 
> could result in reanimation of deleted data if the tombstone was never 
> properly replicated to other replicas.  Repair will make sure tombstones are 
> consistent amongst replicas.  However, tombstones can not be removed if the 
> data the tombstone is deleting is in another SSTable and has not yet been 
> removed. 
> 
> Hope this helps,
> -Mike
> 
> 
> On Jul 16, 2013, at 10:04 AM, Andrew Bialecki wrote:
> 
>> I don't think setting gc_grace_seconds to an hour is going to do what you'd 
>> expect. After gc_grace_seconds, if you haven't run a repair within that 
>> hour, the data you deleted will seem to have been undeleted.
>> 
>> Someone correct me if I'm wrong, but in order to order to completely delete 
>> data and regain the space it takes up, you need to "delete" it, which 
>> creates tombstones, and then run a repair on that column family within 
>> gc_grace_seconds. After that the data is actually gone and the space 
>> reclaimed.
>> 
>> 
>> On Tue, Jul 16, 2013 at 6:20 AM, 杨辉强 <huiqiangy...@yunrang.com> wrote:
>> Thank you!
>> It should be "update column family ScheduleInfoCF with gc_grace = 3600;"
>> Faint.
>> 
>> ----- 原始邮件 -----
>> 发件人: "杨辉强" <huiqiangy...@yunrang.com>
>> 收件人: user@cassandra.apache.org
>> 发送时间: 星期二, 2013年 7 月 16日 下午 6:15:12
>> 主题: Re: Deletion use more space.
>> 
>> Hi,
>>  I use the follow cmd to update gc_grace_seconds. It reports error! Why?
>> 
>> [default@WebSearch] update column family ScheduleInfoCF with 
>> gc_grace_seconds = 3600;
>> java.lang.IllegalArgumentException: No enum const class 
>> org.apache.cassandra.cli.CliClient$ColumnFamilyArgument.GC_GRACE_SECONDS
>> 
>> 
>> ----- 原始邮件 -----
>> 发件人: "Michał Michalski" <mich...@opera.com>
>> 收件人: user@cassandra.apache.org
>> 发送时间: 星期二, 2013年 7 月 16日 下午 5:51:49
>> 主题: Re: Deletion use more space.
>> 
>> Deletion is not really "removing" data, but it's adding tombstones
>> (markers) of deletion. They'll be later merged with existing data during
>> compaction and - in the end (see: gc_grace_seconds) - removed, but by
>> this time they'll take some space.
>> 
>> http://wiki.apache.org/cassandra/DistributedDeletes
>> 
>> M.
>> 
>> W dniu 16.07.2013 11:46, 杨辉强 pisze:
>>> Hi, all:
>>>   I use cassandra 1.2.4 and I have 4 nodes ring and use byte order 
>>> partitioner.
>>>   I had inserted about 200G data in the ring previous days.
>>> 
>>>   Today I write a program to scan the ring and then at the same time delete 
>>> the items that are scanned.
>>>   To my surprise, the cassandra cost more disk usage.
>>> 
>>>    Anybody can tell me why? Thanks.
>>> 
>> 

Reply via email to