OK, thanks a lot, Matthew.

On 18 February 2014 16:18, Matthew Von-Maszewski <matth...@basho.com> wrote:

> Riak 2.0 is coming.  Hold your mass delete until then.  The "bug" is
> within Google's original leveldb architecture.  Riak 2.0 sneaks around to
> get the disk space freed.
>
> Matthew
>
>
>
> On Feb 18, 2014, at 11:10 AM, Edgar Veiga <edgarmve...@gmail.com> wrote:
>
> The only/main purpose is to free disk space.
>
> I was a little concerned about this operation, and with your feedback I'm
> now leaning toward doing nothing; I can't risk the disk usage growing.
> As for the overhead, I think a tight throttling system would let me
> control the load and avoid overloading the cluster.
>
> Mixed feelings :S
>
>
>
> On 18 February 2014 15:45, Matthew Von-Maszewski <matth...@basho.com> wrote:
>
>> Edgar,
>>
>> The first "concern" I have is that leveldb's delete does not free disk
>> space.  Others have executed mass delete operations only to discover they
>> are now using more disk space instead of less.  Here is a discussion of the
>> problem:
>>
>> https://github.com/basho/leveldb/wiki/mv-aggressive-delete
>>
>> The link also describes Riak's database operation overhead.  This is a
>> second "concern".  You will need to carefully throttle your delete rate or
>> the overhead will likely impact your production throughput.
>>
>> We have new code to speed up the actual purge of deleted data in Riak
>> 2.0.  But that release is not quite ready for production use.
>>
>>
>> What do you hope to achieve by the mass delete?
>>
>> Matthew
>>
>>
>>
>>
>> On Feb 18, 2014, at 10:29 AM, Edgar Veiga <edgarmve...@gmail.com> wrote:
>>
>> Sorry, forgot that info!
>>
>> It's leveldb.
>>
>> Best regards
>>
>>
>> On 18 February 2014 15:27, Matthew Von-Maszewski <matth...@basho.com> wrote:
>>
>>> Which Riak backend are you using:  bitcask, leveldb, multi?
>>>
>>> Matthew
>>>
>>>
>>> On Feb 18, 2014, at 10:17 AM, Edgar Veiga <edgarmve...@gmail.com> wrote:
>>>
>>> > Hi all!
>>> >
>>> > I have a fairly trivial question about mass deletion on a Riak
>>> > cluster, but first let me give you some context. My cluster is
>>> > running Riak 1.4.6 on 6 machines with a ring size of 256 and 1 TB
>>> > SSD disks.
>>> >
>>> > I need to execute a massive object deletion on a bucket: ~1 billion
>>> > keys (the average object size is ~1 KB). I will not retrieve the
>>> > keys from Riak because I already have a file with all of them. I'll
>>> > just start a script that reads them from the file and triggers an
>>> > HTTP DELETE for each one.
>>> > The cluster will keep running in production under a fairly high load,
>>> > serving all other applications, while this deletion runs.
>>> >
>>> > My question is simple: do I need to take any extra precautions for
>>> > this operation? Would you advise paying special attention to any
>>> > metrics, either for Riak or for the servers it runs on?
>>> >
>>> > Best regards!
>>> > _______________________________________________
>>> > riak-users mailing list
>>> > riak-users@lists.basho.com
>>> > http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
>>>
>>>
>>
>>
>
>
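
For reference, the throttled delete script discussed in this thread (read
keys from a file, issue one HTTP DELETE per key, cap the request rate) could
be sketched roughly as below. This is only an illustration, not something
posted by anyone in the thread: the function names, the keys.txt file, the
bucket name, the node address on port 8098 and the rate of 200 deletes per
second are all assumptions. Note also Matthew's advice above to hold a mass
delete on leveldb until Riak 2.0.

```python
import time
import urllib.error
import urllib.parse
import urllib.request


def riak_delete(base_url, bucket, key, timeout=10.0):
    """Send one HTTP DELETE for bucket/key and return the HTTP status code."""
    url = "{}/buckets/{}/keys/{}".format(
        base_url.rstrip("/"),
        urllib.parse.quote(bucket, safe=""),
        urllib.parse.quote(key, safe=""),
    )
    req = urllib.request.Request(url, method="DELETE")
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # Non-2xx answers (for example 404 for a missing key) arrive here.
        return err.code


def throttled_delete(keys, delete_one, max_per_second,
                     clock=time.monotonic, sleep=time.sleep):
    """Call delete_one(key) for each key, never faster than max_per_second.

    Returns a dict mapping each status returned by delete_one to its count.
    clock and sleep are injectable so the pacing can be tested offline.
    """
    interval = 1.0 / max_per_second
    counts = {}
    next_slot = clock()
    for key in keys:
        delay = next_slot - clock()
        if delay > 0:
            sleep(delay)
        status = delete_one(key)
        counts[status] = counts.get(status, 0) + 1
        # Schedule the next request one interval after this slot; if the
        # request itself was slow, continue from "now" without bursting.
        next_slot = max(next_slot + interval, clock())
    return counts


# Hypothetical usage (file name, bucket, host and rate are placeholders):
#
#   with open("keys.txt") as fh:
#       keys = (line.strip() for line in fh if line.strip())
#       print(throttled_delete(
#           keys,
#           lambda k: riak_delete("http://127.0.0.1:8098", "mybucket", k),
#           max_per_second=200))
```

The rate cap is the main knob. As a rough feel for scale, at the purely
illustrative 200 deletes per second, ~1 billion keys would take about
5,000,000 seconds, or roughly 58 days, so the acceptable rate has to be
weighed against the production load described above.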