My experience for the last question is ... it depends. If you have NO
changes to the store (which I would argue could be abnormal, it's not in a
production  environment allowing writes) ... then I you can do a full
range/key scan and get no repeats. Factors that will impact scanning all
keys include replication factor, actual partitioning, and status of
replications.

If you are taking changes while scanning ... what exactly is the semantics
of the key scan that you'd expect? For example, would it be acceptable to
have a "low" key not be returned because it was in the data store before the
full scan ended but you've already gone past that key in your iteration?

I think that these questions apply to both ordered and random partitioners.

On Wed, Sep 15, 2010 at 3:30 PM, Michal Augustýn
<augustyn.mic...@gmail.com>wrote:

> And what about uniqueness? Can we be sure that we get each row just once?
>
> Thanks.
>
> 2010/9/15 Janne Jalkanen <janne.jalka...@ecyrd.com>
>
>
>> Correct. You can use get_range_slices with RandomPartitioner too, BUT the
>> iteration order is non-predictable, that is, you will not know in which
>> order you get the rows (RandomPartitioner would probably better be called
>> ObscurePartitioner - it ain't random, but it's as good as if it were ;-).
>> This I find to be mostly useful when you want to go through your entire
>> keyspace, like when doing something map-reduce like.  Or you just have a
>> fairly small CF.
>>
>> /Janne
>>
>>
>> On Sep 15, 2010, at 20:26 , Rana Aich wrote:
>>
>>  Hi All,
>>>
>>> I was under the impression that in order to query with get_range_slices
>>> one has to have a OrderPreservingPartitioner.
>>>
>>> Can we do get_range_slices with RandomPartitioner also? I can distinctly
>>> remember I read that(OrderPreservingPartitioner for get_range_slices) in
>>> Cassnadra WIKI but now somehow I'm not finding it anymore.
>>>
>>> Can anyone throw some light on it.
>>>
>>> Thanks and Regards,
>>>
>>> Rana
>>>
>>
>>
>

Reply via email to