Created CASSANDRA-1042.

On Sat, May 1, 2010 at 12:01 AM, Jonathan Ellis <jbel...@gmail.com> wrote:
> Can you create a ticket?
>
> On Fri, Apr 30, 2010 at 4:55 PM, Joost Ouwerkerk <jo...@openplaces.org> wrote:
>> There's a bug in ColumnFamilyRecordReader that appears when processing
>> a single split.  When the start and end tokens of the split are equal,
>> duplicate rows can be returned.
>>
>> Example with 5 rows:
>> token (start and end) = 53193025635115934196771903670925341736
>>
>> Tokens returned by first get_range_slices iteration:
>>  16955237001963240173058271559858726497
>>  40670782773005619916245995581909898190
>>  99079589977253916124855502156832923443
>>  144992942750327304334463589818972416113
>>  166860289390734216023086131251507064403
>>
>> Tokens returned by the next iteration (start token is the last token
>> from the previous iteration, end token is unchanged):
>>  16955237001963240173058271559858726497
>>  40670782773005619916245995581909898190
>>
>> Tokens returned by the final iteration (start token is the last token
>> from the previous iteration, end token is unchanged):
>>  [] (empty)
>>
>> In this example, the mapper has processed 7 rows in total, 2 of which
>> were duplicates.
>>
>> Joost.
>>
>
>
>
> --
> Jonathan Ellis
> Project Chair, Apache Cassandra
> co-founder of Riptano, the source for professional Cassandra support
> http://riptano.com
>
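
The paging behaviour described in the quoted report can be reproduced with a
small toy model. This is only a sketch under assumptions, not the actual
ColumnFamilyRecordReader code: fake_get_range_slices, in_range, read_split and
the batch size of 100 are hypothetical names and values introduced here. It
assumes the range is (start, end], that start >= end wraps around the ring, and
that results come back in ascending token order, as the first iteration in the
report suggests.

```python
# Toy model of the paging loop from the quoted report.
# fake_get_range_slices is a hypothetical stand-in, not the real Thrift call.

TOKENS = [
    16955237001963240173058271559858726497,
    40670782773005619916245995581909898190,
    99079589977253916124855502156832923443,
    144992942750327304334463589818972416113,
    166860289390734216023086131251507064403,
]
SPLIT_TOKEN = 53193025635115934196771903670925341736


def in_range(token, start, end):
    """True if token lies in (start, end]; start >= end wraps around the ring."""
    if start < end:
        return start < token <= end
    return token > start or token <= end


def fake_get_range_slices(start, end, count):
    """Assumed behaviour: matching tokens in ascending order, at most count."""
    return [t for t in sorted(TOKENS) if in_range(t, start, end)][:count]


def read_split(start, end, batch_size=100):
    """Page through the split, restarting from the last token seen each time."""
    seen = []
    while True:
        batch = fake_get_range_slices(start, end, batch_size)
        if not batch:
            return seen
        seen.extend(batch)
        start = batch[-1]  # the end token is left unchanged


rows = read_split(SPLIT_TOKEN, SPLIT_TOKEN)
print(len(rows), "rows read,", len(rows) - len(set(rows)), "duplicates")
```

With start and end both equal to the split token, the first call covers the
whole ring. The second call starts from the highest token and wraps around to
the split token again, so the two tokens below the split token come back a
second time.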
