Re: Updating (as opposed to just setting) Cassandra data via Hadoop

2010-05-07 Thread Ian Kallen
ean or are you referring to something else? Thanks! -Ian

Re: pagination through slices with deleted keys

2010-05-06 Thread Ian Kallen
> that have Columns matching our predicate. So if there aren't any, we just don't return that row. No need to check if the entire row is deleted.
>
> Mike
>
> On Thu, May 6, 2010 at 9:17 AM, Ian Kallen wrote:
>
>> I read the DistributedDeletes and t

Re: Updating (as opposed to just setting) Cassandra data via Hadoop

2010-05-06 Thread Ian Kallen
I have inputs that are text logs and I wrote a Cassandra OutputFormat; the reducers read the old values from their respective column families, increment the counts, and write back the new values. Since all of the writes are done by the Hadoop jobs and we're not running multiple jobs concurrently, th
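
The read, increment, write-back step described in this message can be sketched roughly as follows. This is only an illustration, not the poster's OutputFormat: the `store` dictionary is a hypothetical in-memory stand-in for a column family, and `reduce_counts` is an invented name. Because the read and the write are separate steps, the sketch is only safe under the condition the message states, namely that no other job writes the same counters at the same time.

```python
from collections import defaultdict

# Hypothetical in-memory stand-in for a column family:
# row key -> {column name -> count}. Not a real Cassandra client.
store = defaultdict(dict)

def reduce_counts(row_key, column, new_counts, cf=store):
    """Read the old count, add this job's counts, and write the total back.

    Not atomic: assumes a single writer, i.e. no concurrent jobs.
    """
    old = cf[row_key].get(column, 0)   # read the existing value (0 if absent)
    total = old + sum(new_counts)      # add the counts seen by this reducer
    cf[row_key][column] = total        # write the new value back
    return total

# Example: a counter that already holds 10 receives three new hits.
store["page:/index"]["hits"] = 10
reduce_counts("page:/index", "hits", [1, 1, 1])
```

If concurrent writers were ever introduced, this pattern could lose updates, since two reducers might read the same old value before either writes.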

Re: pagination through slices with deleted keys

2010-05-06 Thread Ian Kallen
> quick example of how to iterate over a CF that may have tombstones. This may help you out:
> http://markjgreene.wordpress.com/2010/05/05/iterate-over-entire-cassandra-column-family/
>
> On Thu, May 6, 2010 at 12:17 PM, Ian Kallen wrote:
>
>> I read the DistributedDe

pagination through slices with deleted keys

2010-05-06 Thread Ian Kallen
I read the DistributedDeletes and the range_ghosts FAQ entry on the wiki which do a good job describing how difficult deletion is in an eventually consistent system. But practical application strategies for dealing with it aren't there (that I saw). I'm wondering how folks implement pagination in t
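
One possible pagination strategy, sketched under stated assumptions: keep fetching batches from the range query, drop rows that come back with no columns (the "range ghosts" left by deletes), and stop once the page is full. `get_range_slice` and `fetch_page` are hypothetical names, the row store is a plain dictionary, and keys are assumed to come back in sorted key order with an inclusive start key. A real cluster's ordering depends on its partitioner.

```python
def get_range_slice(rows, start_key, count):
    """Hypothetical stand-in for a range query: up to `count` rows with
    key >= start_key, in key order. Deleted rows appear with no columns."""
    keys = sorted(k for k in rows if k >= start_key)
    return [(k, rows[k]) for k in keys[:count]]

def fetch_page(rows, start_key, page_size, batch_size=100):
    """Collect up to page_size live rows starting at start_key (inclusive).

    Returns (page, next_start). next_start is the key of the first live row
    of the following page, or None when the range is exhausted.
    """
    batch_size = max(2, batch_size)  # need at least 2 to advance past the cursor
    page = []
    cursor = start_key
    last_seen = None
    while True:
        batch = get_range_slice(rows, cursor, batch_size)
        for key, columns in batch:
            if key == last_seen:
                continue             # start key is inclusive; already handled
            if not columns:
                continue             # tombstoned row: skip the ghost
            if len(page) == page_size:
                return page, key     # page is full; this row starts the next one
            page.append((key, columns))
        if len(batch) < batch_size:
            return page, None        # short batch: nothing left in the range
        last_seen = cursor = batch[-1][0]

# Example: rows "b", "c" and "f" have been deleted and come back empty.
rows = {"a": {"c": 1}, "b": {}, "c": {}, "d": {"c": 2}, "e": {"c": 3}, "f": {}}
first_page, next_start = fetch_page(rows, "", 2, batch_size=2)
```

The caller passes `next_start` back in as `start_key` for the following page. A page can require several round trips when many consecutive rows are tombstoned, which is the cost of filtering ghosts on the client side.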