Re: ColumnFamilyInputFormat KeyRange scans on a CF

Utku Can Topçu Thu, 29 Apr 2010 16:04:50 -0700

I'm continuously writing collected data to Cassandra, using keys that
consist of a timestamp followed by a unique identifier (like
2009.01.01.00.00.00.RANDOM) so that I can query by time range.
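
As a rough illustration of this key layout, here is a minimal sketch in Python. The helper name `make_row_key` and the 8-character hex suffix are assumptions made for the example, not the actual production code. The point it shows is that zero-padded, fixed-width timestamp fields make plain string ordering agree with chronological ordering.

```python
import uuid
from datetime import datetime


def make_row_key(ts: datetime) -> str:
    """Build a key like 2009.01.01.00.00.00.<suffix> (hypothetical helper)."""
    # Fixed-width, zero-padded fields keep string order equal to time order.
    prefix = ts.strftime("%Y.%m.%d.%H.%M.%S")
    # Assumed unique identifier: 8 hex characters from a random UUID.
    suffix = uuid.uuid4().hex[:8].upper()
    return f"{prefix}.{suffix}"


earlier = make_row_key(datetime(2009, 1, 1, 0, 0, 0))
later = make_row_key(datetime(2009, 2, 1, 0, 0, 0))
```

Because the timestamp comes first, any two keys from different seconds compare in time order regardless of the random suffix.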

I'm thinking of running periodic MapReduce jobs, each of which will go
through a designated time period. For example, I might want to analyze only
the data between 2009.01 and 2009.02.
I did this previously with HBase; however, I thought Cassandra would be a
better choice for continuously storing data in a safe manner.
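
The time window for such a job could be turned into start and end keys along these lines. This is only a sketch: `key_range_for` and `in_range` are hypothetical helpers, the end bound is treated as exclusive, and it assumes keys sort as plain strings, which in Cassandra would require an order-preserving partitioner rather than the random one.

```python
from typing import Tuple


def key_range_for(start_period: str, end_period: str) -> Tuple[str, str]:
    """Return (start_key, end_key) prefixes covering [start_period, end_period)."""
    # The trailing dot keeps "2009.01" from also matching e.g. "2009.010".
    return start_period + ".", end_period + "."


def in_range(key: str, start_key: str, end_key: str) -> bool:
    """True if key falls in the half-open range [start_key, end_key)."""
    return start_key <= key < end_key


start_key, end_key = key_range_for("2009.01", "2009.02")

# Made-up sample keys in the timestamp.identifier layout.
keys = [
    "2008.12.31.23.59.59.AAAA",
    "2009.01.01.00.00.00.BBBB",
    "2009.01.31.23.59.59.CCCC",
    "2009.02.01.00.00.00.DDDD",
]
selected = [k for k in keys if in_range(k, start_key, end_key)]
```

With these bounds only the two January keys are selected; a job that should also cover February would pass "2009.03" as the end period.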

I hope this briefly explains my intended use case.

Best Regards,
Utku

On Thu, Apr 29, 2010 at 11:32 PM, Jonathan Ellis <jbel...@gmail.com> wrote:

> It's technically possible but 0.6 does not support this, no.
>
> What is the use case you are thinking of?
>
> On Thu, Apr 29, 2010 at 11:14 AM, Utku Can Topçu <u...@topcu.gen.tr>
> wrote:
> > Hi,
> >
> > I've been trying to use Cassandra for some kind of a supplementary input
> > source for Hadoop MapReduce jobs.
> >
> > By default, ColumnFamilyInputFormat does a full column family scan and
> > feeds the result to the MapReduce framework as map input.
> >
> > However, I believe it should be possible to give it a key range so that
> > only the specified range is scanned.
> >
> > Is that possible by any means?
> >
> > Best Regards,
> >
> > Utku
>
> --
> Jonathan Ellis
> Project Chair, Apache Cassandra
> co-founder of Riptano, the source for professional Cassandra support
> http://riptano.com
>
