You may want to look into CQL3 as I hear there may be a way to specify the
query so only those rows are map/reduced.  I am not sure if that is out
yet or not but I remember someone from datastax telling me about it.

Dean

On 12/11/12 9:46 AM, "Ayush V." <ayushv...@gmail.com> wrote:

>I'm working on Cassandra Hadoop intergration (MapReduce). We have used
>Random
>Partioner to insert data to gain faster write. Now we have to read that
>data
>from cassandra in MapReduce and perform some calculation on it.
>
>From the lots of data we have in cassandra we wan't to fetch data only for
>particular ROW-KEYs but we are unable to do it due to RandomPartioner -
>assertion is there in code.
>
>Can anyone please guide me how should I filter data based on RowKey on
>Cassandra level itself (I know data is distributed across regions using
>Hash
>of the RowKey)? 
>
>Does using secondary indexes (still trying to understand how it works)
>will
>solve my problem or is there some other way around?
>
>I will be really appreciated if someone could answer my queries.
>
>Thanks 
>AV
>
>
>
>--
>View this message in context:
>http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Filter-da
>ta-on-row-key-in-Cassandra-Hadoop-s-Random-Partitioner-tp7584212.html
>Sent from the cassandra-u...@incubator.apache.org mailing list archive at
>Nabble.com.

Reply via email to