On Mon, Sep 12, 2011 at 12:35 AM, Tharindu Mathew <mcclou...@gmail.com> wrote:
> Hi,
>
> I plan to do $subject and contribute.
>
> Right now, the hadoop integration splits according to the number of rows in
> a slice predicate. This doesn't scale if a row has a large number of
> columns.
>
> I'd like to know from the cassandra-devs as to how feasible this is?

It's feasible, but not entirely easy.  Essentially you need to page
through the row since you can't know how large it is beforehand.  IIRC
though, this breaks the current input format contract, since an entire
row is expected to be returned.

-Brandon

Reply via email to