Re: Implementing a input format that splits according to column size

Brandon Williams Mon, 12 Sep 2011 06:33:16 -0700

On Mon, Sep 12, 2011 at 12:35 AM, Tharindu Mathew <mcclou...@gmail.com> wrote:
> Hi,
>
> I plan to do $subject and contribute.
>
> Right now, the hadoop integration splits according to the number of rows in
> a slice predicate. This doesn't scale if a row has a large number of
> columns.
>
> I'd like to know from the cassandra-devs as to how feasible this is?


It's feasible, but not entirely easy.  Essentially you need to page
through the row since you can't know how large it is beforehand.  IIRC
though, this breaks the current input format contract, since an entire
row is expected to be returned.

-Brandon
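
For illustration, the paging idea in the reply above can be sketched roughly as
follows. This is not Cassandra's API: the row is modelled as an in-memory sorted
map, and getSlice, pageThrough and RowPager are hypothetical names invented for
this sketch. It assumes a slice call whose start column is inclusive, so every
page after the first asks for one extra column and drops the repeated one.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.NavigableMap;
import java.util.TreeMap;

public class RowPager {
    // Hypothetical stand-in for a slice query: returns up to `count` column
    // names starting at `start` (inclusive), or from the beginning of the row
    // when `start` is null.
    static List<String> getSlice(NavigableMap<String, String> row, String start, int count) {
        NavigableMap<String, String> tail = (start == null) ? row : row.tailMap(start, true);
        List<String> page = new ArrayList<>();
        for (String name : tail.keySet()) {
            if (page.size() == count) {
                break;
            }
            page.add(name);
        }
        return page;
    }

    // Walks the row pageSize columns at a time (pageSize must be >= 1).
    // The row's total width is never needed up front.
    static List<List<String>> pageThrough(NavigableMap<String, String> row, int pageSize) {
        List<List<String>> pages = new ArrayList<>();
        String start = null;
        while (true) {
            // When resuming, request one extra column because `start` is inclusive.
            int want = (start == null) ? pageSize : pageSize + 1;
            List<String> page = getSlice(row, start, want);
            if (start != null && !page.isEmpty()) {
                // Drop the column already returned at the end of the previous page.
                page = page.subList(1, page.size());
            }
            if (page.isEmpty()) {
                break;
            }
            pages.add(new ArrayList<>(page));
            start = page.get(page.size() - 1);
            if (page.size() < pageSize) {
                break; // short page: the row is exhausted
            }
        }
        return pages;
    }
}
```

Each inner list here would correspond to one chunk of a wide row. Handing such
chunks to a mapper separately is exactly the part that conflicts with the
expectation that an entire row is returned at once.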
