Thanx Gary.
I was thinking of using range partitioning for breaking the input.
Say, we could have different threads handling diffierent rages - (A-J)
by thread1, (K-P) by thread2. This way, there won't probably be any
chance of collision. But the thread which actually performs the
distribution cou
On Wed, Oct 27, 2010 at 03:24, Arijit Mukherjee wrote:
> Hi All
>
> I've another related question.
>
> I am using a stream of records of the form (A, B, n) where the pair
> (A,B) can occur multiple times. For example, you could have the
> following rset of records -
>
> A, B, 2
> P, Q, 5
> X, Y, 3
Hi All
I've another related question.
I am using a stream of records of the form (A, B, n) where the pair
(A,B) can occur multiple times. For example, you could have the
following rset of records -
A, B, 2
P, Q, 5
X, Y, 3
A, B, 8
A, B, 2
...
The data store has a set of columns - (key, count, s
On Mon, Oct 11, 2010 at 04:01, Arijit Mukherjee wrote:
> Hi All
>
> I've just started reading about Cassandra and writing simple tests
> using Cassandra 0.6.5 to see if we can use it for our product.
>
> I have a data store with a set of columns, like C1, C2, C3, and C4,
> but the columns aren't m
Just a follow on question to this - would PIG be a good fit for such questions?
Arijit
On 11 October 2010 14:31, Arijit Mukherjee wrote:
> Hi All
>
> I've just started reading about Cassandra and writing simple tests
> using Cassandra 0.6.5 to see if we can use it for our product.
>
> I have a d