Hi Robin,
Im from an analytics background, was working in the traditional BI tools like
OBIEE and Business Objects, so I am very interested in your evaluations of a
good analytics toolset combination.
Do share your learnings,
At a high level as I understand, cassandra can be used as the backend
Appreciate the insightful replies. Understood Sylvain's argument that
having different partitioning locally and globally could create
problem in data movement.
Edward, for a given sstable in a node, why having lexicographically
closer rows clumped together should matter ? Anyways the lookups for
Just to completely eliminate the possibility of the same bug, if you look here:
http://www.mail-archive.com/dev@cassandra.apache.org/msg04992.html
If you create a test keyspace, and look at the timestamp in the
"schema_keyspaces" column family in comparison to your existing keyspace, is
that ti
So, if I have a 6 node cluster in the token ring, A->B->C->D-E->F, replication
factor 3, and I run repair (without -pr) on A, is the flow of information:
A synchronizes information it is responsible for with B and C (because B and C
are replicas of A).
A, as a replica of E and F, synchronizes E
Great job sleuthing.
Originally repair did not have a -pr. When you run the standard repair
the node compares it's data with its neighbours and vice versa. They
also send each other updates. Since you are supposed to repair every
node < gc_grace submitting a full repair to each node would create
d