Hi Peter,
I posted row sizes (min/max/mean) of our largest data set in my original
message, but had zero responses on the mailing list. The folks in IRC
told me to wait it out and see if it rebalanced on its own (it didn't), or
to run a repair on each node one at a time (didn't help), and that it
wasn't a big concern until we had "dozens of GBs" worth of data.
On 01/06/2011 10:08 AM, Peter Schuller wrote:
>> I've been lurking in the #cassandra IRC channel lately looking for help on
>> this, but wanted to try the mailing list as well.
>
> Was this resolved off-list, and if so what was the problem?
>
> I don't see a problem in your description to explain the imbalance,
> assuming you don't have extreme variation in the size of rows (or very
> few rows). I was hoping someone else would spot something but the
> thread seems dead still :)