Hi, I have been testing repair in a few different ways on 0.8.0, and I am curious about what to really expect in terms of data transferred.
I would expect my data to be fairly consistent from the start in this case. There are more than a billion supercolumns, but there were no errors during the feed, and we have seen only minimal amounts of read repair while doing a complete scan of the data for consistency checking. As such, I would also expect repair to finish reasonably fast.

On some nodes it finishes in a couple of hours, but on other nodes it is taking more than 12 hours, and I see some 65 GB of data streamed to the node. That surprises me, as I am pretty sure the data is not that far out of sync. I am not sure how much the Merkle trees are actually reducing what needs to be streamed, though.

What should we expect to see if this works?

Regards,
Terje