I have 4 nodes at my disposal. I can configure them like this:
1) RF=1, each node has 25% of the data. For random reads, how big is the performance penalty when a node has to fetch data from another node that holds the replica?
2) RF=2, each node has 50% of the data. Same question?

-- 
Regards,
Oleg Dulin
NYC Java Big Data Engineer
http://www.olegdulin.com/