Just found a paper from Google in this year's SIGMOD accepted papers about
joining continuous data streams:
Photon: Fault-tolerant and scalable joining of continuous data streams.
[slides: http://cloud.berkeley.edu/data/photon.pdf].
It uses sharding + Paxos to ensure scalability and exactly-onc
[
https://issues.apache.org/jira/browse/KAFKA-156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13631474#comment-13631474
]
Scott Carey commented on KAFKA-156:
---
Jay --
I agree, the duplication issue does not depe
Thanks for the link. This paper provides an alternative, but similar
implementation to that in Zookeeper. The key difference seems to be that
the former supports membership reconfiguration.
Kafka replication is simpler because it separates the leader election part
from log replication. Such separa