Hi,
I am designing a patch on top of the 0.8 code base.
The patch would provide persistence on the producer side, meaning that
messages passed to the producer are persisted rather than kept
transiently in memory. That way, if the broker(s) cannot be reached,
messages can accumulate and will be sent through to the broker(s) once
they are available again. Although this would be somewhat superfluous
under the new replication paradigm of 0.8, it's still possible to have
failures that disconnect a producer from the entire set of brokers; in
that case, this patch-under-design would prevent data loss, making the
pipeline even more secure and relieving producers of the need to handle
persistence on their own. The plan is to use the Kafka Log component
for that, with the behavior being completely optional through a
configuration option.
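For example, the opt-in might look something like this in the producer
properties (the property names below are only placeholders I made up,
not settings that exist in 0.8):

  # hypothetical properties, names illustrative only
  producer.persistence.enable=true
  producer.persistence.dir=/var/kafka/producer-buffer
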
At a slightly deeper level, the design is to use one Kafka Log per
topic & partition (otherwise, given the existing 0.8 code in and around
/producer.async.DefaultEventHandler.dispatchSerializedData/, it would
seem resource intensive to keep track of sent vs. failed messages in
order to manage resending). When the Log is used this way, the behavior
that keeps replica sets in sync would either be skipped through the
choice of parameters, or would be made parameterizable so it can be
fully neutralized with respect to the producer's own logging.
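To make the shape of this a bit more concrete, here is a very rough
sketch of the structure I have in mind, using a plain file in place of
kafka.log.Log so that the example stays self-contained; all the names
(PartitionBuffer, ProducerPersistence, etc.) are only illustrative and
not taken from the 0.8 tree:

import java.io.{File, RandomAccessFile}
import scala.collection.mutable

// One on-disk buffer file per (topic, partition).  Messages are
// appended as [length][payload] records; an "acked" offset marks how
// far the brokers have confirmed, so a resend is just "replay every
// record past the acked offset".  The real patch would use
// kafka.log.Log instead of the raw RandomAccessFile shown here.
class PartitionBuffer(file: File) {
  private val raf = new RandomAccessFile(file, "rw")
  private var ackedOffset: Long = 0L   // confirmed by the brokers
  raf.seek(raf.length())               // always append at the end

  def append(payload: Array[Byte]): Unit = synchronized {
    raf.writeInt(payload.length)
    raf.write(payload)
  }

  // Read back all records not yet confirmed, for resending after an
  // outage.
  def unacked(): Seq[Array[Byte]] = synchronized {
    val out = mutable.ArrayBuffer[Array[Byte]]()
    raf.seek(ackedOffset)
    while (raf.getFilePointer < raf.length()) {
      val buf = new Array[Byte](raf.readInt())
      raf.readFully(buf)
      out += buf
    }
    raf.seek(raf.length())
    out.toSeq
  }

  // Called once the brokers have confirmed everything appended so far.
  def markAcked(): Unit = synchronized { ackedOffset = raf.length() }
}

// One buffer per topic & partition, mirroring how
// dispatchSerializedData already groups outgoing data.
class ProducerPersistence(baseDir: File) {
  baseDir.mkdirs()
  private val buffers = mutable.Map[(String, Int), PartitionBuffer]()

  def bufferFor(topic: String, partition: Int): PartitionBuffer =
    synchronized {
      buffers.getOrElseUpdate((topic, partition),
        new PartitionBuffer(new File(baseDir, topic + "-" + partition)))
    }
}

The point of keeping one buffer per topic & partition is that the
failed set reported by dispatchSerializedData maps directly onto "the
tail of that partition's buffer", so no per-message bookkeeping is
needed.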
Now of course, given that 0.8 seems to be far along its runway, I
assume this should go on top of trunk, which I'd like to confirm with
you is where post-0.8 development lives.
I'd appreciate your comments...
Thanks,
Matan