Timur,

there is not API to pause/resume partitions in Streams, because Streams
handles/manages its internal consumer by itself.

The "batch processing KIP" is currently delayed -- but I am sure we will
pick it up again. Hopefully after 0.11 got released.

> So we are considering to just pause specific
>> topic partitions as soon as we arrive to stop offsets for them.

I am just wondering how you would do this in a fault-tolerant way (if
you would have pause API)?

>> 2) Assume we process multiple topics in some parallel way and want to pause
>> some topics while waiting for other topics to catch up.

Streams synchronizes topics on time automatically for your. So I am
wondering why this does not work for you?


-Matthias


On 4/27/17 8:52 AM, Timur Yusupov wrote:
> I see it is possible to pause specific topic partition consumption when
> using KafkaConsumer directly, but looks like it is not possible when using
> KafkaStreams.
> 
> There are following use cases for that:
> 1) Doing batch processing using Kafka Streams (I found
> https://cwiki.apache.org/confluence/display/KAFKA/KIP-95%3A+Incremental+Batch+Processing+for+Kafka+Streams
> proposal for Kafka Streams, but according to
> https://issues.apache.org/jira/browse/KAFKA-4437 there is no active
> development on that side). So we are considering to just pause specific
> topic partitions as soon as we arrive to stop offsets for them.
> 
> 2) Assume we process multiple topics in some parallel way and want to pause
> some topics while waiting for other topics to catch up.
> 
> Actually, the first use case is more important for us, so would be good to
> know if there is a possibility or some improvements are already planned for
> allowing to pause specific topic partition consumption in KafkaStream.
> 

Attachment: signature.asc
Description: OpenPGP digital signature

Reply via email to