Your observation is correct.

The paragraph you quote is not very precise but also not necessarily
wrong. The example is simplified and assumes that there is no
re-partitioning even if it is not mentioned explicitly.


-Matthias


On 6/20/17 9:32 AM, Paolo Patierno wrote:
> Hi devs,
> 
> 
> at following documentation page (by Confluent) I read 
> (http://docs.confluent.io/current/streams/architecture.html#stream-partitions-and-tasks)
>  ...
> 
> 
> "the maximum parallelism at which your application may run is bounded by the 
> maximum number of stream tasks, which itself is determined by maximum number 
> of partitions of the input topic(s) the application is reading from. For 
> example, if your input topic has 5 partitions, then you can run up to 5 
> applications instances"
> 
> but it seems not so true ... I mean ...
> The number of the application instances depends on the possibility that we 
> have "internal" repartition topic in our processor topology.
> I tried the WordCountDemo starting from a topic with 2 partitions. In this 
> case I'm able to run up to 4 application instances while the 5th stays idle.
> It's possible because due to the map() in the example we have repartitioning 
> (so 1 repartition topic with 2 partitions) ... it means 4 tasks for the total 
> 4 partitions (2 for the input topic, 2 for the repartition topic) ... and 
> this tasks can run even one for each application instance.
> Following the above mentioned doc part the maximum should be just 2 (not 4).
> 
> Do you confirm this ?
> 
> Thanks,
> Paolo
> 
> 
> Paolo Patierno
> Senior Software Engineer (IoT) @ Red Hat
> Microsoft MVP on Windows Embedded & IoT
> Microsoft Azure Advisor
> 
> Twitter : @ppatierno<http://twitter.com/ppatierno>
> Linkedin : paolopatierno<http://it.linkedin.com/in/paolopatierno>
> Blog : DevExperience<http://paolopatierno.wordpress.com/>
> 

Attachment: signature.asc
Description: OpenPGP digital signature

Reply via email to