Hello Xin,

The ZK write performance, especially latency, depends on the underlying
hardware. AFAIK some organizations use SSD for their ZK clusters so that
the latency is less than 1ms.

There are some more discussions on how many partitions one should choose in
practice and what are ZK's impact on it:

http://www.confluent.io/blog/how-to-choose-the-number-of-topicspartitions-in-a-kafka-cluster/


Guozhang


On Tue, Aug 16, 2016 at 10:18 AM, Xin Jin <xin...@cs.berkeley.edu> wrote:

> Hi,
>
> I'm working on streaming systems in AMPLab at UC Berkeley. This article (
> https://cwiki.apache.org/confluence/display/KAFKA/Committing
> +and+fetching+consumer+offsets+in+Kafka)
> mentioned the ZooKeeper performance issue when consumers store offsets in
> ZooKeeper.
>
> "In Kafka releases through 0.8.1.1, consumers commit their offsets to
> ZooKeeper. ZooKeeper does not scale extremely well (especially for writes)
> when there are a large number of offsets (i.e., consumer-count *
> partition-count)."
>
> Can anyone tell me in production scenarios, how many consumers and
> partitions do you have? How much write (offset update) traffic do you
> generate that ZooKeeper cannot handle?
>
> Thank you very much!
> Xin
>



-- 
-- Guozhang

Reply via email to