Hi,

Please help/guide me to identify the size of the cluster.

Thanks and Regards,
Gowtham S.


On Fri, 24 Jan 2020 at 14:31, Gowtham S <gowtham.co....@gmail.com> wrote:

> Hi,
>
> We are in the process of deploying Kafka in our service. We need to decide
> the machine capacity plan, we arrived at the below formulae for deriving
> total machine capacity.
>
> Total Broker machine size  = Message size per second *  Retention period *
> Replication Factor
>
> Am I need to consider the topic, index files in the calculation? Please
> help/guide me if i missing any param required in the formulae.
>
> Index file calculation (Reference
> <https://issues.apache.org/jira/browse/KAFKA-3300>)
>
> Currently, the initial/max size of offset index file is configured by 
> log.index.max.bytes.
> This will be the offset index file size for the active log segment until it
> rolls out.
>
> Theoretically, we can calculate the upper bound of offset index size using
> the following formula:
>
> log.segment.bytes / index.interval.bytes * 8
>
> With the default setting the bytes needed for an offset index size is 1GB
> / 4K * 8 = 2MB. And the default log.index.max.bytes is 10MB.
>
>
> Retention Period = Retention period + (log.retention.check.interval.ms +
> log.segment.delete.delay.ms ) / 1000
>                          = 86400 + (30000 + 6000)/1000
>                          = 86400 + 360
>                          = 86760 Seconds
>
> Total Broker machine size  = Message size per second *  Retention period *
> Replication Factor
>                                        = 90 MB/Sec * 86760 * 3
>                                        = 23425200 MB
>                                        = 23.4252 Tb
>
> *Machine Configuration*
> 6 Brokers with 3 ZK
>
> *Kafka (per machine)*
> Disk Space - 2 * 2TB
> RAM - 128 GB
> CPU - 40 core
>
> Thanks and Regards
> Gowtham.S
>

Reply via email to