"Thirding" the use of prometheus, recommend reading this blog post as well: https://www.robustperception.io/monitoring-kafka-with-prometheus
On Wed, Apr 25, 2018 at 11:39 PM, Jonathan Bethune <jonathan.beth...@instaclustr.com> wrote:

> I also recommend Prometheus. It works great with JMX for Kafka or any
> Java service.
>
> Datadog is also fine if you want something simple and don't mind
> spending money. It is easy to integrate with alerting systems and has
> great visualization.
>
> They have a good blog post about integrating with Kafka and ZooKeeper:
> https://www.datadoghq.com/blog/monitoring-kafka-performance-metrics/
>
> On 26 April 2018 at 08:02, Stanislav Antic <stanislav.an...@gmail.com> wrote:
>
> > I recommend setting up Prometheus, which has a ZooKeeper exporter (a
> > helper program that exposes metrics from outside software), and you
> > can use the JMX exporter with Kafka.
> >
> > You can find example configs in their repo, which is pretty good:
> > https://github.com/prometheus/jmx_exporter/tree/master/example_configs
> >
> > There are also ready-made Grafana dashboards:
> > https://grafana.com/dashboards
> >
> > On Thu, Apr 26, 2018 at 12:05 AM, Arunkumar <pm_arunku...@yahoo.com.invalid> wrote:
> >
> > > Hi All,
> > >
> > > I am working on setting up monitoring and alerting for our
> > > production cluster. As of now we have a cluster of 3 ZooKeeper
> > > nodes and 3 Kafka brokers, which will expand later.
> > >
> > > We are planning for basic metrics (the important ones) on which we
> > > need to alert. We are in the process of developing an alerting
> > > system for our cluster. I googled but did not find as much detail
> > > as needed.
> > >
> > > I see many say the following ones are important, and I understand
> > > these are aggregate metrics across the cluster (I may not be right):
> > >
> > > kafka.server:type=ReplicaManager,name=UnderReplicatedPartitions
> > > kafka.controller:type=KafkaController,name=OfflinePartitionsCount
> > > kafka.controller:type=KafkaController,name=ActiveControllerCount
> > >
> > > But I am looking for broker, topic, etc. metrics to alert on.
> > > Can anyone provide info or point to any docs on this? It would be
> > > of great help. Thanks in advance.
> > >
> > > Thanks,
> > > Arunkumar Pichaimuthu, PMP
> >
> > --
> > Stanislav Antic
>
> --
> *Jonathan Bethune - Senior Consultant*
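
To make the alerting side of the original question a bit more concrete, here is a minimal Python sketch (not taken from the blog post or from anyone in this thread) that evaluates the three MBeans quoted above once they have been scraped from a JMX exporter endpoint on each broker. The Prometheus-style metric names are assumptions: the actual names depend on the rules in your JMX exporter config. The function names and broker labels are hypothetical too. In a real setup you would more likely express these checks as Prometheus alerting rules; this only illustrates the logic.

```python
from typing import Dict, List

# Assumed metric names: roughly what a JMX exporter rule set with lowercased
# output might produce for the three MBeans quoted in the thread.
# Adjust them to whatever your own exporter config emits.
UNDER_REPLICATED = "kafka_server_replicamanager_underreplicatedpartitions"
OFFLINE_PARTITIONS = "kafka_controller_kafkacontroller_offlinepartitionscount"
ACTIVE_CONTROLLER = "kafka_controller_kafkacontroller_activecontrollercount"


def parse_metrics(text: str) -> Dict[str, float]:
    """Parse Prometheus text exposition lines into {metric_name: value}.

    Labels are dropped and samples sharing a name are summed. This assumes
    each sample line ends with the value (no trailing timestamp).
    """
    metrics: Dict[str, float] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        name_part, _, value_part = line.rpartition(" ")
        name = name_part.split("{", 1)[0].strip()
        if not name:
            continue  # not a "name value" sample line
        try:
            value = float(value_part)
        except ValueError:
            continue  # ignore lines whose last field is not a number
        metrics[name] = metrics.get(name, 0.0) + value
    return metrics


def check_alerts(scrapes: Dict[str, Dict[str, float]]) -> List[str]:
    """Return alert messages given one parsed scrape per broker."""
    alerts: List[str] = []
    for broker, m in sorted(scrapes.items()):
        # Any non-zero value here means some replicas are not in sync.
        if m.get(UNDER_REPLICATED, 0.0) > 0:
            alerts.append(f"{broker}: under-replicated partitions")
        # Offline partitions have no active leader and cannot be used.
        if m.get(OFFLINE_PARTITIONS, 0.0) > 0:
            alerts.append(f"{broker}: offline partitions")
    # Exactly one broker in the cluster should report itself as controller.
    controllers = sum(m.get(ACTIVE_CONTROLLER, 0.0) for m in scrapes.values())
    if controllers != 1:
        alerts.append(
            f"cluster: expected 1 active controller, found {controllers:g}"
        )
    return alerts


if __name__ == "__main__":
    # Hypothetical scrape bodies for a three-broker cluster.
    scrapes = {
        "broker-1": parse_metrics(f"{UNDER_REPLICATED} 0.0\n{ACTIVE_CONTROLLER} 1.0\n"),
        "broker-2": parse_metrics(f"{UNDER_REPLICATED} 3.0\n{ACTIVE_CONTROLLER} 0.0\n"),
        "broker-3": parse_metrics(f"{UNDER_REPLICATED} 0.0\n{ACTIVE_CONTROLLER} 0.0\n"),
    }
    for message in check_alerts(scrapes):
        print(message)
```

Note that the first two checks are evaluated per broker, while the controller check only makes sense summed across the whole cluster, which is why the sketch takes one scrape per broker rather than a single merged one.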