Samza has pretty good support for collectng metrics:
https://samza.apache.org/learn/documentation/0.10/container/metrics.html

With these metrics logged in Kafka, you can simply consume from this stream
for monitoring. In our case, we pipe these metrics into OpenTSDB for
visualization.

For example, you can monitor the rate of processing by a specific container
(msgs/sec) by looking at SamzaContainerMetrics.process-envelopes. Consumer
lag is also an important metric, which is
named kafka-*-*-messages-behind-high-watermark.

You will need to take a peek at the published JSON to get a sense of what
metrics are reported.

Thanks,
David



On Mon, Aug 8, 2016 at 12:50 AM 李斯宁 <lisin...@gmail.com> wrote:

> Hi guys,
> Is there a simpler way for monitoring: "Is every task in my samza jobs are
> normally running and making progress?"
>
> Currently, the samza job's Application Master's web do not show the
> information in a easy-to-discover way.  If you want information of each
> task, you need to click to view every container's log or connect to every
> container's jmxport to get information.
>
> I already found that we can do some monitoring from 'coordinate' and
> 'checkpoint' streams, but
> is there a tool available for such monitoring requirement? I think it is a
> very common need.
> --
> 李斯宁
>

Reply via email to