Hi Tom,

That is one of the first things that i checked. Active memory never goes
above 50% of overall available. File cache uses the rest of the memory but
i do not think that causes OOM killer.
Either way there is no entries in /var/log/messages (centos) to show OOM is
happening.

Thanks

On Thu, Jun 2, 2016 at 5:36 AM, Tom Crayford <tcrayf...@heroku.com> wrote:

> That looks like somebody is killing the process. I'd suspect either the
> linux OOM killer or something else automatically killing the JVM for some
> reason.
>
> For the OOM killer, assuming you're on ubuntu, it's pretty easy to find in
> /var/log/syslog (depending on your setup). I don't know about other
> operating systems.
>
> On Thu, Jun 2, 2016 at 5:54 AM, allen chan <allen.michael.c...@gmail.com>
> wrote:
>
> > I have an issue where my brokers would randomly shut itself down.
> > I turned on debug in log4j.properties but still do not see a reason why
> the
> > shutdown is happening.
> >
> > Anyone seen this behavior before?
> >
> > version 0.10.0
> > log4j.properties
> >     log4j.rootLogger=DEBUG, kafkaAppender
> > * I tried TRACE level but i do not see any additional log messages
> >
> > snippet of log around shutdown
> > [2016-06-01 15:11:51,374] DEBUG Got ping response for sessionid:
> > 0x2550a693b470001 after 1ms (org.apache.zookeeper.ClientCnxn)
> > [2016-06-01 15:11:53,376] DEBUG Got ping response for sessionid:
> > 0x2550a693b470001 after 1ms (org.apache.zookeeper.ClientCnxn)
> > [2016-06-01 15:11:55,377] DEBUG Got ping response for sessionid:
> > 0x2550a693b470001 after 1ms (org.apache.zookeeper.ClientCnxn)
> > [2016-06-01 15:11:57,380] DEBUG Got ping response for sessionid:
> > 0x2550a693b470001 after 1ms (org.apache.zookeeper.ClientCnxn)
> > [2016-06-01 15:11:59,383] DEBUG Got ping response for sessionid:
> > 0x2550a693b470001 after 1ms (org.apache.zookeeper.ClientCnxn)
> > [2016-06-01 15:12:01,386] DEBUG Got ping response for sessionid:
> > 0x2550a693b470001 after 1ms (org.apache.zookeeper.ClientCnxn)
> > [2016-06-01 15:12:03,389] DEBUG Got ping response for sessionid:
> > 0x2550a693b470001 after 1ms (org.apache.zookeeper.ClientCnxn)
> > [2016-06-01 15:12:04,121] INFO [Group Metadata Manager on Broker 2]:
> > Removed 0 expired offsets in 0 milliseconds.
> > (kafka.coordinator.GroupMetadataManager)
> > [2016-06-01 15:12:04,121] INFO [Group Metadata Manager on Broker 2]:
> > Removed 0 expired offsets in 0 milliseconds.
> > (kafka.coordinator.GroupMetadataManager)
> > [2016-06-01 15:12:05,390] DEBUG Got ping response for sessionid:
> > 0x2550a693b470001 after 1ms (org.apache.zookeeper.ClientCnxn)
> > [2016-06-01 15:12:07,393] DEBUG Got ping response for sessionid:
> > 0x2550a693b470001 after 1ms (org.apache.zookeeper.ClientCnxn)
> > [2016-06-01 15:12:09,396] DEBUG Got ping response for sessionid:
> > 0x2550a693b470001 after 1ms (org.apache.zookeeper.ClientCnxn)
> > [2016-06-01 15:12:11,399] DEBUG Got ping response for sessionid:
> > 0x2550a693b470001 after 1ms (org.apache.zookeeper.ClientCnxn)
> > [2016-06-01 15:12:13,334] INFO [Kafka Server 2], shutting down
> > (kafka.server.KafkaServer)
> > [2016-06-01 15:12:13,334] INFO [Kafka Server 2], shutting down
> > (kafka.server.KafkaServer)
> > [2016-06-01 15:12:13,336] INFO [Kafka Server 2], Starting controlled
> > shutdown (kafka.server.KafkaServer)
> > [2016-06-01 15:12:13,336] INFO [Kafka Server 2], Starting controlled
> > shutdown (kafka.server.KafkaServer)
> > [2016-06-01 15:12:13,338] DEBUG Added sensor with name
> connections-closed:
> > (org.apache.kafka.common.metrics.Metrics)
> > [2016-06-01 15:12:13,338] DEBUG Added sensor with name
> connections-created:
> > (org.apache.kafka.common.metrics.Metrics)
> > [2016-06-01 15:12:13,338] DEBUG Added sensor with name
> bytes-sent-received:
> > (org.apache.kafka.common.metrics.Metrics)
> > [2016-06-01 15:12:13,338] DEBUG Added sensor with name bytes-sent:
> > (org.apache.kafka.common.metrics.Metrics)
> > [2016-06-01 15:12:13,339] DEBUG Added sensor with name bytes-received:
> > (org.apache.kafka.common.metrics.Metrics)
> > [2016-06-01 15:12:13,339] DEBUG Added sensor with name select-time:
> > (org.apache.kafka.common.metrics.Metrics)
> >
> > --
> > Allen Michael Chan
> >
>



-- 
Allen Michael Chan

Reply via email to