Re: Brokers cashing with OOME Map failed

2016-12-16 Thread Zakee

Brokers failed repeatedly, leaving behind page cache in memory, which caused broker restarts to fail with OOM every time. After manually cleaning up the page cache, I was able to restart the broker. However, I am still wondering what could have caused this state in the first place. Any ideas? -Zakee

Re: Brokers cashing with OOME Map failed

2016-12-15 Thread Ismael Juma

Hi, this is probably not a Kafka bug, but we should improve the information we report in this case. Something along the lines of what Lucene did here: https://issues.apache.org/jira/browse/LUCENE-5673. This error may be caused by lack of enough unfragmented virtual address space or too restrictiv
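The snippet above is cut off, so the exact limits Ismael had in mind are not visible here. As a rough illustration only, and assuming a Linux host, one limit often checked when a memory-mapping call fails is the per-process mapping count, `vm.max_map_count`. The sketch below counts the entries in a `/proc/<pid>/maps` listing and compares them with that limit. The function names are hypothetical and are not part of Kafka or Lucene.

```python
from pathlib import Path


def count_mappings(maps_text: str) -> int:
    """Count memory mappings in the text of /proc/<pid>/maps (one per non-empty line)."""
    return sum(1 for line in maps_text.splitlines() if line.strip())


def mapping_headroom(maps_text: str, max_map_count: int) -> int:
    """Return how many more mappings fit before max_map_count is reached."""
    return max_map_count - count_mappings(maps_text)


def read_linux_mapping_state(pid: int) -> tuple[int, int]:
    """Read (current mappings, vm.max_map_count) for a process. Linux only."""
    maps_text = Path(f"/proc/{pid}/maps").read_text()
    max_map_count = int(Path("/proc/sys/vm/max_map_count").read_text())
    return count_mappings(maps_text), max_map_count


if __name__ == "__main__":
    # Made-up sample in the /proc/<pid>/maps format, for illustration only.
    sample = (
        "00400000-0040b000 r-xp 00000000 08:01 131 /usr/bin/java\n"
        "7f2c00000000-7f2c00021000 rw-p 00000000 00:00 0\n"
    )
    print(count_mappings(sample), mapping_headroom(sample, 65530))
```

On a broker host, `read_linux_mapping_state(pid)` could be called with the broker's process ID; a count close to the limit would point at the mapping limit rather than at heap size. This is one possible check, not a diagnosis of the failure reported in this thread.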

Re: Brokers cashing with OOME Map failed

2016-12-14 Thread Gwen Shapira

Did you recently add topics or partitions? Each partition takes a memory buffer for replication, so you can sometimes get an OOME by adding partitions without sizing memory accordingly. You basically need the Java heap size to be larger than (number of partitions on the broker) × replica.fetch.size.

Gwen

On Wed, Dec 14, 2016
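The rule of thumb in Gwen's reply can be turned into a quick arithmetic check. The sketch below is a minimal illustration: the function names and the example numbers are hypothetical, and it assumes that "replica.fetch.size" refers to the broker's replica fetch size in bytes (presumably the `replica.fetch.max.bytes` setting). It gives only the lower bound described in the message, not a full heap-sizing method.

```python
MIB = 1024 * 1024


def min_heap_bytes_for_replication(partitions: int, replica_fetch_bytes: int) -> int:
    """Lower bound from the rule of thumb: partitions on the broker x fetch size."""
    if partitions < 0 or replica_fetch_bytes < 0:
        raise ValueError("partitions and replica_fetch_bytes must be non-negative")
    return partitions * replica_fetch_bytes


def heap_is_sufficient(heap_bytes: int, partitions: int, replica_fetch_bytes: int) -> bool:
    """True if the heap is strictly larger than the replication-buffer lower bound."""
    return heap_bytes > min_heap_bytes_for_replication(partitions, replica_fetch_bytes)


if __name__ == "__main__":
    # Hypothetical broker: 4 GiB heap, 1 MiB replica fetch size.
    heap = 4 * 1024 * MIB
    for partitions in (2000, 6000):
        needed = min_heap_bytes_for_replication(partitions, MIB)
        ok = heap_is_sufficient(heap, partitions, MIB)
        print(f"{partitions} partitions: need > {needed // MIB} MiB, sufficient={ok}")
```

With these made-up numbers, 2000 partitions need more than 2000 MiB and fit in a 4 GiB heap, while 6000 partitions need more than 6000 MiB and do not. This matches the scenario in the reply, where adding partitions without resizing the heap leads to an OOME.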