Hi Hemant, Have you tried running a new Flink version? Can you create a heap dump when the process fails? This could help us digging into whether there is some memory leak.
Cheers, Till On Tue, Feb 16, 2021 at 5:21 PM bat man <tintin0...@gmail.com> wrote: > Hi there, > > I am facing *java.lang.OutOfMemoryError: GC overhead limit exceeded *when > using prometheus exporter with* Flink 1.9 *on *AWS EMR *emr-5.28.1. I > have other jobs which run fine. tihs specific job fails with the below > error stack. > > Exception in thread "pool-3-thread-2" java.lang.OutOfMemoryError: GC > overhead limit exceeded > at sun.nio.cs.StreamEncoder.write(StreamEncoder.java:133) > at java.io.OutputStreamWriter.write(OutputStreamWriter.java:220) > at java.io.Writer.write(Writer.java:157) > at > org.apache.flink.shaded.io.prometheus.client.exporter.common.TextFormat.write004(TextFormat.java:40) > at > org.apache.flink.shaded.io.prometheus.client.exporter.HTTPServer$HTTPMetricHandler.handle(HTTPServer.java:59) > at com.sun.net.httpserver.Filter$Chain.doFilter(Filter.java:79) > at sun.net.httpserver.AuthFilter.doFilter(AuthFilter.java:83) > at com.sun.net.httpserver.Filter$Chain.doFilter(Filter.java:82) > at > sun.net.httpserver.ServerImpl$Exchange$LinkHandler.handle(ServerImpl.java:675) > at com.sun.net.httpserver.Filter$Chain.doFilter(Filter.java:79) > at sun.net.httpserver.ServerImpl$Exchange.run(ServerImpl.java:647) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) > at java.lang.Thread.run(Thread.java:748) > > Thanks, > Hemant >