[ 
https://issues.apache.org/jira/browse/FLINK-26335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17610148#comment-17610148
 ] 

Austin Cawley-Edwards commented on FLINK-26335:
-----------------------------------------------

 Hey [~mlnogueira], thanks for the report. Do you know roughly how many metrics 
your job was trying to export (# of labels, # of custom metrics, total size 
(hundreds of mbs? gbs?)) when this happened? Were you able to resolve it by 
giving the job more resources?

> prometheus/metrics - java.lang.OutOfMemoryError: Java heap space
> ----------------------------------------------------------------
>
>                 Key: FLINK-26335
>                 URL: https://issues.apache.org/jira/browse/FLINK-26335
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Metrics
>    Affects Versions: 1.14.3
>            Reporter: Marcelo Nogueira
>            Priority: Major
>
>  
> {code:java}
> 2022-02-22 14:39:33.782 INFO 
> [org.apache.flink.streaming.runtime.operators.sink.AbstractStreamingCommitterHandler]
>  - Committing the state for checkpoint 4
> 2022-02-22 14:39:47.367 ERROR 
> [org.apache.flink.runtime.util.ClusterUncaughtExceptionHandler] - WARNING: 
> Thread 'prometheus-http-1-6' produced an uncaught exception. If you want to 
> fail on uncaught exceptions, then configure 
> cluster.uncaught-exception-handling accordingly
> java.lang.OutOfMemoryError: Java heap space
>     at java.util.Arrays.copyOf(Unknown Source) ~[?:?]
>     at java.io.ByteArrayOutputStream.grow(Unknown Source) ~[?:?]
>     at java.io.ByteArrayOutputStream.ensureCapacity(Unknown Source) ~[?:?]
>     at java.io.ByteArrayOutputStream.write(Unknown Source) ~[?:?]
>     at sun.nio.cs.StreamEncoder.writeBytes(Unknown Source) ~[?:?]
>     at sun.nio.cs.StreamEncoder.implWrite(Unknown Source) ~[?:?]
>     at sun.nio.cs.StreamEncoder.implWrite(Unknown Source) ~[?:?]
>     at sun.nio.cs.StreamEncoder.write(Unknown Source) ~[?:?]
>     at sun.nio.cs.StreamEncoder.write(Unknown Source) ~[?:?]
>     at java.io.OutputStreamWriter.write(Unknown Source) ~[?:?]
>     at java.io.Writer.append(Unknown Source) ~[?:?]
>     at 
> io.prometheus.client.exporter.common.TextFormat.writeEscapedLabelValue(TextFormat.java:88)
>  ~[flink-metrics-prometheus-1.14.3.jar:1.14.3]
>     at 
> io.prometheus.client.exporter.common.TextFormat.write004(TextFormat.java:42) 
> ~[flink-metrics-prometheus-1.14.3.jar:1.14.3]
>     at 
> io.prometheus.client.exporter.HTTPServer$HTTPMetricHandler.handle(HTTPServer.java:60)
>  ~[flink-metrics-prometheus-1.14.3.jar:1.14.3]
>     at com.sun.net.httpserver.Filter$Chain.doFilter(Unknown Source) 
> ~[jdk.httpserver:?]
>     at sun.net.httpserver.AuthFilter.doFilter(Unknown Source) 
> ~[jdk.httpserver:?]
>     at com.sun.net.httpserver.Filter$Chain.doFilter(Unknown Source) 
> ~[jdk.httpserver:?]
>     at sun.net.httpserver.ServerImpl$Exchange$LinkHandler.handle(Unknown 
> Source) ~[jdk.httpserver:?]
>     at com.sun.net.httpserver.Filter$Chain.doFilter(Unknown Source) 
> ~[jdk.httpserver:?]
>     at sun.net.httpserver.ServerImpl$Exchange.run(Unknown Source) 
> ~[jdk.httpserver:?]
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source) 
> ~[?:?]
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) 
> ~[?:?]
>     at java.lang.Thread.run(Unknown Source) [?:?]
> 2022-02-22 14:40:32.668 INFO 
> [org.apache.flink.connector.kafka.sink.FlinkKafkaInternalProducer] - Flushing 
> new partitions// code placeholder
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to