Hi Mark, Thanks a lot for your message and the good investigation! I believe you've found a bug in Flink. I filed an issue for the problem: https://issues.apache.org/jira/browse/FLINK-19064.
Would you be interested in opening a pull request to fix this? Otherwise, I'm sure a committer will pick up the issue soon. I'm not aware of a simple workaround for the problem. Best, Robert On Wed, Aug 26, 2020 at 4:05 PM Mark Davis <moda...@protonmail.com> wrote: > Hi, > > I am trying to investigate a problem with non-released resources in my > application. > > I have a stateful application which submits Flink DataSet jobs using code > very similar to the code in CliFrontend. > I noticed what I am getting a lot of non-closed connections to my data > store (HBase in my case). The connections are held by the application not > the jobs themselves. > > I am using HBaseRowDataInputFormat and it seems that HBase connections > opened in the configure() method during the job graph creation(before the > jobs is executed) are not closed. My search lead me to the method > DataSourceNode.computeOperatorSpecificDefaultEstimates(DataStatistics) > where I see that a format is not closed after being configured. > > Is that correct? How can I overcome this issue? > > My application is long running that is probably why I observe the resource > leak. Would I spawn a new JVM to run jobs this problem would not be > noticeable. > > Thank you! > > Cheers, > Marc >