Hi Matt... Thanks for the heads up. I can surely help in getting it fixed. I can help with the Jira and if you want to contribute you can go ahead with the PR. I and others in the community can help with the review of the same. Please feel free to loop me in.
Thanks again for reporting this to us. - Anuj Modi On Wed, Jul 9, 2025 at 9:51 AM Matt <m...@fastmail.to> wrote: > CC'ing common-dev as that list has more activity > > On 7/7/25 20:39, Matt wrote: > > Hi Hadoop team! > > > > I'm not sure where to report this and the Jira board does not allow > > for public sign up so I figured I'd start here. I found a thread leak > > in the ABFS driver that causes OutOfMemoryErrors in Hive Metastore > > environments -- specifically in this part of the code base: > > > > > https://github.com/apache/hadoop/blob/f099f08d922689dd2bd641bbbbd7c29c451463df/hadoop-tools/hadoop-azure/src/main/java/org/apache/hadoop/fs/azurebfs/services/AbfsClientThrottlingAnalyzer.java#L127 > > > > > > What seems to be the issue is that the timer tasks are cleaned up but > > the timer threads themselves are never actually cleaned up. This will > > eventually lead to an OOM since nothing is collecting these. I was > > able to reproduce this locally in 3.3.6 and 3.4.1 but I believe that > > it would affect any version that relies on autothrottling for ABFS. > > > > I was also able to make a quick fix as well as confirm a workaround -- > > the long term fix would be to include `timer.cancel()` and > > `timer.purge()` in a method for AbfsClientThrottlingAnalyzer.java. The > > short term workaround is to disable autothrottling and rely on Azure > > to throttle the connections as needed with the below configuration. > > > > ``` > > > > <property> > > <name>fs.azure.enable.autothrottling</name> > > <value>false</value> > > </property> > > > > ``` > > > > I'm happy to share my fix and test results but I'm not quite sure who > > to share them with -- any direction is greatly appreciated! > > > > Thank you, > > Matt > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org > For additional commands, e-mail: common-dev-h...@hadoop.apache.org > >