suheng.cloud created HUDI-2960:
----------------------------------

             Summary: create hudi table may cause memory leak in spark thrift 
server
                 Key: HUDI-2960
                 URL: https://issues.apache.org/jira/browse/HUDI-2960
             Project: Apache Hudi
          Issue Type: Bug
          Components: Spark Integration
    Affects Versions: 0.10.0
            Reporter: suheng.cloud


Hi, community

I currently try to use spark-hudi integration in spark-thrift-server, and after 
test create hudi table for a while, I found it would finally result in 
META-SPACE OOM(in my case, jvm option -XX:MaxMetaspaceSize=256m assigned).

After track the source, I found that every time a CreateHoodieTableCommand 
performed, `HiveClientUtils.newClientForMetadata` will be invoked, thus a 
IsolatedClientLoader will be created, in my scene, the OOM will occured after 
about 10 create statement executed.

Why not use 
`sessionState.catalog.externalCatalog.asInstanceOf[ExternalCatalogWithListener].unwrapped.asInstanceOf[HiveExternalCatalog].client
 ` instead ? Does it has anything side effect?

env: hudi master/spark-3.1.2/hive-2.3.6

Thanks.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to