[ https://issues.apache.org/jira/browse/HIVE-19041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16463109#comment-16463109 ]
Vihang Karajgaonkar commented on HIVE-19041: -------------------------------------------- The test which did not show comments as wasting but then in my testing we do not use comments to the column fields. We would need actual dumps from users to confirm that theory which we don't have. Particularly for this issue. Even otherwise, I think we can intern the comment in the partition objects since they will be the same for all the partitions. > Thrift deserialization of Partition objects should intern fields > ---------------------------------------------------------------- > > Key: HIVE-19041 > URL: https://issues.apache.org/jira/browse/HIVE-19041 > Project: Hive > Issue Type: Improvement > Components: Metastore > Affects Versions: 3.0.0, 2.3.2 > Reporter: Vihang Karajgaonkar > Assignee: Vihang Karajgaonkar > Priority: Major > Attachments: HIVE-19041.01.patch > > > When a client is creating large number of partitions, the thrift objects are > deserialized into Partition objects. The read method of these objects does > not intern the inputformat, location, outputformat which cause large number > of duplicate Strings in the HMS memory. We should intern these objects while > deserialization to reduce memory pressure. -- This message was sent by Atlassian JIRA (v7.6.3#76005)