[ https://issues.apache.org/jira/browse/HIVE-17638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16184690#comment-16184690 ]
Sahil Takiar commented on HIVE-17638: ------------------------------------- CC: [~janulatha] > SparkDynamicPartitionPruner loads all partition metadata into memory > -------------------------------------------------------------------- > > Key: HIVE-17638 > URL: https://issues.apache.org/jira/browse/HIVE-17638 > Project: Hive > Issue Type: Sub-task > Components: Spark > Reporter: Sahil Takiar > > The {{SparkDynamicPartitionPruner}} first loads the contents of each > partition pruning file into memory, and then prunes all the partitions from > the {{MapWork}}. This can cause increased memory pressure on the HoS Remote > Driver because it requires loading all the partition metadata into memory. It > would be more efficient if pruning of partitions was done while scanning the > files, so that all the partition metadata doesn't need to be buffered in > memory. -- This message was sent by Atlassian JIRA (v6.4.14#64029)