Thanks for the reply Venkatesh. I did tried to increase the tez container size to 4GB but still giving me the same error. In addition, below are the settings I have tried:
set mapreduce.map.memory.mb=4096; set mapreduce.map.java.opts=-Xmx3686m; set mapreduce.reduce.memory.mb=8192; set mapreduce.reduce.java.opts=-Xmx7372m; set hive.tez.container.size = 4096; set hive.tez.java.opts =-Xmx3686m; Let me know if I'm missing anything or configuring incorrectly. Thanks, Sai. On Thu, Nov 14, 2019 at 10:52 AM Venkatesh Selvaraj < venkateshselva...@pinterest.com> wrote: > Try increasing the AM Container memory. set it to 2 gigs may be. > > Regards, > Venkat > > On Thu, Nov 14, 2019, 6:46 AM Sai Teja Desu < > saiteja.d...@globalfoundries.com> wrote: > >> Hello All, >> >> I'm new to hive development and I'm memory limitation error for running a >> simple query with a predicate which should return only a few records. Below >> are the details of the Hive table, Query and Error. Please advise me on how >> to efficiently query on predicates which does not have partitions. >> >> Table Properties: CREATE EXTERNAL TABLE TEST(location_id double, >> >> longitude double, >> >> latitude double, >> >> state string >> >> ) >> >> COMMENT 'This table is created for testing purposes' >> >> PARTITIONED BY(country string, date string) >> >> STORED AS ORC >> >> LOCATION '<S3 Location>' >> >> Total records: 9 Billion Records >> >> Number of partitions: >4k >> >> EMR Cluster Properties: Total Memory: 48 GB >> >> Number of Nodes: 2 >> >> Total vCores: 8 >> >> mapreduce.map.memory.mb=3072 >> >> mapreduce.map.java.opts=-Xmx2458m >> >> >> Query Executed: select * from test where location_id = 1234; >> >> Error:Status: Failed >> >> Application failed 2 times due to AM Container for exited with >> exitCode: -104 >> >> Failing this attempt.Diagnostics: Container is running beyond physical >> memory limits. Current usage: 1.1 GB of 1 GB physical memory used; 2.8 GB >> of 5 GB virtual memory used. Killing container. >> >> Dump of the process-tree for : >> >> |- PID PPID PGRPID SESSID CMD_NAME USER_MODE_TIME(MILLIS) >> SYSTEM_TIME(MILLIS) VMEM_USAGE(BYTES) RSSMEM_USAGE(PAGES) FULL_CMD_LINE >> >> |- 1253 1234 1234 123 (bash) 0 0 11597648 676 /bin/bash -c >> /usr/lib/jvm/java-openjdk/bin/java -Xmx819m >> -Djava.io.tmpdir=/mnt/yarn/usercache/hadoop/appcache/app30/container_11/tmp >> -server -Djava.net.preferIPv4Stack=true -Dhadoop.metrics.log.level=WARN >> -XX:+PrintGCDetails -verbose:gc -XX:+PrintGCTimeStamps -XX:+UseNUMA >> -XX:+UseParallelGC >> -Dlog4j.configuratorClass=org.apache.tez.common.TezLog4jConfigurator >> -Dlog4j.configuration=tez-container-log4j.properties >> -Dyarn.app.container.log.dir=/var/log/hadoop-yarn/containers/application_10/container_11 >> -Dtez.root.logger=INFO,CLA -Dsun.nio.ch.bugLevel='' >> org.apache.tez.dag.app.DAGAppMaster --session >> 1>/var/log/hadoop-yarn/containers/application_10/container_11/stdout >> 2>/var/log/hadoop-yarn/containers/application_10/container_11/stderr >> >> |- 1253 1234 1234 123 (java) 1253 1234 1234 123 >> /usr/lib/jvm/java-openjdk/bin/java -Xmx819m >> -Djava.io.tmpdir=/mnt/yarn/usercache/hadoop/appcache/application_10/container_11/tmp >> -server -Djava.net.preferIPv4Stack=true -Dhadoop.metrics.log.level=WARN >> -XX:+PrintGCDetails -verbose:gc -XX:+PrintGCTimeStamps -XX:+UseNUMA >> -XX:+UseParallelGC >> -Dlog4j.configuratorClass=org.apache.tez.common.TezLog4jConfigurator >> -Dlog4j.configuration=tez-container-log4j.properties >> -Dyarn.app.container.log.dir=/var/log/hadoop-yarn/containers/application_10/container_11 >> -Dtez.root.logger=INFO,CLA -Dsun.nio.ch.bugLevel= >> org.apache.tez.dag.app.DAGAppMaster --session >> >> Container killed on request. Exit code is 143 >> >> Container exited with a non-zero exit code 143 >> >> >> >>