Thanks for the reply Venkatesh. I did tried to increase the tez container
size to 4GB but still giving me the same error. In addition, below are the
settings I have tried:

set mapreduce.map.memory.mb=4096;
set mapreduce.map.java.opts=-Xmx3686m;


set mapreduce.reduce.memory.mb=8192;
set mapreduce.reduce.java.opts=-Xmx7372m;


set hive.tez.container.size = 4096;
set hive.tez.java.opts =-Xmx3686m;

Let me know if I'm missing anything or configuring incorrectly.

Thanks,
Sai.

On Thu, Nov 14, 2019 at 10:52 AM Venkatesh Selvaraj <
venkateshselva...@pinterest.com> wrote:

> Try increasing the AM Container memory. set it to 2 gigs may be.
>
> Regards,
> Venkat
>
> On Thu, Nov 14, 2019, 6:46 AM Sai Teja Desu <
> saiteja.d...@globalfoundries.com> wrote:
>
>> Hello All,
>>
>> I'm new to hive development and I'm memory limitation error for running a
>> simple query with a predicate which should return only a few records. Below
>> are the details of the Hive table, Query and Error. Please advise me on how
>> to efficiently query on predicates which does not have partitions.
>>
>> Table Properties:     CREATE EXTERNAL TABLE TEST(location_id double,
>>
>> longitude double,
>>
>> latitude double,
>>
>> state string
>>
>> )
>>
>> COMMENT 'This table is created for testing purposes'
>>
>> PARTITIONED BY(country string, date string)
>>
>> STORED AS ORC
>>
>> LOCATION '<S3 Location>'
>>
>> Total records:  9 Billion Records
>>
>> Number of partitions: >4k
>>
>> EMR Cluster Properties:   Total Memory: 48 GB
>>
>> Number of Nodes: 2
>>
>> Total vCores: 8
>>
>> mapreduce.map.memory.mb=3072
>>
>> mapreduce.map.java.opts=-Xmx2458m
>>
>>
>> Query Executed:  select * from test where location_id = 1234;
>>
>> Error:Status:  Failed
>>
>> Application  failed 2 times due to AM Container for exited with
>>  exitCode: -104
>>
>> Failing this attempt.Diagnostics: Container is running beyond physical
>> memory limits. Current usage: 1.1 GB of 1 GB physical memory used; 2.8 GB
>> of 5 GB virtual memory used. Killing container.
>>
>> Dump of the process-tree for  :
>>
>>         |- PID PPID PGRPID SESSID CMD_NAME USER_MODE_TIME(MILLIS)
>> SYSTEM_TIME(MILLIS) VMEM_USAGE(BYTES) RSSMEM_USAGE(PAGES) FULL_CMD_LINE
>>
>>         |- 1253 1234 1234 123 (bash) 0 0 11597648 676 /bin/bash -c
>> /usr/lib/jvm/java-openjdk/bin/java  -Xmx819m
>> -Djava.io.tmpdir=/mnt/yarn/usercache/hadoop/appcache/app30/container_11/tmp
>> -server -Djava.net.preferIPv4Stack=true -Dhadoop.metrics.log.level=WARN
>> -XX:+PrintGCDetails -verbose:gc -XX:+PrintGCTimeStamps -XX:+UseNUMA
>> -XX:+UseParallelGC
>> -Dlog4j.configuratorClass=org.apache.tez.common.TezLog4jConfigurator
>> -Dlog4j.configuration=tez-container-log4j.properties
>> -Dyarn.app.container.log.dir=/var/log/hadoop-yarn/containers/application_10/container_11
>> -Dtez.root.logger=INFO,CLA -Dsun.nio.ch.bugLevel=''
>> org.apache.tez.dag.app.DAGAppMaster --session
>> 1>/var/log/hadoop-yarn/containers/application_10/container_11/stdout
>> 2>/var/log/hadoop-yarn/containers/application_10/container_11/stderr
>>
>>         |- 1253 1234 1234 123  (java) 1253 1234 1234 123
>>  /usr/lib/jvm/java-openjdk/bin/java -Xmx819m
>> -Djava.io.tmpdir=/mnt/yarn/usercache/hadoop/appcache/application_10/container_11/tmp
>> -server -Djava.net.preferIPv4Stack=true -Dhadoop.metrics.log.level=WARN
>> -XX:+PrintGCDetails -verbose:gc -XX:+PrintGCTimeStamps -XX:+UseNUMA
>> -XX:+UseParallelGC
>> -Dlog4j.configuratorClass=org.apache.tez.common.TezLog4jConfigurator
>> -Dlog4j.configuration=tez-container-log4j.properties
>> -Dyarn.app.container.log.dir=/var/log/hadoop-yarn/containers/application_10/container_11
>> -Dtez.root.logger=INFO,CLA -Dsun.nio.ch.bugLevel=
>> org.apache.tez.dag.app.DAGAppMaster --session
>>
>> Container killed on request. Exit code is 143
>>
>> Container exited with a non-zero exit code 143
>>
>>
>>
>>

Reply via email to