Hi folks,

I have a question related configuration for new memory introduced in flink
1.10. Has anyone encountered similar problem?
I'm trying to make use of *taskmanager.memory.process.size* configuration
key in combination with mesos session cluster, but I get an error like this:

2020-03-11 11:44:09,771 [main] ERROR
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     - Error
while starting the TaskManager
org.apache.flink.configuration.IllegalConfigurationException: Failed
to create TaskExecutorResourceSpec
        at 
org.apache.flink.runtime.taskexecutor.TaskExecutorResourceUtils.resourceSpecFromConfig(TaskExecutorResourceUtils.java:72)
        at 
org.apache.flink.runtime.taskexecutor.TaskManagerRunner.startTaskManager(TaskManagerRunner.java:356)
        at 
org.apache.flink.runtime.taskexecutor.TaskManagerRunner.<init>(TaskManagerRunner.java:152)
        at 
org.apache.flink.runtime.taskexecutor.TaskManagerRunner.runTaskManager(TaskManagerRunner.java:308)
        at 
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner.lambda$main$0(MesosTaskExecutorRunner.java:106)
        at java.base/java.security.AccessController.doPrivileged(Native Method)
        at java.base/javax.security.auth.Subject.doAs(Subject.java:423)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1692)
        at 
org.apache.flink.runtime.security.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
        at 
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner.main(MesosTaskExecutorRunner.java:105)
Caused by: org.apache.flink.configuration.IllegalConfigurationException:
The required configuration option Key:
'taskmanager.memory.task.heap.size' , default: null (fallback keys:
[]) is not set
        at 
org.apache.flink.runtime.taskexecutor.TaskExecutorResourceUtils.checkConfigOptionIsSet(TaskExecutorResourceUtils.java:90)
        at 
org.apache.flink.runtime.taskexecutor.TaskExecutorResourceUtils.lambda$checkTaskExecutorResourceConfigSet$0(TaskExecutorResourceUtils.java:84)
        at java.base/java.util.Arrays$ArrayList.forEach(Arrays.java:4390)
        at 
org.apache.flink.runtime.taskexecutor.TaskExecutorResourceUtils.checkTaskExecutorResourceConfigSet(TaskExecutorResourceUtils.java:84)
        at 
org.apache.flink.runtime.taskexecutor.TaskExecutorResourceUtils.resourceSpecFromConfig(TaskExecutorResourceUtils.java:70)
        ... 9 more

But when task manager is launched, it correctly parses process memory key:

2020-03-11 11:43:55,376 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
--------------------------------------------------------------------------------
2020-03-11 11:43:55,377 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
Starting MesosTaskExecutorRunner (Version: 1.10.0, Rev:aa4eb8f,
Date:07.02.2020 @ 19:18:19 CET)
2020-03-11 11:43:55,377 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -  OS
current user: root
2020-03-11 11:43:57,347 [main] WARN
org.apache.hadoop.util.NativeCodeLoader                       - Unable
to load native-hadoop library for your platform... using builtin-java
classes where applicable
2020-03-11 11:43:57,535 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -  JVM:
OpenJDK 64-Bit Server VM - AdoptOpenJDK - 11/11.0.2+9
2020-03-11 11:43:57,535 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
Maximum heap size: 746 MiBytes
2020-03-11 11:43:57,535 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
JAVA_HOME: (not set)
2020-03-11 11:43:57,539 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
Hadoop version: 2.6.5
2020-03-11 11:43:57,539 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -  JVM
Options:
2020-03-11 11:43:57,539 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
-Xmx781818251
2020-03-11 11:43:57,539 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
-Xms781818251
2020-03-11 11:43:57,540 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
-XX:MaxDirectMemorySize=317424929
2020-03-11 11:43:57,540 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
-XX:MaxMetaspaceSize=100663296
2020-03-11 11:43:57,540 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
-Dlog.file=/var/log/flink-session-cluster/taskmanager.log
2020-03-11 11:43:57,540 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
-Dlog4j.configuration=file:/opt/flink/conf/log4j.properties
2020-03-11 11:43:57,540 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
-Dlogback.configurationFile=file:/opt/flink/conf/logback.xml
2020-03-11 11:43:57,540 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
Program Arguments: (none)
2020-03-11 11:43:57,540 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
Classpath: 
/opt/flink/lib/apache-log4j-extras-1.2.17.jar:/opt/flink/lib/flink-metrics-graphite-1.10.0.jar:/opt/flink/lib/flink-shaded-hadoop-2-uber-2.6.5-8.0.jar:/opt/flink/lib/flink-table-blink_2.12-1.10.0.jar:/opt/flink/lib/flink-table_2.12-1.10.0.jar:/opt/flink/lib/log4j-1.2.17.jar:/opt/flink/lib/slf4j-log4j12-1.7.15.jar:/opt/flink/lib/flink-dist_2.12-1.10.0.jar:
2020-03-11 11:43:57,541 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
--------------------------------------------------------------------------------
2020-03-11 11:43:57,542 [main] INFO
org.apache.flink.mesos.entrypoint.MesosTaskExecutorRunner     -
Registered UNIX signal handlers for [TERM, HUP, INT]
2020-03-11 11:43:57,550 [main] INFO
org.apache.flink.configuration.GlobalConfiguration            -
Loading configuration property: taskmanager.memory.process.size, 2g
2020-03-11 11:43:57,550 [main] INFO
org.apache.flink.configuration.GlobalConfiguration            -
Loading configuration property: taskmanager.cpu.cores, 2
2020-03-11 11:43:57,551 [main] INFO
org.apache.flink.configuration.GlobalConfiguration            -
Loading configuration property: taskmanager.numberOfTaskSlots, 4
2020-03-11 11:43:57,551 [main] INFO
org.apache.flink.configuration.GlobalConfiguration            -
Loading configuration property: parallelism.default, 1
...

Judging by the docs specifying *taskmanager.memory.process.size* key should
be enough to launch the job, but it seems like this value is ignored.
I would appreciate any suggestion.

Regards and thanks in advance,
Alex.

Reply via email to