Hello Chesnay,

A cluster-wide (in Kubernetes) checkpointing directory on a GlusterFS volume mount (hence the file:/// access protocol) worked fine for us up to 1.4.2, so we would like to understand where the breakage happened in 1.5.0. Could you please point me to the source files that contain the RocksDB “custom file path” parsing logic? We would like to investigate this.
I also observed the following in the log:

Config uses deprecated configuration key 'state.backend.rocksdb.checkpointdir' instead of proper key 'state.backend.rocksdb.localdir'

Regards,
Shaswata

From: Chesnay Schepler [mailto:ches...@apache.org]
Sent: Tuesday, July 03, 2018 5:52 PM
To: Data Engineer <dataenginee...@gmail.com>
Cc: user@flink.apache.org
Subject: Re: Checkpointing in Flink 1.5.0

The code appears to be working fine.

This may happen because you're using a GlusterFS volume. The RocksDBStateBackend uses Java Files internally (NOT NIO Paths), which AFAIK only work properly against the plain local file system. The GlusterFS NIO FileSystem implementation also explicitly does not support conversions to File: https://github.com/gluster/glusterfs-java-filesystem/blob/master/glusterfs-java-filesystem/src/main/java/com/peircean/glusterfs/GlusterPath.java#L271

On 03.07.2018 13:53, Chesnay Schepler wrote:
Thanks. Looks like RocksDBStateBackend.setDbStoragePaths has some custom file path parsing logic, will probe it a bit to see what the issue is.
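To make the point about java.io.File concrete, here is a minimal plain-Java sketch of how a "file:///" URI string behaves when it is handed to File directly. The thread does not show the actual check inside RocksDBStateBackend.setDbStoragePaths, so the assumption here is only that some File.isAbsolute() style test is involved; the class LocalDirCheck and the helper toLocalFile are hypothetical names for illustration and are not part of Flink. The sketch also assumes a Unix-like file system.

```java
import java.io.File;
import java.net.URI;

// Hypothetical illustration, not Flink code. Assumes a Unix-like file system.
class LocalDirCheck {

    // Hypothetical helper: accept a plain path or a file:// URI and
    // return the local File it refers to; reject any other scheme.
    static File toLocalFile(String raw) {
        URI uri = URI.create(raw);
        if (uri.getScheme() == null) {
            // No scheme, treat the string as a plain local path.
            return new File(raw);
        }
        if (!"file".equalsIgnoreCase(uri.getScheme())) {
            throw new IllegalArgumentException("Path " + raw + " has a non-local scheme");
        }
        // For file:///opt/x the URI path component is /opt/x.
        return new File(uri.getPath());
    }

    public static void main(String[] args) {
        String configured = "file:///opt/flink/share/myfl-flink/checkpoints/rocksdb_state";

        // File treats the whole string as a path name. It does not start
        // with '/', so on Linux it is not considered absolute.
        System.out.println(new File(configured).isAbsolute());

        // Stripping the scheme first yields an ordinary absolute path.
        System.out.println(toLocalFile(configured).isAbsolute());
    }
}
```

If the backend does something along the lines of the first call, a scheme-prefixed value would be rejected as relative, which would match the exception text in the log. That remains a guess until the source has been checked.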
On 03.07.2018 13:45, Data Engineer wrote:
2018-07-03 11:30:35,703 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - --------------------------------------------------------------------------------
2018-07-03 11:30:35,705 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Starting StandaloneSessionClusterEntrypoint (Version: <unknown>, Rev:c61b108, Date:24.05.2018 @ 16:54:44 CEST)
2018-07-03 11:30:35,705 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - OS current user: flink
2018-07-03 11:30:35,705 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Current Hadoop/Kerberos user: <no hadoop dependency found>
2018-07-03 11:30:35,706 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - JVM: OpenJDK 64-Bit Server VM - Oracle Corporation - 1.8/25.171-b10
2018-07-03 11:30:35,706 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Maximum heap size: 981 MiBytes
2018-07-03 11:30:35,706 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - JAVA_HOME: /etc/alternatives/jre_openjdk/
2018-07-03 11:30:35,707 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - No Hadoop Dependency available
2018-07-03 11:30:35,707 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - JVM Options:
2018-07-03 11:30:35,707 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - -Xms1024m
2018-07-03 11:30:35,707 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - -Xmx1024m
2018-07-03 11:30:35,708 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - -Dlog.file=/opt/flink-1.5.0/log/flink--standalonesession-0-myfl-flink-jobmanager-7b4d8c4dd4-bv6zf.log
2018-07-03 11:30:35,708 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - -Dlog4j.configuration=file:/opt/flink-1.5.0/conf/log4j.properties
2018-07-03 11:30:35,708 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - -Dlogback.configurationFile=file:/opt/flink-1.5.0/conf/logback.xml
2018-07-03 11:30:35,708 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Program Arguments:
2018-07-03 11:30:35,709 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - --configDir
2018-07-03 11:30:35,709 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - /opt/flink-1.5.0/conf
2018-07-03 11:30:35,709 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - --executionMode
2018-07-03 11:30:35,709 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - cluster
2018-07-03 11:30:35,710 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - --host
2018-07-03 11:30:35,710 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - cluster
2018-07-03 11:30:35,710 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Classpath: /opt/flink-1.5.0/lib/flink-cep_2.11-1.5.0.jar:/opt/flink-1.5.0/lib/flink-connectors-1.5.0.jar:/opt/flink-1.5.0/lib/flink-gelly_2.11-1.5.0.jar:/opt/flink-1.5.0/lib/flink-ml_2.11-1.5.0.jar:/opt/flink-1.5.0/lib/flink-python_2.11-1.5.0.jar:/opt/flink-1.5.0/lib/flink-table_2.11-1.5.0.jar:/opt/flink-1.5.0/lib/log4j-1.2.17.jar:/opt/flink-1.5.0/lib/slf4j-log4j12-1.7.7.jar:/opt/flink-1.5.0/lib/flink-dist_2.11-1.5.0.jar:::
2018-07-03 11:30:35,710 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - --------------------------------------------------------------------------------
2018-07-03 11:30:35,712 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Registered UNIX signal handlers for [TERM, HUP, INT]
2018-07-03 11:30:35,720 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: blob.server.port, 4124
2018-07-03 11:30:35,720 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: jobmanager.rpc.address, myfl-flink-jobmanager
2018-07-03 11:30:35,720 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: jobmanager.rpc.port, 4123
2018-07-03 11:30:35,721 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: jobmanager.heap.mb, 1024
2018-07-03 11:30:35,721 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: taskmanager.heap.mb, 1024
2018-07-03 11:30:35,721 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: taskmanager.rpc.port, 4122
2018-07-03 11:30:35,721 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: taskmanager.data.port, 4121
2018-07-03 11:30:35,721 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: taskmanager.query.port, 4125
2018-07-03 11:30:35,722 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: rest.port, 8081
2018-07-03 11:30:35,762 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: rest.address, myfl-flink-jobmanager-ui
2018-07-03 11:30:35,762 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: state.backend, rocksdb
2018-07-03 11:30:35,762 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: state.checkpoints.dir, file:///opt/flink/share/myfl-flink/checkpoints/ext_checkpoints
2018-07-03 11:30:35,763 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: state.backend.fs.checkpointdir, file:///opt/flink/share/myfl-flink/checkpoints/fs_state
2018-07-03 11:30:35,763 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: state.backend.rocksdb.checkpointdir, file:///opt/flink/share/myfl-flink/checkpoints/rocksdb_state
2018-07-03 11:30:35,763 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: taskmanager.numberOfTaskSlots, 4
2018-07-03 11:30:35,763 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: restart-strategy, fixed-delay
2018-07-03 11:30:35,764 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: restart-strategy.fixed-delay.attempts, 100
2018-07-03 11:30:35,764 INFO org.apache.flink.configuration.GlobalConfiguration - Loading configuration property: restart-strategy.fixed-delay.delay, 1 s
2018-07-03 11:30:35,885 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Starting StandaloneSessionClusterEntrypoint.
2018-07-03 11:30:35,885 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Install default filesystem.
2018-07-03 11:30:35,892 INFO org.apache.flink.core.fs.FileSystem - Hadoop is not in the classpath/dependencies. The extended set of supported File Systems via Hadoop is not available.
2018-07-03 11:30:35,963 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Install security context.
2018-07-03 11:30:35,970 INFO org.apache.flink.runtime.security.modules.HadoopModuleFactory - Cannot create Hadoop Security Module because Hadoop cannot be found in the Classpath.
2018-07-03 11:30:35,988 INFO org.apache.flink.runtime.security.SecurityUtils - Cannot install HadoopSecurityContext because Hadoop cannot be found in the Classpath.
2018-07-03 11:30:35,989 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Initializing cluster services.
2018-07-03 11:30:36,003 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Trying to start actor system at myfl-flink-jobmanager:4123
2018-07-03 11:30:37,288 INFO akka.event.slf4j.Slf4jLogger - Slf4jLogger started
2018-07-03 11:30:37,396 INFO akka.remote.Remoting - Starting remoting
2018-07-03 11:30:37,583 INFO akka.remote.Remoting - Remoting started; listening on addresses :[akka.tcp://flink@myfl-flink-jobmanager:4123]
2018-07-03 11:30:37,591 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Actor system started at akka.tcp://flink@myfl-flink-jobmanager:4123
2018-07-03 11:30:37,611 INFO org.apache.flink.runtime.blob.BlobServer - Created BLOB server storage directory /tmp/blobStore-e445bc66-cee3-4a3d-b810-74df02627eca
2018-07-03 11:30:37,613 INFO org.apache.flink.runtime.blob.BlobServer - Started BLOB server at 0.0.0.0:4124 - max concurrent requests: 50 - max backlog: 1000
2018-07-03 11:30:37,629 INFO org.apache.flink.runtime.metrics.MetricRegistryImpl - No metrics reporter configured, no metrics will be exposed/reported.
2018-07-03 11:30:37,664 INFO org.apache.flink.runtime.dispatcher.FileArchivedExecutionGraphStore - Initializing FileArchivedExecutionGraphStore: Storage directory /tmp/executionGraphStore-4ff546b1-4bfb-4911-9314-89c61d7e7149, expiration time 3600000, maximum cache size 52428800 bytes.
2018-07-03 11:30:37,694 INFO org.apache.flink.runtime.blob.TransientBlobCache - Created BLOB cache storage directory /tmp/blobStore-7e0efdb8-f70b-42ed-9387-c0e1b8090b36
2018-07-03 11:30:37,702 WARN org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint - Upload directory /tmp/flink-web-e68a12b9-b9cc-4508-be00-4bf9f113afcd/flink-web-upload does not exist, or has been deleted externally. Previously uploaded files are no longer available.
2018-07-03 11:30:37,703 INFO org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint - Created directory /tmp/flink-web-e68a12b9-b9cc-4508-be00-4bf9f113afcd/flink-web-upload for file uploads.
2018-07-03 11:30:37,706 INFO org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint - Starting rest endpoint.
2018-07-03 11:30:38,369 INFO org.apache.flink.runtime.webmonitor.WebMonitorUtils - Determined location of main cluster component log file: /opt/flink-1.5.0/log/flink--standalonesession-0-myfl-flink-jobmanager-7b4d8c4dd4-bv6zf.log
2018-07-03 11:30:38,369 INFO org.apache.flink.runtime.webmonitor.WebMonitorUtils - Determined location of main cluster component stdout file: /opt/flink-1.5.0/log/flink--standalonesession-0-myfl-flink-jobmanager-7b4d8c4dd4-bv6zf.out
2018-07-03 11:30:38,567 INFO org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint - Rest endpoint listening at myfl-flink-jobmanager-ui:8081
2018-07-03 11:30:38,568 INFO org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint - http://myfl-flink-jobmanager-ui:8081 was granted leadership with leaderSessionID=00000000-0000-0000-0000-000000000000
2018-07-03 11:30:38,568 INFO org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint - Web frontend listening at http://myfl-flink-jobmanager-ui:8081.
2018-07-03 11:30:38,578 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcService - Starting RPC endpoint for org.apache.flink.runtime.resourcemanager.StandaloneResourceManager at akka://flink/user/resourcemanager .
2018-07-03 11:30:38,966 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcService - Starting RPC endpoint for org.apache.flink.runtime.dispatcher.StandaloneDispatcher at akka://flink/user/dispatcher .
2018-07-03 11:30:39,068 INFO org.apache.flink.runtime.resourcemanager.StandaloneResourceManager - ResourceManager akka.tcp://flink@myfl-flink-jobmanager:4123/user/resourcemanager was granted leadership with fencing token 00000000000000000000000000000000
2018-07-03 11:30:39,069 INFO org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager - Starting the SlotManager.
2018-07-03 11:30:39,164 INFO org.apache.flink.runtime.dispatcher.StandaloneDispatcher - Dispatcher akka.tcp://flink@myfl-flink-jobmanager:4123/user/dispatcher was granted leadership with fencing token 00000000000000000000000000000000
2018-07-03 11:30:39,165 INFO org.apache.flink.runtime.dispatcher.StandaloneDispatcher - Recovering all persisted jobs.
2018-07-03 11:30:39,682 INFO org.apache.flink.runtime.resourcemanager.StandaloneResourceManager - Replacing old registration of TaskExecutor 068c693b9585900f68c53b00507ee889.
2018-07-03 11:30:39,683 INFO org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager - Unregister TaskManager 8a5cee3aa38081030dc8558ac477d3b3 from the SlotManager.
2018-07-03 11:30:39,683 INFO org.apache.flink.runtime.resourcemanager.StandaloneResourceManager - The target with resource ID 068c693b9585900f68c53b00507ee889 is already been monitored.
2018-07-03 11:30:39,770 INFO org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager - Registering TaskManager 068c693b9585900f68c53b00507ee889 under 03d409e5166fad4f4082b6165eb0de2e at the SlotManager.
2018-07-03 11:34:20,257 INFO org.apache.flink.runtime.dispatcher.StandaloneDispatcher - Submitting job b684656d9afd75cc384a7bcd071bf55e (CSV Files Read -> CSV to Avro encode -> Kafka publish).
2018-07-03 11:34:20,269 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcService - Starting RPC endpoint for org.apache.flink.runtime.jobmaster.JobMaster at akka://flink/user/jobmanager_0 .
2018-07-03 11:34:20,278 INFO org.apache.flink.runtime.jobmaster.JobMaster - Initializing job CSV Files Read -> CSV to Avro encode -> Kafka publish (b684656d9afd75cc384a7bcd071bf55e).
2018-07-03 11:34:20,285 INFO org.apache.flink.runtime.jobmaster.JobMaster - Using restart strategy FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647, delayBetweenRestartAttempts=0) for CSV Files Read -> CSV to Avro encode -> Kafka publish (b684656d9afd75cc384a7bcd071bf55e).
2018-07-03 11:34:20,289 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcService - Starting RPC endpoint for org.apache.flink.runtime.jobmaster.slotpool.SlotPool at akka://flink/user/67ceb2ae-1cb1-44be-a09e-601032e23fb5 .
2018-07-03 11:34:20,481 INFO org.apache.flink.runtime.executiongraph.ExecutionGraph - Job recovers via failover strategy: full graph restart
2018-07-03 11:34:20,562 INFO org.apache.flink.runtime.jobmaster.JobMaster - Running initialization on master for job CSV Files Read -> CSV to Avro encode -> Kafka publish (b684656d9afd75cc384a7bcd071bf55e).
2018-07-03 11:34:20,563 INFO org.apache.flink.runtime.jobmaster.JobMaster - Successfully ran initialization on master in 0 ms.
2018-07-03 11:34:20,580 INFO org.apache.flink.runtime.jobmaster.JobMaster - Loading state backend via factory org.apache.flink.contrib.streaming.state.RocksDBStateBackendFactory
2018-07-03 11:34:20,590 WARN org.apache.flink.configuration.Configuration - Config uses deprecated configuration key 'state.backend.rocksdb.checkpointdir' instead of proper key 'state.backend.rocksdb.localdir'
2018-07-03 11:34:20,592 ERROR org.apache.flink.runtime.rest.handler.job.JobSubmitHandler - Implementation error: Unhandled exception.
org.apache.flink.util.FlinkException: Failed to submit job b684656d9afd75cc384a7bcd071bf55e.
    at org.apache.flink.runtime.dispatcher.Dispatcher.submitJob(Dispatcher.java:254)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:498)
    at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcInvocation(AkkaRpcActor.java:247)
    at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(AkkaRpcActor.java:162)
    at org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.handleRpcMessage(FencedAkkaRpcActor.java:70)
    at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.onReceive(AkkaRpcActor.java:142)
    at org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.onReceive(FencedAkkaRpcActor.java:40)
    at akka.actor.UntypedActor$$anonfun$receive$1.applyOrElse(UntypedActor.scala:165)
    at akka.actor.Actor$class.aroundReceive(Actor.scala:502)
    at akka.actor.UntypedActor.aroundReceive(UntypedActor.scala:95)
    at akka.actor.ActorCell.receiveMessage(ActorCell.scala:526)
    at akka.actor.ActorCell.invoke(ActorCell.scala:495)
    at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:257)
    at akka.dispatch.Mailbox.run(Mailbox.scala:224)
    at akka.dispatch.Mailbox.exec(Mailbox.scala:234)
    at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
    at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
    at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
    at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
Caused by: org.apache.flink.runtime.client.JobExecutionException: Could not set up JobManager
    at org.apache.flink.runtime.jobmaster.JobManagerRunner.<init>(JobManagerRunner.java:169)
    at org.apache.flink.runtime.dispatcher.Dispatcher$DefaultJobManagerRunnerFactory.createJobManagerRunner(Dispatcher.java:885)
    at org.apache.flink.runtime.dispatcher.Dispatcher.createJobManagerRunner(Dispatcher.java:287)
    at org.apache.flink.runtime.dispatcher.Dispatcher.runJob(Dispatcher.java:277)
    at org.apache.flink.runtime.dispatcher.Dispatcher.persistAndRunJob(Dispatcher.java:262)
    at org.apache.flink.runtime.dispatcher.Dispatcher.submitJob(Dispatcher.java:249)
    ... 21 more
Caused by: org.apache.flink.runtime.client.JobExecutionException: Could not instantiate configured state backend
    at org.apache.flink.runtime.executiongraph.ExecutionGraphBuilder.buildGraph(ExecutionGraphBuilder.java:308)
    at org.apache.flink.runtime.executiongraph.ExecutionGraphBuilder.buildGraph(ExecutionGraphBuilder.java:100)
    at org.apache.flink.runtime.jobmaster.JobMaster.createExecutionGraph(JobMaster.java:1150)
    at org.apache.flink.runtime.jobmaster.JobMaster.createAndRestoreExecutionGraph(JobMaster.java:1130)
    at org.apache.flink.runtime.jobmaster.JobMaster.<init>(JobMaster.java:298)
    at org.apache.flink.runtime.jobmaster.JobManagerRunner.<init>(JobManagerRunner.java:151)
    ... 26 more
Caused by: org.apache.flink.configuration.IllegalConfigurationException: Invalid configuration for RocksDB state backend's local storage directories: Relative paths are not supported
    at org.apache.flink.contrib.streaming.state.RocksDBStateBackend.<init>(RocksDBStateBackend.java:273)
    at org.apache.flink.contrib.streaming.state.RocksDBStateBackend.configure(RocksDBStateBackend.java:296)
    at org.apache.flink.contrib.streaming.state.RocksDBStateBackendFactory.createFromConfig(RocksDBStateBackendFactory.java:47)
    at org.apache.flink.contrib.streaming.state.RocksDBStateBackendFactory.createFromConfig(RocksDBStateBackendFactory.java:32)
    at org.apache.flink.runtime.state.StateBackendLoader.loadStateBackendFromConfig(StateBackendLoader.java:157)
    at org.apache.flink.runtime.state.StateBackendLoader.fromApplicationOrConfigOrDefault(StateBackendLoader.java:222)
    at org.apache.flink.runtime.executiongraph.ExecutionGraphBuilder.buildGraph(ExecutionGraphBuilder.java:304)
    ... 31 more
Caused by: java.lang.IllegalArgumentException: Relative paths are not supported
    at org.apache.flink.contrib.streaming.state.RocksDBStateBackend.setDbStoragePaths(RocksDBStateBackend.java:518)
    at org.apache.flink.contrib.streaming.state.RocksDBStateBackend.<init>(RocksDBStateBackend.java:269)
    ... 37 more

On Tue, Jul 3, 2018 at 5:11 PM, Chesnay Schepler <ches...@apache.org> wrote:
Doesn't sound like intended behavior, can you give us the stacktrace?

On 03.07.2018 13:17, Data Engineer wrote:
The Flink documentation says that we need to specify the filesystem type (file://, hdfs://) when configuring the rocksdb backend dir.
https://ci.apache.org/projects/flink/flink-docs-release-1.5/ops/state/state_backends.html#the-rocksdbstatebackend

But when I do this, I get an error on job submission saying that relative paths are not permitted in the rocksdb state backend. I am submitting the job via flink cli (bin/flink run).
Also, even though I give a local file system path "file:///home/abc/share", it is a shared GlusterFS volume mount, so it will be accessible by the JobManager and all TaskManagers.

I removed the filesystem type from the rocksdb backend dir configuration, and though the job got submitted, the rocksdb checkpoint directory was not created. I have enabled checkpointing in my Flink application.

I am using Flink 1.5.0.

Any help or pointers would be appreciated.
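The WARN line in the log, read together with the "local storage directories" wording of the exception, suggests that the value set under the deprecated key 'state.backend.rocksdb.checkpointdir' is picked up as the setting for 'state.backend.rocksdb.localdir'. That is an inference from the log alone, not something confirmed from the Flink source in this thread. A minimal plain-Java sketch of such a deprecated-key fallback follows; the class DeprecatedKeyLookup and the method resolveLocalDir are hypothetical names and do not use Flink's Configuration class.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration of a deprecated-key fallback, not Flink code.
class DeprecatedKeyLookup {

    static final String KEY = "state.backend.rocksdb.localdir";
    static final String DEPRECATED_KEY = "state.backend.rocksdb.checkpointdir";

    // Prefer the current key; otherwise fall back to the deprecated key
    // and warn. Returns null when neither key is set.
    static String resolveLocalDir(Map<String, String> conf) {
        if (conf.containsKey(KEY)) {
            return conf.get(KEY);
        }
        if (conf.containsKey(DEPRECATED_KEY)) {
            System.err.println("Config uses deprecated configuration key '"
                    + DEPRECATED_KEY + "' instead of proper key '" + KEY + "'");
            return conf.get(DEPRECATED_KEY);
        }
        return null;
    }

    public static void main(String[] args) {
        // Values copied from the "Loading configuration property" lines in the log.
        Map<String, String> conf = new LinkedHashMap<>();
        conf.put("state.checkpoints.dir",
                "file:///opt/flink/share/myfl-flink/checkpoints/ext_checkpoints");
        conf.put(DEPRECATED_KEY,
                "file:///opt/flink/share/myfl-flink/checkpoints/rocksdb_state");

        // The file:/// value stored under the deprecated key is what comes back.
        System.out.println(resolveLocalDir(conf));
    }
}
```

Under that reading, the scheme-prefixed URI meant for checkpoint data would end up being validated as a local working directory, while 'state.checkpoints.dir' stays the place for the checkpoint URI.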