Hello Chesnay,

Cluster (in kubernetes)-wide checkpointing directory using glusterfs volume 
mount (thus file access protocol file:///) was working fine till 1.4.2 for us. 
So we like to understand where the breakage happened in 1.5.0.
Can you please mention me the relevant source code files related to rocksdb 
“custom file path” parsing logic? We would be interested to investigate this.

I also observed below in the log –


Config uses deprecated configuration key 'state.backend.rocksdb.checkpointdir' 
instead of proper key 'state.backend.rocksdb.localdir'
Regards,
Shaswata

From: Chesnay Schepler [mailto:ches...@apache.org]
Sent: Tuesday, July 03, 2018 5:52 PM
To: Data Engineer <dataenginee...@gmail.com>
Cc: user@flink.apache.org
Subject: Re: Checkpointing in Flink 1.5.0

The code appears to be working fine.

This may happen because you're using a GlusterFS volume.
The RocksDBStateBackend uses java Files internally (NOT nio Paths), which AFAIK 
only work properly against the plain local file-system.

The GlusterFS nio FIleSystem implementation also explicitly does not support 
conversions to 
File<https://github.com/gluster/glusterfs-java-filesystem/blob/master/glusterfs-java-filesystem/src/main/java/com/peircean/glusterfs/GlusterPath.java#L271>.

On 03.07.2018 13:53, Chesnay Schepler wrote:
Thanks. Looks like RocksDBStateBackend.setDbStoragePaths has some custom file 
path parsing logic, will probe it a bit to see what the issue is.

On 03.07.2018 13:45, Data Engineer wrote:

2018-07-03 11:30:35,703 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - 
--------------------------------------------------------------------------------

2018-07-03 11:30:35,705 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -  Starting 
StandaloneSessionClusterEntrypoint (Version: <unknown>, Rev:c61b108, 
Date:24.05.2018 @ 16:54:44 CEST)

2018-07-03 11:30:35,705 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -  OS current 
user: flink

2018-07-03 11:30:35,705 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -  Current 
Hadoop/Kerberos user: <no hadoop dependency found>

2018-07-03 11:30:35,706 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -  JVM: OpenJDK 
64-Bit Server VM - Oracle Corporation - 1.8/25.171-b10

2018-07-03 11:30:35,706 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -  Maximum heap 
size: 981 MiBytes

2018-07-03 11:30:35,706 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -  JAVA_HOME: 
/etc/alternatives/jre_openjdk/

2018-07-03 11:30:35,707 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -  No Hadoop 
Dependency available

2018-07-03 11:30:35,707 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -  JVM Options:

2018-07-03 11:30:35,707 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -     -Xms1024m

2018-07-03 11:30:35,707 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -     -Xmx1024m

2018-07-03 11:30:35,708 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -     
-Dlog.file=/opt/flink-1.5.0/log/flink--standalonesession-0-myfl-flink-jobmanager-7b4d8c4dd4-bv6zf.log

2018-07-03 11:30:35,708 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -     
-Dlog4j.configuration=file:/opt/flink-1.5.0/conf/log4j.properties<file://opt/flink-1.5.0/conf/log4j.properties>

2018-07-03 11:30:35,708 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -     
-Dlogback.configurationFile=file:/opt/flink-1.5.0/conf/logback.xml<file://opt/flink-1.5.0/conf/logback.xml>

2018-07-03 11:30:35,708 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -  Program 
Arguments:

2018-07-03 11:30:35,709 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -     --configDir

2018-07-03 11:30:35,709 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -     
/opt/flink-1.5.0/conf

2018-07-03 11:30:35,709 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -     
--executionMode

2018-07-03 11:30:35,709 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -     cluster

2018-07-03 11:30:35,710 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -     --host

2018-07-03 11:30:35,710 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -     cluster

2018-07-03 11:30:35,710 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -  Classpath: 
/opt/flink-1.5.0/lib/flink-cep_2.11-1.5.0.jar:/opt/flink-1.5.0/lib/flink-connectors-1.5.0.jar:/opt/flink-1.5.0/lib/flink-gelly_2.11-1.5.0.jar:/opt/flink-1.5.0/lib/flink-ml_2.11-1.5.0.jar:/opt/flink-1.5.0/lib/flink-python_2.11-1.5.0.jar:/opt/flink-1.5.0/lib/flink-table_2.11-1.5.0.jar:/opt/flink-1.5.0/lib/log4j-1.2.17.jar:/opt/flink-1.5.0/lib/slf4j-log4j12-1.7.7.jar:/opt/flink-1.5.0/lib/flink-dist_2.11-1.5.0.jar:::

2018-07-03 11:30:35,710 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - 
--------------------------------------------------------------------------------

2018-07-03 11:30:35,712 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Registered UNIX 
signal handlers for [TERM, HUP, INT]

2018-07-03 11:30:35,720 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: blob.server.port, 4124

2018-07-03 11:30:35,720 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: jobmanager.rpc.address, myfl-flink-jobmanager

2018-07-03 11:30:35,720 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: jobmanager.rpc.port, 4123

2018-07-03 11:30:35,721 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: jobmanager.heap.mb, 1024

2018-07-03 11:30:35,721 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: taskmanager.heap.mb, 1024

2018-07-03 11:30:35,721 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: taskmanager.rpc.port, 4122

2018-07-03 11:30:35,721 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: taskmanager.data.port, 4121

2018-07-03 11:30:35,721 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: taskmanager.query.port, 4125

2018-07-03 11:30:35,722 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: rest.port, 8081

2018-07-03 11:30:35,762 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: rest.address, myfl-flink-jobmanager-ui

2018-07-03 11:30:35,762 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: state.backend, rocksdb

2018-07-03 11:30:35,762 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: state.checkpoints.dir, 
file:///opt/flink/share/myfl-flink/checkpoints/ext_checkpoints

2018-07-03 11:30:35,763 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: state.backend.fs.checkpointdir, 
file:///opt/flink/share/myfl-flink/checkpoints/fs_state

2018-07-03 11:30:35,763 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: state.backend.rocksdb.checkpointdir, 
file:///opt/flink/share/myfl-flink/checkpoints/rocksdb_state

2018-07-03 11:30:35,763 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: taskmanager.numberOfTaskSlots, 4

2018-07-03 11:30:35,763 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: restart-strategy, fixed-delay

2018-07-03 11:30:35,764 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: restart-strategy.fixed-delay.attempts, 100

2018-07-03 11:30:35,764 INFO  
org.apache.flink.configuration.GlobalConfiguration            - Loading 
configuration property: restart-strategy.fixed-delay.delay, 1 s

2018-07-03 11:30:35,885 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Starting 
StandaloneSessionClusterEntrypoint.

2018-07-03 11:30:35,885 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install default 
filesystem.

2018-07-03 11:30:35,892 INFO  org.apache.flink.core.fs.FileSystem               
            - Hadoop is not in the classpath/dependencies. The extended set of 
supported File Systems via Hadoop is not available.

2018-07-03 11:30:35,963 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install 
security context.

2018-07-03 11:30:35,970 INFO  
org.apache.flink.runtime.security.modules.HadoopModuleFactory  - Cannot create 
Hadoop Security Module because Hadoop cannot be found in the Classpath.

2018-07-03 11:30:35,988 INFO  org.apache.flink.runtime.security.SecurityUtils   
            - Cannot install HadoopSecurityContext because Hadoop cannot be 
found in the Classpath.

2018-07-03 11:30:35,989 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Initializing 
cluster services.

2018-07-03 11:30:36,003 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Trying to start 
actor system at myfl-flink-jobmanager:4123

2018-07-03 11:30:37,288 INFO  akka.event.slf4j.Slf4jLogger                      
            - Slf4jLogger started

2018-07-03 11:30:37,396 INFO  akka.remote.Remoting                              
            - Starting remoting

2018-07-03 11:30:37,583 INFO  akka.remote.Remoting                              
            - Remoting started; listening on addresses 
:[akka.tcp://flink@myfl-flink-jobmanager:4123]

2018-07-03 11:30:37,591 INFO  
org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Actor system 
started at akka.tcp://flink@myfl-flink-jobmanager:4123

2018-07-03 11:30:37,611 INFO  org.apache.flink.runtime.blob.BlobServer          
            - Created BLOB server storage directory 
/tmp/blobStore-e445bc66-cee3-4a3d-b810-74df02627eca

2018-07-03 11:30:37,613 INFO  org.apache.flink.runtime.blob.BlobServer          
            - Started BLOB server at 0.0.0.0:4124<http://0.0.0.0:4124> - max 
concurrent requests: 50 - max backlog: 1000

2018-07-03 11:30:37,629 INFO  
org.apache.flink.runtime.metrics.MetricRegistryImpl           - No metrics 
reporter configured, no metrics will be exposed/reported.

2018-07-03 11:30:37,664 INFO  
org.apache.flink.runtime.dispatcher.FileArchivedExecutionGraphStore  - 
Initializing FileArchivedExecutionGraphStore: Storage directory 
/tmp/executionGraphStore-4ff546b1-4bfb-4911-9314-89c61d7e7149, expiration time 
3600000, maximum cache size 52428800 bytes.

2018-07-03 11:30:37,694 INFO  org.apache.flink.runtime.blob.TransientBlobCache  
            - Created BLOB cache storage directory 
/tmp/blobStore-7e0efdb8-f70b-42ed-9387-c0e1b8090b36

2018-07-03 11:30:37,702 WARN  
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint    - Upload 
directory /tmp/flink-web-e68a12b9-b9cc-4508-be00-4bf9f113afcd/flink-web-upload 
does not exist, or has been deleted externally. Previously uploaded files are 
no longer available.

2018-07-03 11:30:37,703 INFO  
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint    - Created 
directory /tmp/flink-web-e68a12b9-b9cc-4508-be00-4bf9f113afcd/flink-web-upload 
for file uploads.

2018-07-03 11:30:37,706 INFO  
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint    - Starting rest 
endpoint.

2018-07-03 11:30:38,369 INFO  
org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined 
location of main cluster component log file: 
/opt/flink-1.5.0/log/flink--standalonesession-0-myfl-flink-jobmanager-7b4d8c4dd4-bv6zf.log

2018-07-03 11:30:38,369 INFO  
org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined 
location of main cluster component stdout file: 
/opt/flink-1.5.0/log/flink--standalonesession-0-myfl-flink-jobmanager-7b4d8c4dd4-bv6zf.out

2018-07-03 11:30:38,567 INFO  
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint    - Rest endpoint 
listening at myfl-flink-jobmanager-ui:8081

2018-07-03 11:30:38,568 INFO  
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint    - 
http://myfl-flink-jobmanager-ui:8081 was granted leadership with 
leaderSessionID=00000000-0000-0000-0000-000000000000

2018-07-03 11:30:38,568 INFO  
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint    - Web frontend 
listening at http://myfl-flink-jobmanager-ui:8081.

2018-07-03 11:30:38,578 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService  
            - Starting RPC endpoint for 
org.apache.flink.runtime.resourcemanager.StandaloneResourceManager at 
akka://flink/user/resourcemanager .

2018-07-03 11:30:38,966 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService  
            - Starting RPC endpoint for 
org.apache.flink.runtime.dispatcher.StandaloneDispatcher at 
akka://flink/user/dispatcher .

2018-07-03 11:30:39,068 INFO  
org.apache.flink.runtime.resourcemanager.StandaloneResourceManager  - 
ResourceManager 
akka.tcp://flink@myfl-flink-jobmanager:4123/user/resourcemanager was granted 
leadership with fencing token 00000000000000000000000000000000

2018-07-03 11:30:39,069 INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  - Starting 
the SlotManager.

2018-07-03 11:30:39,164 INFO  
org.apache.flink.runtime.dispatcher.StandaloneDispatcher      - Dispatcher 
akka.tcp://flink@myfl-flink-jobmanager:4123/user/dispatcher was granted 
leadership with fencing token 00000000000000000000000000000000

2018-07-03 11:30:39,165 INFO  
org.apache.flink.runtime.dispatcher.StandaloneDispatcher      - Recovering all 
persisted jobs.

2018-07-03 11:30:39,682 INFO  
org.apache.flink.runtime.resourcemanager.StandaloneResourceManager  - Replacing 
old registration of TaskExecutor 068c693b9585900f68c53b00507ee889.

2018-07-03 11:30:39,683 INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  - Unregister 
TaskManager 8a5cee3aa38081030dc8558ac477d3b3 from the SlotManager.

2018-07-03 11:30:39,683 INFO  
org.apache.flink.runtime.resourcemanager.StandaloneResourceManager  - The 
target with resource ID 068c693b9585900f68c53b00507ee889 is already been 
monitored.

2018-07-03 11:30:39,770 INFO  
org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  - Registering 
TaskManager 068c693b9585900f68c53b00507ee889 under 
03d409e5166fad4f4082b6165eb0de2e at the SlotManager.

2018-07-03 11:34:20,257 INFO  
org.apache.flink.runtime.dispatcher.StandaloneDispatcher      - Submitting job 
b684656d9afd75cc384a7bcd071bf55e (CSV Files Read -> CSV to Avro encode -> Kafka 
publish).

2018-07-03 11:34:20,269 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService  
            - Starting RPC endpoint for 
org.apache.flink.runtime.jobmaster.JobMaster at akka://flink/user/jobmanager_0 .

2018-07-03 11:34:20,278 INFO  org.apache.flink.runtime.jobmaster.JobMaster      
            - Initializing job CSV Files Read -> CSV to Avro encode -> Kafka 
publish (b684656d9afd75cc384a7bcd071bf55e).

2018-07-03 11:34:20,285 INFO  org.apache.flink.runtime.jobmaster.JobMaster      
            - Using restart strategy 
FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647, 
delayBetweenRestartAttempts=0) for CSV Files Read -> CSV to Avro encode -> 
Kafka publish (b684656d9afd75cc384a7bcd071bf55e).

2018-07-03 11:34:20,289 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService  
            - Starting RPC endpoint for 
org.apache.flink.runtime.jobmaster.slotpool.SlotPool at 
akka://flink/user/67ceb2ae-1cb1-44be-a09e-601032e23fb5 .

2018-07-03 11:34:20,481 INFO  
org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job recovers 
via failover strategy: full graph restart

2018-07-03 11:34:20,562 INFO  org.apache.flink.runtime.jobmaster.JobMaster      
            - Running initialization on master for job CSV Files Read -> CSV to 
Avro encode -> Kafka publish (b684656d9afd75cc384a7bcd071bf55e).

2018-07-03 11:34:20,563 INFO  org.apache.flink.runtime.jobmaster.JobMaster      
            - Successfully ran initialization on master in 0 ms.

2018-07-03 11:34:20,580 INFO  org.apache.flink.runtime.jobmaster.JobMaster      
            - Loading state backend via factory 
org.apache.flink.contrib.streaming.state.RocksDBStateBackendFactory

2018-07-03 11:34:20,590 WARN  org.apache.flink.configuration.Configuration      
            - Config uses deprecated configuration key 
'state.backend.rocksdb.checkpointdir' instead of proper key 
'state.backend.rocksdb.localdir'

2018-07-03 11:34:20,592 ERROR 
org.apache.flink.runtime.rest.handler.job.JobSubmitHandler    - Implementation 
error: Unhandled exception.

org.apache.flink.util.FlinkException: Failed to submit job 
b684656d9afd75cc384a7bcd071bf55e.

  at 
org.apache.flink.runtime.dispatcher.Dispatcher.submitJob(Dispatcher.java:254)

  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

  at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)

  at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)

  at java.lang.reflect.Method.invoke(Method.java:498)

  at 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcInvocation(AkkaRpcActor.java:247)

  at 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(AkkaRpcActor.java:162)

  at 
org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.handleRpcMessage(FencedAkkaRpcActor.java:70)

  at 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.onReceive(AkkaRpcActor.java:142)

  at 
org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.onReceive(FencedAkkaRpcActor.java:40)

  at 
akka.actor.UntypedActor$$anonfun$receive$1.applyOrElse(UntypedActor.scala:165)

  at akka.actor.Actor$class.aroundReceive(Actor.scala:502)

  at akka.actor.UntypedActor.aroundReceive(UntypedActor.scala:95)

  at akka.actor.ActorCell.receiveMessage(ActorCell.scala:526)

  at akka.actor.ActorCell.invoke(ActorCell.scala:495)

  at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:257)

  at akka.dispatch.Mailbox.run(Mailbox.scala:224)

  at akka.dispatch.Mailbox.exec(Mailbox.scala:234)

  at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)

  at 
scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)

  at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)

  at 
scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)

Caused by: org.apache.flink.runtime.client.JobExecutionException: Could not set 
up JobManager

  at 
org.apache.flink.runtime.jobmaster.JobManagerRunner.<init>(JobManagerRunner.java:169)

  at 
org.apache.flink.runtime.dispatcher.Dispatcher$DefaultJobManagerRunnerFactory.createJobManagerRunner(Dispatcher.java:885)

  at 
org.apache.flink.runtime.dispatcher.Dispatcher.createJobManagerRunner(Dispatcher.java:287)

  at org.apache.flink.runtime.dispatcher.Dispatcher.runJob(Dispatcher.java:277)

  at 
org.apache.flink.runtime.dispatcher.Dispatcher.persistAndRunJob(Dispatcher.java:262)

  at 
org.apache.flink.runtime.dispatcher.Dispatcher.submitJob(Dispatcher.java:249)

  ... 21 more

Caused by: org.apache.flink.runtime.client.JobExecutionException: Could not 
instantiate configured state backend

  at 
org.apache.flink.runtime.executiongraph.ExecutionGraphBuilder.buildGraph(ExecutionGraphBuilder.java:308)

  at 
org.apache.flink.runtime.executiongraph.ExecutionGraphBuilder.buildGraph(ExecutionGraphBuilder.java:100)

  at 
org.apache.flink.runtime.jobmaster.JobMaster.createExecutionGraph(JobMaster.java:1150)

  at 
org.apache.flink.runtime.jobmaster.JobMaster.createAndRestoreExecutionGraph(JobMaster.java:1130)

  at org.apache.flink.runtime.jobmaster.JobMaster.<init>(JobMaster.java:298)

  at 
org.apache.flink.runtime.jobmaster.JobManagerRunner.<init>(JobManagerRunner.java:151)

  ... 26 more

Caused by: org.apache.flink.configuration.IllegalConfigurationException: 
Invalid configuration for RocksDB state backend's local storage directories: 
Relative paths are not supported

  at 
org.apache.flink.contrib.streaming.state.RocksDBStateBackend.<init>(RocksDBStateBackend.java:273)

  at 
org.apache.flink.contrib.streaming.state.RocksDBStateBackend.configure(RocksDBStateBackend.java:296)

  at 
org.apache.flink.contrib.streaming.state.RocksDBStateBackendFactory.createFromConfig(RocksDBStateBackendFactory.java:47)

  at 
org.apache.flink.contrib.streaming.state.RocksDBStateBackendFactory.createFromConfig(RocksDBStateBackendFactory.java:32)

  at 
org.apache.flink.runtime.state.StateBackendLoader.loadStateBackendFromConfig(StateBackendLoader.java:157)

  at 
org.apache.flink.runtime.state.StateBackendLoader.fromApplicationOrConfigOrDefault(StateBackendLoader.java:222)

  at 
org.apache.flink.runtime.executiongraph.ExecutionGraphBuilder.buildGraph(ExecutionGraphBuilder.java:304)

  ... 31 more

Caused by: java.lang.IllegalArgumentException: Relative paths are not supported

  at 
org.apache.flink.contrib.streaming.state.RocksDBStateBackend.setDbStoragePaths(RocksDBStateBackend.java:518)

  at 
org.apache.flink.contrib.streaming.state.RocksDBStateBackend.<init>(RocksDBStateBackend.java:269)

  ... 37 more


On Tue, Jul 3, 2018 at 5:11 PM, Chesnay Schepler 
<ches...@apache.org<mailto:ches...@apache.org>> wrote:
Doesn't sound like intended behavior, can you give us the stacktrace?


On 03.07.2018 13:17, Data Engineer wrote:
 The Flink documentation says that we need to specify the filesystem type 
(file://<file:///>, hdfs://) when configuring the rocksdb backend dir.
https://ci.apache.org/projects/flink/flink-docs-release-1.5/ops/state/state_backends.html#the-rocksdbstatebackend

But when I do this, I get an error on job submission saying that relative paths 
are not permitted in the rocksdb stand backend.
I am submitting the job via flink cli (bin/flink run).

Also, even though I give a local file system path 
"file:///home/abc/share"<file:///home/abc/share>, it is a shared GlusterFS 
volume mount, so it will be accessible by the JobManager and all TaskManagers.

I removed the filesystem type from the rocksdb backend dir configuration, and 
though the job got submitted, the rocksdb checkpoint directory was not created.
I have enabled checkpointing in my Flink application.

I am using Flink 1.5.0.

Any help or pointers would be appreciated.






Reply via email to