Robert Metzger created FLINK-5383: ------------------------------------- Summary: TaskManager fails with SIGBUS when loading RocksDB Key: FLINK-5383 URL: https://issues.apache.org/jira/browse/FLINK-5383 Project: Flink Issue Type: Bug Affects Versions: 1.2.0 Reporter: Robert Metzger
While trying out Flink 1.2, my TaskManager died with the following error while deploying a job: {code} 2016-12-21 15:57:50,080 INFO org.apache.flink.runtime.taskmanager.Task - Map -> Sink : Unnamed (15/16) (50f527e4445479fb1fc9f34394d86d2f) switched from DEPLOYING to RUNNING. 2016-12-21 15:57:50,081 INFO org.apache.flink.runtime.taskmanager.Task - Map -> Sink : Unnamed (16/16) (b4b3d3340de587d729fe83d65eac3e10) switched from DEPLOYING to RUNNING. 2016-12-21 15:57:50,081 INFO org.apache.flink.streaming.runtime.tasks.StreamTask - Using user- defined state backend: RocksDB State Backend {isInitialized=false, configuredDbBasePaths=null, initialize dDbBasePaths=null, checkpointStreamBackend=File State Backend @ hdfs://nameservice1/shared/checkpoint-dir -rocks}. 2016-12-21 15:57:50,081 INFO org.apache.flink.streaming.runtime.tasks.StreamTask - Using user- defined state backend: RocksDB State Backend {isInitialized=false, configuredDbBasePaths=null, initialize dDbBasePaths=null, checkpointStreamBackend=File State Backend @ hdfs://nameservice1/shared/checkpoint-dir -rocks}. 2016-12-21 15:57:50,223 INFO org.apache.flink.contrib.streaming.state.RocksDBStateBackend - Attempting to load RocksDB native library and store it at '/yarn/nm/usercache/longrunning/appcache/application_14821 56101125_0016' LogType:taskmanager.out Log Upload Time:Wed Dec 21 16:00:35 +0000 2016 LogLength:959 Log Contents: # # A fatal error has been detected by the Java Runtime Environment: # # SIGBUS (0x7) at pc=0x00007fe745fd596a, pid=7414, tid=140630801725184 # # JRE version: Java(TM) SE Runtime Environment (7.0_67-b01) (build 1.7.0_67-b01) # Java VM: Java HotSpot(TM) 64-Bit Server VM (24.65-b04 mixed mode linux-amd64 compressed oops) # Problematic frame: # C [ld-linux-x86-64.so.2+0x1a96a] realloc+0x2bfa # {code} the error report file contained the following frames: {code} Java frames: (J=compiled Java code, j=interpreted, Vv=VM code) j java.lang.ClassLoader$NativeLibrary.load(Ljava/lang/String;)V+0 j java.lang.ClassLoader.loadLibrary1(Ljava/lang/Class;Ljava/io/File;)Z+302 j java.lang.ClassLoader.loadLibrary0(Ljava/lang/Class;Ljava/io/File;)Z+2 j java.lang.ClassLoader.loadLibrary(Ljava/lang/Class;Ljava/lang/String;Z)V+48 j java.lang.Runtime.load0(Ljava/lang/Class;Ljava/lang/String;)V+57 j java.lang.System.load(Ljava/lang/String;)V+7 j org.rocksdb.NativeLibraryLoader.loadLibraryFromJar(Ljava/lang/String;)V+14 j org.rocksdb.NativeLibraryLoader.loadLibrary(Ljava/lang/String;)V+22 j org.apache.flink.contrib.streaming.state.RocksDBStateBackend.ensureRocksDBIsLoaded(Ljava/lang/String;)V+62 j org.apache.flink.contrib.streaming.state.RocksDBStateBackend.createKeyedStateBackend(Lorg/apache/flink/runtime/execution/Environment;Lorg/apache/flink/api/common/JobID;Ljava/lang/String;Lorg/apache/flink/api/common/typeutils/TypeSerializer;ILorg/apache/flink/runtime/state/KeyGroupRange;Lorg/apache/flink/runtime/query/TaskKvStateRegistry;)Lorg/apache/flink/runtime/state/AbstractKeyedStateBackend;+16 j org.apache.flink.streaming.runtime.tasks.StreamTask.createKeyedStateBackend(Lorg/apache/flink/api/common/typeutils/TypeSerializer;ILorg/apache/flink/runtime/state/KeyGroupRange;)Lorg/apache/flink/runtime/state/AbstractKeyedStateBackend;+137 {code} I saw this error only once so far. I'll report again if it happens more frequently. -- This message was sent by Atlassian JIRA (v6.3.4#6332)