Prateek Kohli created FLINK-22962:
-------------------------------------

             Summary: Key group is not in KeyGroupRange error while 
checkpointing
                 Key: FLINK-22962
                 URL: https://issues.apache.org/jira/browse/FLINK-22962
             Project: Flink
          Issue Type: Bug
          Components: Runtime / State Backends
    Affects Versions: 1.12.1
         Environment: Linux
            Reporter: Prateek Kohli


Hi,

 

We are getting the below exception while using rocksdb as state backend at the 
time of checkpointing:

2021-06-10 12:05:13,933 INFO 
org.apache.flink.streaming.runtime.tasks.AsyncCheckpointRunnable [] - 
Aggregator (3/4)#0 - asynchronous part of checkpoint 2 could not be completed.
java.util.concurrent.ExecutionException: java.lang.IllegalArgumentException: 
Key group 0 is not in KeyGroupRange\{startKeyGroup=5, endKeyGroup=7}.
 at java.util.concurrent.FutureTask.report(FutureTask.java:122) ~[?:1.8.0_261]
 at java.util.concurrent.FutureTask.get(FutureTask.java:192) ~[?:1.8.0_261]
 at 
org.apache.flink.runtime.concurrent.FutureUtils.runIfNotDoneAndGet(FutureUtils.java:621)
 ~[flink-dist_2.11-1.12.1.jar:1.12.1]
 at 
org.apache.flink.streaming.api.operators.OperatorSnapshotFinalizer.<init>(OperatorSnapshotFinalizer.java:54)
 ~[flink-streaming-java_2.11-1.12.1.jar:1.12.1]
 at 
org.apache.flink.streaming.runtime.tasks.AsyncCheckpointRunnable.run(AsyncCheckpointRunnable.java:122)
 [flink-streaming-java_2.11-1.12.1.jar:1.12.1]
 at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) 
[?:1.8.0_261]
 at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) 
[?:1.8.0_261]
 at java.lang.Thread.run(Thread.java:748) [?:1.8.0_261]
Caused by: java.lang.IllegalArgumentException: Key group 0 is not in 
KeyGroupRange\{startKeyGroup=5, endKeyGroup=7}.
 at 
org.apache.flink.runtime.state.KeyGroupRangeOffsets.computeKeyGroupIndex(KeyGroupRangeOffsets.java:144)
 ~[flink-dist_2.11-1.12.1.jar:1.12.1]
 at 
org.apache.flink.runtime.state.KeyGroupRangeOffsets.setKeyGroupOffset(KeyGroupRangeOffsets.java:106)
 ~[flink-dist_2.11-1.12.1.jar:1.12.1]
 at 
org.apache.flink.contrib.streaming.state.snapshot.RocksFullSnapshotStrategy$SnapshotAsynchronousPartCallable.writeKVStateData(RocksFullSnapshotStrategy.java:333)
 ~[flink-dist_2.11-1.12.1.jar:1.12.1]
 at 
org.apache.flink.contrib.streaming.state.snapshot.RocksFullSnapshotStrategy$SnapshotAsynchronousPartCallable.writeSnapshotToOutputStream(RocksFullSnapshotStrategy.java:264)
 ~[flink-dist_2.11-1.12.1.jar:1.12.1]
 at 
org.apache.flink.contrib.streaming.state.snapshot.RocksFullSnapshotStrategy$SnapshotAsynchronousPartCallable.callInternal(RocksFullSnapshotStrategy.java:227)
 ~[flink-dist_2.11-1.12.1.jar:1.12.1]
 at 
org.apache.flink.contrib.streaming.state.snapshot.RocksFullSnapshotStrategy$SnapshotAsynchronousPartCallable.callInternal(RocksFullSnapshotStrategy.java:180)
 ~[flink-dist_2.11-1.12.1.jar:1.12.1]
 at 
org.apache.flink.runtime.state.AsyncSnapshotCallable.call(AsyncSnapshotCallable.java:78)
 ~[flink-dist_2.11-1.12.1.jar:1.12.1]
 at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[?:1.8.0_261]
 at 
org.apache.flink.runtime.concurrent.FutureUtils.runIfNotDoneAndGet(FutureUtils.java:618)
 ~[flink-dist_2.11-1.12.1.jar:1.12.1]
 ... 5 more

 

When we change the state backend to file or heap we do not get this error.

 

 

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to