Hi Kevin, We bumped the RocksDb version with Flink 1.14 which we thought increases the memory control [1]. In the past we also saw problems with the allocator used of the OS. We switched to use jemalloc within our docker images which has a better memory fragmentation [2]. Are you using the official Flink docker image or did you build your own?
I am also pulling in yun tang who is more familiar with Flinkās state backend. Maybe he has an immediate idea about your problem. Best, Fabian [1] https://issues.apache.org/jira/browse/FLINK-14482 <https://issues.apache.org/jira/browse/FLINK-14482> [2] https://lists.apache.org/thread.html/r596a19f8cf7278bcf9e30c3060cf00562677d4be072050444a5caf99%40%3Cdev.flink.apache.org%3E <https://lists.apache.org/thread.html/r596a19f8cf7278bcf9e30c3060cf00562677d4be072050444a5caf99@%3Cdev.flink.apache.org%3E>