Hi
On 2019/9/4 19:30, liu ze wrote:
I use the row_number() over() function to do top-N. The total amount of
data is 60,000 rows, and the state has grown to 12 GB.
Eventually the job fails with an OOM. Is there any way to optimize it?
ref:
https://stackoverflow.com/questions/50812837/flink-taskmanager-out-of-memory-and-memory-configuration
The total amount of required physical and heap memory is quite difficult
to compute since it strongly depends on your user code, your job's
topology and which state backend you use.
As a rule of thumb, if you experience OOM and are still using the
FileSystemStateBackend or the MemoryStateBackend, then you should switch
to RocksDBStateBackend, because it can gracefully spill to disk if the
state grows too big.
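For reference, here is a minimal sketch of switching the backend
programmatically. This assumes the flink-statebackend-rocksdb dependency
is on the classpath; the checkpoint URI is a placeholder for your own
durable storage:

    import org.apache.flink.contrib.streaming.state.RocksDBStateBackend;
    import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

    public class RocksDbBackendSketch {
        public static void main(String[] args) throws Exception {
            StreamExecutionEnvironment env =
                    StreamExecutionEnvironment.getExecutionEnvironment();
            // RocksDB keeps working state on local disk instead of the JVM
            // heap, so large state can spill gracefully. The URI is a
            // placeholder; 'true' enables incremental checkpoints.
            env.setStateBackend(
                    new RocksDBStateBackend("hdfs:///flink/checkpoints", true));
            // ... define the job (e.g. the SQL top-N query) and call
            // env.execute() as before ...
        }
    }

Alternatively, you can set state.backend: rocksdb in flink-conf.yaml
instead of doing it in code.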
If you are still experiencing OOM exceptions as you have described, then
you should check whether your user code keeps references to state
objects or otherwise creates large objects which cannot be garbage
collected. If this is the case, then you should try to refactor your
code to rely on Flink's state abstractions, because with RocksDB the
state can go out of core.
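To make the state-abstraction point concrete, here is a minimal sketch
(class and state names are made up for the example) of a
KeyedProcessFunction that keeps a per-key count in ValueState rather
than in an ordinary field, so the configured backend owns the data:

    import org.apache.flink.api.common.state.ValueState;
    import org.apache.flink.api.common.state.ValueStateDescriptor;
    import org.apache.flink.configuration.Configuration;
    import org.apache.flink.streaming.api.functions.KeyedProcessFunction;
    import org.apache.flink.util.Collector;

    public class CountPerKey extends KeyedProcessFunction<String, String, Long> {
        // A handle to the state, not the state itself; the state backend
        // (e.g. RocksDB) stores the actual data, possibly off-heap.
        private transient ValueState<Long> count;

        @Override
        public void open(Configuration parameters) {
            count = getRuntimeContext().getState(
                    new ValueStateDescriptor<>("count", Long.class));
        }

        @Override
        public void processElement(String value, Context ctx, Collector<Long> out)
                throws Exception {
            Long current = count.value();
            long updated = (current == null ? 0L : current) + 1;
            count.update(updated);  // persisted by the state backend
            out.collect(updated);
        }
    }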
RocksDB itself needs native memory, which adds to Flink's memory
footprint. How much depends on the block cache size, indexes, bloom
filters and memtables. You can find out more about these settings and
how to configure them here.
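As an illustration, these are the kinds of knobs involved. The keys
below follow Flink's RocksDB options as I remember them (they may differ
between versions, so check the docs of your release), and the values are
only examples, not recommendations:

    # flink-conf.yaml
    state.backend: rocksdb
    state.backend.rocksdb.block.cache-size: 256mb
    state.backend.rocksdb.writebuffer.size: 64mb
    state.backend.rocksdb.writebuffer.count: 2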
Last but not least, you should not activate
taskmanager.memory.preallocate when running streaming jobs, because
streaming jobs currently don't use managed memory. By activating
preallocation, you would allocate memory for Flink's managed memory,
which reduces the available heap space.
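In flink-conf.yaml that simply means keeping the default:

    taskmanager.memory.preallocate: false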