Hi, I am running into an Ignite (Ignite ver. 2.10.0) critical failure triggered by high write load. This is the error summary:
[04:11:11,605][SEVERE][db-checkpoint-thread-#72][] JVM will be halted immediately due to the failure: [failureCtx=FailureContext [type=CRITICAL_ERROR, err=class o.a.i.IgniteCheckedException: Compound exception for CountDownFuture.]] The more detailed exception: [04:11:11,435][INFO][db-checkpoint-thread-#72][Checkpointer] Checkpoint started [checkpointId=251fa396-1611-416f-a569-c93c1e8f6c84, startPtr=WALPointer [idx=8, fileOff=13437451, len=40871], checkpointBeforeLockTime=193ms, checkpointLockWait=6ms, checkpointListenersExecuteTime=33ms, checkpointLockHoldTime=45ms, walCpRecordFsyncDuration=16ms, writeCheckpointEntryDuration=17ms, splitAndSortCpPagesDuration=109ms, pages=76628, reason='too big size of WAL without checkpoint'] [04:11:11,470][SEVERE][db-checkpoint-thread-#72][] Critical system error detected. Will be handled accordingly to configured handler [hnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0, super=AbstractFailureHandler [ignoredFailureTypes=UnmodifiableSet [SYSTEM_WORKER_BLOCKED, SYSTEM_CRITICAL_OPERATION_TIMEOUT]]], failureCtx=FailureContext [type=CRITICAL_ERROR, err=class o.a.i.IgniteCheckedException: Compound exception for CountDownFuture.]] class org.apache.ignite.IgniteCheckedException: Compound exception for CountDownFuture. at org.apache.ignite.internal.util.future.CountDownFuture.addError(CountDownFuture.java:72) at org.apache.ignite.internal.util.future.CountDownFuture.onDone(CountDownFuture.java:46) at org.apache.ignite.internal.util.future.CountDownFuture.onDone(CountDownFuture.java:28) at org.apache.ignite.internal.util.future.GridFutureAdapter.onDone(GridFutureAdapter.java:478) at org.apache.ignite.internal.processors.cache.persistence.checkpoint.CheckpointPagesWriter.run(CheckpointPagesWriter.java:166) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748) Suppressed: class org.apache.ignite.IgniteException: errno: -1 at org.apache.ignite.internal.processors.compress.NativeFileSystemLinux.punchHole(NativeFileSystemLinux.java:122) at org.apache.ignite.internal.processors.compress.FileSystemUtils.punchHole(FileSystemUtils.java:125) at org.apache.ignite.internal.processors.cache.persistence.file.AsyncFileIO.punchHole(AsyncFileIO.java:93) Some background of what I am doing: * I am using data streamer to write ~1GB of data into a single Ignite node (laptop) with persistence enabled. Everything was working fine until I enabled disk compression (zstd level 3, 8KB page size). After I enabled disk compression I get the above exception. * I tried enabling/disabling writeThrottlingEnabled but it did not help. * I turned WAL archive off and it did not help. * I increased checkpointPageBufferSize from default 256MB to 1GB and that delayed the exception until further into the upload but the exception still throws eventually. -- Sent from: http://apache-ignite-users.70518.x6.nabble.com/