Hi,

I am running into an Ignite (Ignite ver. 2.10.0) critical failure triggered
by high write load. This is the error summary:

[04:11:11,605][SEVERE][db-checkpoint-thread-#72][] JVM will be halted
immediately due to the failure: [failureCtx=FailureContext
[type=CRITICAL_ERROR, err=class o.a.i.IgniteCheckedException: Compound
exception for CountDownFuture.]]

The more detailed exception:
[04:11:11,435][INFO][db-checkpoint-thread-#72][Checkpointer] Checkpoint
started [checkpointId=251fa396-1611-416f-a569-c93c1e8f6c84,
startPtr=WALPointer [idx=8, fileOff=13437451, len=40871],
checkpointBeforeLockTime=193ms, checkpointLockWait=6ms,
checkpointListenersExecuteTime=33ms, checkpointLockHoldTime=45ms,
walCpRecordFsyncDuration=16ms, writeCheckpointEntryDuration=17ms,
splitAndSortCpPagesDuration=109ms, pages=76628, reason='too big size of WAL
without checkpoint']
[04:11:11,470][SEVERE][db-checkpoint-thread-#72][] Critical system error
detected. Will be handled accordingly to configured handler
[hnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0,
super=AbstractFailureHandler [ignoredFailureTypes=UnmodifiableSet
[SYSTEM_WORKER_BLOCKED, SYSTEM_CRITICAL_OPERATION_TIMEOUT]]],
failureCtx=FailureContext [type=CRITICAL_ERROR, err=class
o.a.i.IgniteCheckedException: Compound exception for CountDownFuture.]]
class org.apache.ignite.IgniteCheckedException: Compound exception for
CountDownFuture.
        at
org.apache.ignite.internal.util.future.CountDownFuture.addError(CountDownFuture.java:72)
        at
org.apache.ignite.internal.util.future.CountDownFuture.onDone(CountDownFuture.java:46)
        at
org.apache.ignite.internal.util.future.CountDownFuture.onDone(CountDownFuture.java:28)
        at
org.apache.ignite.internal.util.future.GridFutureAdapter.onDone(GridFutureAdapter.java:478)
        at
org.apache.ignite.internal.processors.cache.persistence.checkpoint.CheckpointPagesWriter.run(CheckpointPagesWriter.java:166)
        at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)
        Suppressed: class org.apache.ignite.IgniteException: errno: -1
                at
org.apache.ignite.internal.processors.compress.NativeFileSystemLinux.punchHole(NativeFileSystemLinux.java:122)
                at
org.apache.ignite.internal.processors.compress.FileSystemUtils.punchHole(FileSystemUtils.java:125)
                at
org.apache.ignite.internal.processors.cache.persistence.file.AsyncFileIO.punchHole(AsyncFileIO.java:93)


Some background of what I am doing:
* I am using data streamer to write ~1GB of data into a single Ignite node
(laptop) with persistence enabled. Everything was working fine until I
enabled disk compression (zstd level 3, 8KB page size). After I enabled disk
compression I get the above exception. 
* I tried enabling/disabling writeThrottlingEnabled but it did not help. 
* I turned WAL archive off and it did not help.
* I increased checkpointPageBufferSize from default 256MB to 1GB and that
delayed the exception until further into the upload but the exception still
throws eventually.







--
Sent from: http://apache-ignite-users.70518.x6.nabble.com/

Reply via email to