Yes. The same goes for defaultTxTimeout, for similar reasons.

Frankly, these are terrible defaults. There is a ticket to fix the latter: 
https://issues.apache.org/jira/browse/IGNITE-13064
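
For reference, a minimal sketch of setting both timeouts on the server nodes, assuming the Java configuration API (TransactionConfiguration in ignite-core). The class name and the timeout values are hypothetical and only illustrate the shape of the configuration; both properties default to 0, which means no timeout.

```java
import org.apache.ignite.configuration.IgniteConfiguration;
import org.apache.ignite.configuration.TransactionConfiguration;

public class TxTimeoutConfig {
    // Hypothetical values for illustration only; tune them for your workload.
    static final long DEFAULT_TX_TIMEOUT_MS = 30_000L;
    static final long TX_TIMEOUT_ON_PME_MS = 10_000L;

    /** Builds a server node configuration with non-zero transaction timeouts. */
    static IgniteConfiguration serverConfig() {
        TransactionConfiguration txCfg = new TransactionConfiguration();

        // Timeout applied to transactions that do not specify their own timeout.
        txCfg.setDefaultTxTimeout(DEFAULT_TX_TIMEOUT_MS);

        // Timeout applied to transactions that are blocking a pending
        // partition map exchange; they are rolled back once it expires.
        txCfg.setTxTimeoutOnPartitionMapExchange(TX_TIMEOUT_ON_PME_MS);

        IgniteConfiguration cfg = new IgniteConfiguration();
        cfg.setTransactionConfiguration(txCfg);
        return cfg;
    }

    public static void main(String[] args) {
        // Only prints the configured values; no node is started here.
        TransactionConfiguration txCfg = serverConfig().getTransactionConfiguration();
        System.out.println("defaultTxTimeout=" + txCfg.getDefaultTxTimeout());
        System.out.println("txTimeoutOnPartitionMapExchange="
            + txCfg.getTxTimeoutOnPartitionMapExchange());
    }
}
```

The resulting configuration would be passed to Ignition.start(cfg) on each server node. With these set, clients should expect some transactions to be rolled back during PME and retry them.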

> On 3 Nov 2021, at 11:59, Sumit Deshinge <sumit.deshi...@broadcom.com> wrote:
> 
> Hi Ilya,
> 
> I have tried to run this with both Ignite 2.10 and 2.11.
> There are 24 thin clients on different VMs which are trying to perform CRUD 
> operations on the same and/or different caches in parallel.
> If I set the TxTimeoutOnPartitionMapExchange property to some value, then the 
> transactions started by the thin clients time out. If this property is 
> not set, then the transactions neither time out nor get rolled back, and in 
> this case the cluster becomes unresponsive.
> 
> Is it advisable to always have TxTimeoutOnPartitionMapExchange set to some 
> value?
> 
> On Wed, Nov 3, 2021 at 7:17 AM Ilya Korol <llivezk...@gmail.com> wrote:
> Hi Sumit,
> 
> Which Ignite version do you use?
> AFAIK partition map exchange is a kind of "stop the world" activity for 
> the cluster, so any other actions with the cluster (like cache creation) 
> would be suspended until PME ends.
> If all of your clients concurrently try to create the same cache, it's OK 
> that some transactions are rolled back, but if your cluster became 
> unresponsive after that, this looks like a bug, so you can submit a JIRA 
> ticket with steps to reproduce this issue.
> 
> But if you experience a deadlock, this looks like a bug.
> 
> On 2021/11/02 13:45:49 Sumit Deshinge wrote:
>  > Hi,
>  >
>  > I have an Apache Ignite cluster of 3 Ignite servers and more than 20 Ignite
>  > thin clients (each thin client being on a separate VM). These thin clients
>  > are trying to create caches in parallel at approximately the same time and
>  > then start cache CRUD operations.
>  >
>  > It looks like the partition map exchange process and cache CRUD operations
>  > running in parallel are causing deadlocks or lock acquisition failures.
>  >
>  > *What should be the strategy to handle this scenario?*
>  >
>  > The Ignite server has the errors below:
>  >
>  > *Exception stack trace 1:*
>  >
>  > WARNING: Dumping the near node thread that started transaction
>  > [xidVer=GridCacheVersion [topVer=247332659, order=1635852705217,
>  > nodeOrder=1], nodeId=2735bef0-7404-41e3-843f-7043490c9d84]
>  > Stack trace of the transaction owner thread:
>  > Thread [name="client-connector-#56%perf-dn1%", id=93, state=WAITING,
>  > blockCnt=5023, waitCnt=36165]
>  > at sun.misc.Unsafe.park(Native Method)
>  > at java.util.concurrent.locks.LockSupport.park(LockSupport.java:304)
>  > at o.a.i.i.util.future.GridFutureAdapter.get0(GridFutureAdapter.java:178)
>  > at o.a.i.i.util.future.GridFutureAdapter.get(GridFutureAdapter.java:141)
>  > at o.a.i.i.processors.cache.GridCacheAdapter$41.op(GridCacheAdapter.java:3430)
>  > at o.a.i.i.processors.cache.GridCacheAdapter$41.op(GridCacheAdapter.java:3423)
>  > at o.a.i.i.processors.cache.GridCacheAdapter.syncOp(GridCacheAdapter.java:4480)
>  > at o.a.i.i.processors.cache.GridCacheAdapter.remove0(GridCacheAdapter.java:3423)
>  > at o.a.i.i.processors.cache.GridCacheAdapter.remove(GridCacheAdapter.java:3405)
>  > at o.a.i.i.processors.cache.GridCacheAdapter.remove(GridCacheAdapter.java:3388)
>  > at o.a.i.i.processors.cache.IgniteCacheProxyImpl.remove(IgniteCacheProxyImpl.java:1438)
>  > at o.a.i.i.processors.cache.GatewayProtectedCacheProxy.remove(GatewayProtectedCacheProxy.java:964)
>  > at o.a.i.i.processors.platform.client.cache.ClientCacheRemoveKeyRequest.process(ClientCacheRemoveKeyRequest.java:41)
>  > at o.a.i.i.processors.platform.client.ClientRequestHandler.handle(ClientRequestHandler.java:77)
>  > at o.a.i.i.processors.odbc.ClientListenerNioListener.onMessage(ClientListenerNioListener.java:204)
>  > at o.a.i.i.processors.odbc.ClientListenerNioListener.onMessage(ClientListenerNioListener.java:55)
>  > at o.a.i.i.util.nio.GridNioFilterChain$TailFilter.onMessageReceived(GridNioFilterChain.java:279)
>  > at o.a.i.i.util.nio.GridNioFilterAdapter.proceedMessageReceived(GridNioFilterAdapter.java:109)
>  > at o.a.i.i.util.nio.GridNioAsyncNotifyFilter$3.body(GridNioAsyncNotifyFilter.java:97)
>  > at o.a.i.i.util.worker.GridWorker.run(GridWorker.java:120)
>  > at o.a.i.i.util.worker.GridWorkerPool$1.run(GridWorkerPool.java:70)
>  > at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>  > at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>  > at java.lang.Thread.run(Thread.java:748)
>  >
>  > *Exception stack trace 2:*
>  >
>  > WARNING: >>> Transaction [startTime=11:39:27.214,
>  > curTime=11:40:36.277, systemTime=0, userTime=69063, tx=GridNearTxLocal
>  > [mappings=IgniteTxMappingsImpl [], nearLocallyMapped=false,
>  > colocatedLocallyMapped=false, needCheckBackup=null,
>  > hasRemoteLocks=false, trackTimeout=false, systemTime=44700,
>  > systemStartTime=0, prepareStartTime=0, prepareTime=0,
>  > commitOrRollbackStartTime=0, commitOrRollbackTime=0, lb=null,
>  > mvccOp=null, qryId=-1, crdVer=0,
>  > thread=client-connector-#57%perf-dn1%, mappings=IgniteTxMappingsImpl
>  > [], super=GridDhtTxLocalAdapter [nearOnOriginatingNode=false,
>  > span=o.a.i.i.processors.tracing.NoopSpan@4a931268,
>  > nearNodes=KeySetView [], dhtNodes=KeySetView [], explicitLock=false,
>  > super=IgniteTxLocalAdapter [completedBase=null,
>  > sndTransformedVals=false, depEnabled=false, txState=IgniteTxStateImpl
>  > [activeCacheIds=[], recovery=null, mvccEnabled=null,
>  > mvccCachingCacheIds=[], txMap=EmptySet []], super=IgniteTxAdapter
>  > [xidVer=GridCacheVersion [topVer=247332659, order=1635852705226,
>  > nodeOrder=1], writeVer=null, implicit=false, loc=true, threadId=95,
>  > startTime=1635853167214, nodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > isolation=REPEATABLE_READ, concurrency=PESSIMISTIC, timeout=0,
>  > sysInvalidate=false, sys=false, plc=2, commitVer=null,
>  > finalizing=NONE, invalidParts=null, state=SUSPENDED, timedOut=false,
>  > topVer=AffinityTopologyVersion [topVer=-1, minorTopVer=0],
>  > mvccSnapshot=null, skipCompletedVers=false, parentTx=null,
>  > duration=69079ms, onePhaseCommit=false], size=0]]]]
>  > Nov 2, 2021 11:40:36 AM org.apache.ignite.logger.java.JavaLogger warning
>  > WARNING: First 10 long running cache futures [total=16]
>  > Nov 2, 2021 11:40:36 AM org.apache.ignite.logger.java.JavaLogger warning
>  > WARNING: >>> Future [startTime=11:39:27.324, curTime=11:40:36.277,
>  > fut=GridDhtLockFuture
>  > [span=o.a.i.i.processors.tracing.NoopSpan@4a931268,
>  > nearNodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > nearLockVer=GridCacheVersion [topVer=247332659, order=1635852705208,
>  > nodeOrder=1], topVer=AffinityTopologyVersion [topVer=1,
>  > minorTopVer=162], threadId=124,
>  > futId=be58a60ec71-1d64903c-c700-4deb-bace-cc5158713120,
>  > lockVer=GridCacheVersion [topVer=247332659, order=1635852705208,
>  > nodeOrder=1], read=false, err=null, timedOut=false, timeout=0,
>  > tx=GridNearTxLocal [mappings=IgniteTxMappingsImpl []dNearNodes=null,
>  > ownerVer=GridCacheVersion [topVer=247332659, order=1635852705211,
>  > nodeOrder=1], serOrder=null, key=KeyCacheObjectImpl [part=244,
>  > val=data=6ff0c60ec71-625345be-9a91-497a-895e-abbe5df9da3d],
>  > hasValBytes=true],
>  > masks=local=1|owner=0|ready=1|reentry=0|used=0|tx=1|single_implicit=0|dht_local=1|near_local=0|removed=0|read=0,
>  > prevVer=null, nextVer=null]], rmts=null]], flags=3]]], prepared=0,
>  > locked=false, nodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > locMapped=false, expiryPlc=null, transferExpiryPlc=false, flags=2,
>  > partUpdateCntr=0, serReadVer=null, xidVer=GridCacheVersion
>  > [topVer=247332659, order=1635852705208, nodeOrder=1]]]],
>  > super=IgniteTxAdapter [xidVer=GridCacheVersion [topVer=247332659,
>  > order=1635852705208, nodeOrder=1], writeVer=null, implicit=false,
>  > loc=true, threadId=124, startTime=1635853167214,
>  > nodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > isolation=REPEATABLE_READ, concurrency=PESSIMISTIC, timeout=0,
>  > sysInvalidate=false, sys=false, plc=2, commitVer=null,
>  > finalizing=NONE, invalidParts=null, state=ACTIVE, timedOut=false,
>  > topVer=AffinityTopologyVersion [topVer=1, minorTopVer=162],
>  > mvccSnapshot=null, skipCompletedVers=false, parentTx=null,
>  > duration=69094ms, onePhaseCommit=false], size=1]],
>  > nearLocallyMapped=false, colocatedLocallyMapped=true,
>  > needCheckBackup=null, hasRemoteLocks=false, trackTimeout=false,
>  > systemTime=75000, systemStartTime=971108549857700, prepareStartTime=0,
>  > prepareTime=0, commitOrRollbackStartTime=0, commitOrRollbackTime=0,
>  > lb=null, mvccOp=null, qryId=-1, crdVer=0,
>  > thread=client-connector-#84%perf-dn1%, mappings=IgniteTxMappingsImpl
>  > []dNearNodes=null, ownerVer=GridCacheVersion [topVer=247332659,
>  > order=1635852705211, nodeOrder=1], serOrder=null,
>  > key=KeyCacheObjectImpl [part=244,
>  > val=data=6ff0c60ec71-625345be-9a91-497a-895e-abbe5df9da3d],
>  > hasValBytes=true],
>  > masks=local=1|owner=0|ready=1|reentry=0|used=0|tx=1|single_implicit=0|dht_local=1|near_local=0|removed=0|read=0,
>  > prevVer=null, nextVer=null]], rmts=null]], flags=3]]], prepared=0,
>  > locked=false, nodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > locMapped=false, expiryPlc=null, transferExpiryPlc=false, flags=2,
>  > partUpdateCntr=0, serReadVer=null, xidVer=GridCacheVersion
>  > [topVer=247332659, order=1635852705208, nodeOrder=1]]]],
>  > super=IgniteTxAdapter [xidVer=GridCacheVersion [topVer=247332659,
>  > order=1635852705208, nodeOrder=1], writeVer=null, implicit=false,
>  > loc=true, threadId=124, startTime=1635853167214,
>  > nodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > isolation=REPEATABLE_READ, concurrency=PESSIMISTIC, timeout=0,
>  > sysInvalidate=false, sys=false, plc=2, commitVer=null,
>  > finalizing=NONE, invalidParts=null, state=ACTIVE, timedOut=false,
>  > topVer=AffinityTopologyVersion [topVer=1, minorTopVer=162],
>  > mvccSnapshot=null, skipCompletedVers=false, parentTx=null,
>  > duration=69094ms, onePhaseCommit=false], size=1]],
>  > super=GridDhtTxLocalAdapter [nearOnOriginatingNode=false,
>  > span=o.a.i.i.processors.tracing.NoopSpan@4a931268,
>  > nearNodes=KeySetView [], dhtNodes=KeySetView [], explicitLock=false,
>  > super=IgniteTxLocalAdapter [completedBase=null,
>  > sndTransformedVals=false, depEnabled=false, txState=IgniteTxStateImpl
>  > [activeCacheIds=[585748697], recovery=false, mvccEnabled=false,
>  > mvccCachingCacheIds=[], txMap=ArrayList [IgniteTxEntry
>  > [txKey=IgniteTxKey [key=KeyCacheObjectImpl [part=244,
>  > val=data=6ff0c60ec71-625345be-9a91-497a-895e-abbe5df9da3d],
>  > hasValBytes=true], cacheId=585748697], val=TxEntryValueHolder
>  > [val=null, op=DELETE], prevVal=TxEntryValueHolder [val=null, op=NOOP],
>  > oldVal=TxEntryValueHolder [val=null, op=NOOP],
>  > entryProcessorsCol=null, ttl=-1, conflictExpireTime=-1,
>  > conflictVer=null, explicitVer=null, dhtVer=null,
>  > filters=CacheEntryPredicate[] [], filtersPassed=false,
>  > filtersSet=true, entry=GridDhtCacheEntry [rdrs=ReaderId[] [],
>  > part=244, super=GridDistributedCacheEntry [super=GridCacheMapEntry
>  > [key=KeyCacheObjectImpl [part=244,
>  > val=data=6ff0c60ec71-625345be-9a91-497a-895e-abbe5df9da3d],
>  > hasValBytes=true], val=null, ver=GridCacheVersion [topVer=247332659,
>  > order=1635852705229, nodeOrder=1], hash=1085684290,
>  > extras=GridCacheMvccEntryExtras [mvcc=GridCacheMvcc [locs=LinkedList
>  > [GridCacheMvccCandidate [nodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > ver=GridCacheVersion [topVer=247332659, order=1635852705207,
>  > nodeOrder=1], threadId=122, id=2104, topVer=AffinityTopologyVersion
>  > [topVer=1, minorTopVer=162], reentry=null,
>  > otherNodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > otherVer=GridCacheVersion [topVer=247332659, order=1635852705207,
>  > nodeOrder=1], mappedDhtNodes=null, mappedNearNodes=null,
>  > ownerVer=GridCacheVersion [topVer=247332659, order=1635852705211,
>  > nodeOrder=1], serOrder=null, key=KeyCacheObjectImpl [part=244,
>  > val=data=6ff0c60ec71-625345be-9a91-497a-895e-abbe5df9da3d],
>  > hasValBytes=true],
>  > masks=local=1|owner=1|ready=1|reentry=0|used=0|tx=1|single_implicit=0|dht_local=1|near_local=0|removed=0|read=0,
>  > prevVer=null, nextVer=null], GridCacheMvccCandidate
>  > [nodeId=2735bef0-7404-41e3-843f-7043490c9d84, ver=GridCacheVersion
>  > [topVer=247332659, order=1635852705208, nodeOrder=1], threadId=124,
>  > id=2102, topVer=AffinityTopologyVersion [topVer=1, minorTopVer=162],
>  > reentry=null, otherNodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > otherVer=GridCacheVersion [topVer=247332659, order=1635852705208,
>  > nodeOrder=1], mappedDhtNodes=null, mappedNearNodes=null,
>  > ownerVer=GridCacheVersion [topVer=247332659, order=1635852705211,
>  > nodeOrder=1], serOrder=null, key=KeyCacheObjectImpl [part=244,
>  > val=data=6ff0c60ec71-625345be-9a91-497a-895e-abbe5df9da3d],
>  > hasValBytes=true],
>  > masks=local=1|owner=0|ready=1|reentry=0|used=0|tx=1|single_implicit=0|dht_local=1|near_local=0|removed=0|read=0,
>  > prevVer=null, nextVer=null], GridCacheMvccCandidate
>  > [nodeId=2735bef0-7404-41e3-843f-7043490c9d84, ver=GridCacheVersion
>  > [topVer=247332659, order=1635852705213, nodeOrder=1], threadId=122,
>  > id=2120, topVer=AffinityTopologyVersion [topVer=1, minorTopVer=162],
>  > reentry=null, otherNodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > otherVer=GridCacheVersion [topVer=247332659, order=1635852705213,
>  > nodeOrder=1], mappedDhtNodes=null, mappedNearNodes=null,
>  > ownerVer=GridCacheVersion [topVer=247332659, order=1635852705207,
>  > nodeOrder=1], serOrder=null, key=KeyCacheObjectImpl [part=244,
>  > val=data=6ff0c60ec71-625345be-9a91-497a-895e-abbe5df9da3d],
>  > hasValBytes=true],
>  > masks=local=1|owner=0|ready=1|reentry=0|used=0|tx=1|single_implicit=0|dht_local=1|near_local=0|removed=0|read=0,
>  > prevVer=null, nextVer=null], GridCacheMvccCandidate
>  > [nodeId=2735bef0-7404-41e3-843f-7043490c9d84, ver=GridCacheVersion
>  > [topVer=247332659, order=1635852705214, nodeOrder=1], threadId=123,
>  > id=2118, topVer=AffinityTopologyVersion [topVer=1, minorTopVer=162],
>  > reentry=null, otherNodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > otherVer=GridCacheVersion [topVer=247332659, order=1635852705214,
>  > nodeOrder=1], mappedDhtNodes=null, mappedNearNodes=null,
>  > ownerVer=GridCacheVersion [topVer=247332659, order=1635852705207,
>  > nodeOrder=1], serOrder=null, key=KeyCacheObjectImpl [part=244,
>  > val=data=6ff0c60ec71-625345be-9a91-497a-895e-abbe5df9da3d],
>  > hasValBytes=true],
>  > masks=local=1|owner=0|ready=1|reentry=0|used=0|tx=1|single_implicit=0|dht_local=1|near_local=0|removed=0|read=0,
>  > prevVer=null, nextVer=null], GridCacheMvccCandidate
>  > [nodeId=2735bef0-7404-41e3-843f-7043490c9d84, ver=GridCacheVersion
>  > [topVer=247332659, order=1635852705217, nodeOrder=1], threadId=93,
>  > id=2108, topVer=AffinityTopologyVersion [topVer=1, minorTopVer=162],
>  > reentry=null, otherNodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > otherVer=GridCacheVersion [topVer=247332659, order=1635852705217,
>  > nodeOrder=1], mappedDhtNodes=null, mappedNearNodes=null,
>  > ownerVer=GridCacheVersion [topVer=247332659, order=1635852705211,
>  > nodeOrder=1], serOrder=null, key=KeyCacheObjectImpl [part=244,
>  > val=data=6ff0c60ec71-625345be-9a91-497a-895e-abbe5df9da3d],
>  > hasValBytes=true],
>  > masks=local=1|owner=0|ready=1|reentry=0|used=0|tx=1|single_implicit=0|dht_local=1|near_local=0|removed=0|read=0,
>  > prevVer=null, nextVer=null], GridCacheMvccCandidate
>  > [nodeId=2735bef0-7404-41e3-843f-7043490c9d84, ver=GridCacheVersion
>  > [topVer=247332659, order=1635852705218, nodeOrder=1], threadId=115,
>  > id=2106, topVer=AffinityTopologyVersion [topVer=1, minorTopVer=162],
>  > reentry=null, otherNodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > otherVer=GridCacheVersion [topVer=247332659, order=1635852705218,
>  > nodeOrder=1], mappedDhtNodes=null, mappedNearNodes=null,
>  > ownerVer=GridCacheVersion [topVer=247332659, order=1635852705211,
>  > nodeOrder=1], serOrder=null, key=KeyCacheObjectImpl [part=244,
>  > val=data=6ff0c60ec71-625345be-9a91-497a-895e-abbe5df9da3d],
>  > hasValBytes=true],
>  > masks=local=1|owner=0|ready=1|reentry=0|used=0|tx=1|single_implicit=0|dht_local=1|near_local=0|removed=0|read=0,
>  > prevVer=null, nextVer=null], GridCacheMvccCandidate
>  > [nodeId=2735bef0-7404-41e3-843f-7043490c9d84, ver=GridCacheVersion
>  > [topVer=247332659, order=1635852705222, nodeOrder=1], threadId=95,
>  > id=2110, topVer=AffinityTopologyVersion [topVer=1, minorTopVer=162],
>  > reentry=null, otherNodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > otherVer=GridCacheVersion [topVer=247332659, order=1635852705222,
>  > nodeOrder=1], mappedDhtNodes=null, mappedNearNodes=null,
>  > ownerVer=GridCacheVersion [topVer=247332659, order=1635852705211,
>  > nodeOrder=1], serOrder=null, key=KeyCacheObjectImpl [part=244,
>  > val=data=6ff0c60ec71-625345be-9a91-497a-895e-abbe5df9da3d],
>  > hasValBytes=true],
>  > masks=local=1|owner=0|ready=1|reentry=0|used=0|tx=1|single_implicit=0|dht_local=1|near_local=0|removed=0|read=0,
>  > prevVer=null, nextVer=null], GridCacheMvccCandidate
>  > [nodeId=2735bef0-7404-41e3-843f-7043490c9d84, ver=GridCacheVersion
>  > [topVer=247332659, order=1635852705223, nodeOrder=1], threadId=120,
>  > id=2112, topVer=AffinityTopologyVersion [topVer=1, minorTopVer=162],
>  > reentry=null, otherNodeId=2735bef0-7404-41e3-843f-7043490c9d84,
>  > otherVer=GridCacheVersion [topVer=247332659, order=1635852705223,
>  > nodeOrder=1], mappedDhtNodes=null, mappedNearNodes=null,
>  > ownerVer=GridCacheVersion [topVer=247332659, order=1635852705211,
>  > nodeOrder=1], serOrder=null, key=KeyCacheObjectImpl [part=244,
>  > val=data=6ff0c60ec71-625345be-9a91-497a-895e-abbe5df9da3d],
>  > hasValBytes=true],
>  > 
> masks=local=1|owner=0|ready=1|reentry=0|used=0|tx=1|single_implicit=0|dht_loca
> [message truncated...]
> 
> 
> -- 
> Sumit Deshinge
> R&D Engineer | Symantec Enterprise Division
> Broadcom Software
> Email: Sumit Deshinge <mailto:sumit.deshi...@broadcom.com>
> 
> 

