Hi, * How do you ensure that there are no concurrent updates on the keys? [Prasad]: The cache for which it is failing is kind of bootstrap cache which changes very rarely. I made sure that I was the only one working on this system while debugging the issue. The cache for which it is failing is REPLICATED cache. Read through is enabled and Write through is disabled. Whenever I get an update message for these caches from different system, I update an entry in my cache using following steps. 1. First Remove an entry from the cache using cache.remove() method. 2. Read the entry from cache using cache().get method, which reads the data from oracle db using read through approach.
* How many retry attempts did you run? [Prasad] I retried the transaction 8-10 times. * Are your caches in-memory with eviction policy? [Prasad] Yes, caches are in-memory but without eviction policy. I am using Oracle DB as a third party persistence. Ignite native persistence is disabled. * Do you have TTL enabled for either of the caches? [Prasad]: No, I have not set TTL. * Do you have a 3rd-party persistence and read-through and write-through enabled for either of the caches? [Prasad]: Yes, I have 3rd Party persistence. I have read-through caches but for all read-through caches write through is disabled. The write-through is disabled for some caches as I am not the owner of these tables. I also have Write through caches but for all such caches read-through is disabled. At this moment I do not have any cache where read-through and write-through both are enabled. I reload the all my caches using cache loaders. * Can you check if the issue reproduces if you set -DIGNITE_READ_LOAD_BALANCING=false system property? [Prasad]: Sure will try to reproduce this using this parameter. But the problem is this happens intermittently. As per the following code, serReadVer is the GridCache version of transaction co-ordinator node which it compares with grid cache version of other nodes. So as per your explanation, nodeOrder is unique number assigned to each node joins the grid. So each node in cluster will have a different nodeOrder. If this is the case then "serReadVer.equals(ver)" will always return false. Please correct me if I am wrong. I just trying to understand the code. This will help me to identify the issue. *public boolean checkSerializableReadVersion(GridCacheVersion serReadVer) throws GridCacheEntryRemovedException { lockEntry(); try { checkObsolete();* * if (!serReadVer.equals(ver)) { boolean empty = isStartVersion() || deletedUnlocked();* * if (serReadVer.equals(IgniteTxEntry.SER_READ_EMPTY_ENTRY_VER)) return empty; else if (serReadVer.equals(IgniteTxEntry.SER_READ_NOT_EMPTY_VER)) return !empty; return false; } return true; } finally { unlockEntry(); } }* Thanks, Prasad On Fri, Feb 28, 2020 at 2:24 PM Alexey Goncharuk <alexey.goncha...@gmail.com> wrote: > Prasad, > > >> Can you please answer following questions? >> 1) The significance of the nodeOrder w.r.t Grid and cache? >> > Node order is a unique integer assigned to a node when the node joins > grid. The node order is included into GridCacheVersion to disambiguate > versions generated on different nodes that happen to have the same local > version order. > >> 2) When does it change? >> > Node order does not change during the node lifetime. If two versions have > different node order, it means that the versions were generated on > different nodes. > >> 3) How it is important w.r.t. transaction? >> > GridCacheVersion is used to detect concurrent read-write conflicts as I > described in the previous message, as well as for data rebalance. > >> 4) Inside transaction I am reading and modifying Replicated as well as >> Partitioned cache. What I observed is this fails for Replicated cache. As >> workaround, I have moved the code which reads Replicated cache out of >> transaction block. Is it allowed to read and modify both replicated and >> Partitioned cache i.e. use both Replicated and Partitioned? >> > Yes, this is perfectly fine to update replicated and transactional caches > inside one transaction. > > From the debug output that you provided we can infer that the version of > both entries have changed for both caches before transaction prepare phase. > I would back up Alexei here: > * How do you ensure that there are no concurrent updates on the keys? > * How many retry attempts did you run? > * Are your caches in-memory with eviction policy? > * Do you have TTL enabled for either of the caches? > * Do you have a 3rd-party persistence and read-through and write-through > enabled for either of the caches? > * Can you check if the issue reproduces if you set > -DIGNITE_READ_LOAD_BALANCING=false system property? > > --AG >