adixitconfluent commented on code in PR #17739:
URL: https://github.com/apache/kafka/pull/17739#discussion_r1841165543


##########
core/src/main/java/kafka/server/share/DelayedShareFetch.java:
##########
@@ -90,39 +90,50 @@ public void onExpiration() {
      */
     @Override
     public void onComplete() {
+        // We are utilizing lock so that onComplete doesn't do a dirty read for global variables -
+        // partitionsAcquired and partitionsAlreadyFetched, since these variables can get updated in a different tryComplete thread.
+        lock.lock();
         log.trace("Completing the delayed share fetch request for group {}, member {}, "
             + "topic partitions {}", shareFetchData.groupId(), shareFetchData.memberId(),
             partitionsAcquired.keySet());
 
-        if (shareFetchData.future().isDone())
-            return;
+        try {
+            if (shareFetchData.future().isDone())
+                return;

Review Comment:
   So, if 2 different keys correspond to a ShareFetch request, a
`checkAndComplete` call for one of those keys can complete the share fetch
request. When the purgatory entry for the other key later times out or has
`checkAndComplete` triggered and the code reaches `onComplete`, the share fetch
request's future is already complete, so `shareFetchData.future().isDone()`
evaluates to true and `onComplete` returns early.
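
To illustrate the scenario, here is a minimal standalone sketch, not the actual `DelayedShareFetch` code. The class `DelayedOpSketch`, its `responsesBuilt` counter, and the key names are hypothetical and exist only for this example. It assumes one operation object is shared by two watch keys, so `onComplete` can be reached once per key.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical stand-in for a delayed operation that is watched under several keys.
class DelayedOpSketch {
    private final ReentrantLock lock = new ReentrantLock();
    private final CompletableFuture<String> future = new CompletableFuture<>();
    // Counts how many times the completion work actually ran.
    private final AtomicInteger responsesBuilt = new AtomicInteger();

    // Reached once per watch key whose completion check or timeout fires.
    void onComplete(String triggeringKey) {
        lock.lock();
        try {
            if (future.isDone()) {
                // Another key already completed the request: nothing left to do.
                return;
            }
            responsesBuilt.incrementAndGet();
            future.complete("completed via " + triggeringKey);
        } finally {
            lock.unlock();
        }
    }

    CompletableFuture<String> future() {
        return future;
    }

    int responsesBuilt() {
        return responsesBuilt.get();
    }

    public static void main(String[] args) {
        DelayedOpSketch op = new DelayedOpSketch();
        op.onComplete("key-1"); // first key completes the request
        op.onComplete("key-2"); // second key finds the future done and returns early
        System.out.println(op.future().join());  // completed via key-1
        System.out.println(op.responsesBuilt()); // 1
    }
}
```

In this sketch the second call does no work because the shared future is already done, which is the role the `isDone()` check plays in the hunk above.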


