[ https://issues.apache.org/jira/browse/FLINK-22198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17401454#comment-17401454 ]
Qingsheng Ren commented on FLINK-22198: --------------------------------------- Thanks [~xtsong] for the reminder. I had a discussion with [~lindong] and we checked the log of the latest instance. These logs caught our attention: {code:java} 13:27:11,089 INFO [Log partition=key_full_value_topic_avro-0, dir=/var/lib/kafka/data] Found deletable segments with base offsets [0] due to retention time 604800000ms breach (kafka.log.Log) 13:27:11,101 INFO [ProducerStateManager partition=key_full_value_topic_avro-0] Writing producer snapshot at offset 3 (kafka.log.ProducerStateManager) 13:27:11,104 INFO [Log partition=key_full_value_topic_avro-0, dir=/var/lib/kafka/data] Rolled new log segment at offset 3 in 15 ms. (kafka.log.Log) 13:27:11,106 INFO [Log partition=key_full_value_topic_avro-0, dir=/var/lib/kafka/data] Scheduling segments for deletion List(LogSegment(baseOffset=0, size=233, lastModifiedTime=1629293231000, largestTime=1583845931123)) (kafka.log.Log) 13:27:11,107 INFO [Log partition=key_full_value_topic_avro-0, dir=/var/lib/kafka/data] Incrementing log start offset to 3 (kafka.log.Log) {code} Basically a retention time based log deletion was triggered, so messages written to Kafka is deleted, then the test case hanged because it cannot receive expected messages. These logs proves the hypothesis made by [~lindong] that the clock is skewed on producer side. We are still investigating the reason and will update the comment once we have progress. I don't have the permission to change the state of this ticket. [~xtsong] Could you help to change it to {{in-progress}}? Thanks~ > KafkaTableITCase hang. > ---------------------- > > Key: FLINK-22198 > URL: https://issues.apache.org/jira/browse/FLINK-22198 > Project: Flink > Issue Type: Bug > Components: Connectors / Kafka > Affects Versions: 1.14.0, 1.12.4 > Reporter: Guowei Ma > Assignee: Qingsheng Ren > Priority: Blocker > Labels: pull-request-available, stale-blocker, test-stability > Fix For: 1.14.0 > > > https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=16287&view=logs&j=c5f0071e-1851-543e-9a45-9ac140befc32&t=1fb1a56f-e8b5-5a82-00a0-a2db7757b4f5&l=6625 > There is no any artifacts. -- This message was sent by Atlassian Jira (v8.3.4#803005)