Danny Cranmer created FLINK-19898:
-------------------------------------

             Summary: [Kinesis][EFO] Ignore ReadTimeoutException from 
SubcribeToShard retry policy
                 Key: FLINK-19898
                 URL: https://issues.apache.org/jira/browse/FLINK-19898
             Project: Flink
          Issue Type: Improvement
          Components: Connectors / Kinesis
            Reporter: Danny Cranmer
             Fix For: 1.12.0


*Background* 

The Flink Kinesis EFO consumer has a {{SubscribeToShard}} retry policy which 
will terminate the job after a given number of subsequent attempt failures. In 
high backpressure scenarios the Netty HTTP Client throws a 
{{ReadTimeoutException}} when the consumer takes longer than 30s to process a 
batch. If this happens (by default) 10 times in a row, the job will terminate. 
There is no need to terminate in this condition, and the restart results in the 
job falling further behind.

*Scope*

Exclude the {{ReadTimeoutException}} from the {{SubscribeToShard}} retry 
policy, such that that connector will gracefully reconnect once the consumer 
has processed the queued records.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to