[ 
https://issues.apache.org/jira/browse/FLINK-34342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hong Liang Teoh resolved FLINK-34342.
-------------------------------------
    Resolution: Fixed

> Address Shard Consistency Issue for DDB Streams Source
> ------------------------------------------------------
>
>                 Key: FLINK-34342
>                 URL: https://issues.apache.org/jira/browse/FLINK-34342
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Connectors / DynamoDB
>            Reporter: Danny Cranmer
>            Assignee: Abhishek Kumar
>            Priority: Major
>
> *Problem*
> We call 
> [DescribeStream|https://docs.aws.amazon.com/amazondynamodb/latest/APIReference/API_streams_DescribeStream.html]
>  with the ExclusiveStartShardId parameter set to the last seen shard ID. The 
> issue is that the API is eventually consistent, meaning if we call the API 
> multiple times we might get different results, for example: 
>  * Call 1: [A, C]
>  * Call 2: [A, B, C]
> Since we would set ExclusiveStartShardId to {{{}C{}}}, in the above example 
> the connector would miss shard {{B}}
> *Solution*
> We need to find a solution to support this gap. This could be to periodically 
> list all shards to find gaps and not start processing new shards until their 
> parents are complete. This feature does not need to be applied to the KDS 
> source.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to