[ 
https://issues.apache.org/jira/browse/FLINK-7195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16087227#comment-16087227
 ] 

ASF GitHub Bot commented on FLINK-7195:
---------------------------------------

GitHub user tzulitai opened a pull request:

    https://github.com/apache/flink/pull/4344

    (release-1.3) [FLINK-7195] [kafka] Remove partition list querying when 
restoring state in FlinkKafkaConsumer

    This issue is a re-appearance of FLINK-6006. On restore, we should not 
respect any fetched partitions list from Kafka and perform any filtering of the 
restored partition states. There are corner cases where, due to Kafka broker 
downtime, some partitions may be missing in the fetched partition list. 
Therefore, we should not respect the fetched partitions list on restore time to 
manipulate the restored state, which may lead to broken state. To be more 
precise, we actually should not require fetching partitions on restore.
    
    We've stepped on our own foot again and reintroduced this bug in 
ed68fedbe90db03823d75a020510ad3c344fa73e. This PR adds proper unit tests for 
this that does not rely on the internal implementations and test only on public 
abstractions of `FlinkKafkaConsumerBase`.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/tzulitai/flink FLINK-7195

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/flink/pull/4344.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #4344
    
----
commit 12af5d8b0e43b62935dc619258fb8f957b11d0bc
Author: Tzu-Li (Gordon) Tai <tzuli...@apache.org>
Date:   2017-07-14T11:51:03Z

    [FLINK-7195] [kafka] Remove partition list querying when restoring state in 
FlinkKafkaConsumer

----


> FlinkKafkaConsumer should not respect fetched partitions to filter restored 
> partition states
> --------------------------------------------------------------------------------------------
>
>                 Key: FLINK-7195
>                 URL: https://issues.apache.org/jira/browse/FLINK-7195
>             Project: Flink
>          Issue Type: Bug
>          Components: Kafka Connector
>    Affects Versions: 1.3.1
>            Reporter: Tzu-Li (Gordon) Tai
>            Assignee: Tzu-Li (Gordon) Tai
>            Priority: Blocker
>             Fix For: 1.3.2
>
>
> This issue is a re-appearance of FLINK-6006. On restore, we should not 
> respect any fetched partitions list from Kafka and perform any filtering of 
> the restored partition states. There are corner cases where, due to Kafka 
> broker downtime, some partitions may be missing in the fetched partition 
> list. To be more precise, we actually should not require fetching partitions 
> on restore.
> We've stepped on our own foot again and reintroduced this bug in 
> https://github.com/apache/flink/pull/3378/commits/ed68fedbe90db03823d75a020510ad3c344fa73e.
>  The previous test for this behavior was too implementation specific, and 
> therefore the leak in catching this on different internal implementations.
> We should have a proper unit test for this that does not rely on the internal 
> implementations and test only on public abstractions of 
> {{FlinkKafkaConsumerBase}}.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to