[ https://issues.apache.org/jira/browse/BEAM-14171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17512140#comment-17512140 ]
Robert Bradshaw commented on BEAM-14171:
----------------------------------------
Yes, the PR is a fix (and a simple one at that). It could be a pretty major
problem as well. I think it makes sense to get it in.
> CoGroupByKey loses values with large groups on Dataflow v1
> ----------------------------------------------------------
>
> Key: BEAM-14171
> URL: https://issues.apache.org/jira/browse/BEAM-14171
> Project: Beam
> Issue Type: Bug
> Components: runner-dataflow, sdk-java-core
> Affects Versions: 2.36.0, 2.37.0
> Reporter: Niel Markwick
> Assignee: Robert Bradshaw
> Priority: P1
> Fix For: 2.38.0
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> CoGroupByKey can lose elements, replacing them with null values, when a group
> is large (>10,000 elements).
>
> This only occurs on the Dataflow v1 runner, not on the Dataflow v2 runner.
> Possibly related to BEAM-13541.
>
> https://lists.apache.org/thread/5y56kbgm3q0m1byzf7186rrkomrcfldm
>
--
This message was sent by Atlassian Jira
(v8.20.1#820001)