Coincidentally, I was looking at the same issue in Python yesterday which
also doesn't use a watermark estimator. For Python at least, the same
problem exists on Dataflow.

I agree that we should be providing a watermark estimate as part of
periodic sequences for both Java and Python. I was planning on adding
manual watermark estimation to Python once I worked through some other
issues with the transform, I'm happy to review a similar change in Java
(assuming nobody with more context has objections)

Thanks,
Danny

On Tue, Sep 20, 2022 at 5:15 AM Jan Lukavský <[email protected]> wrote:

> Hi,
>
> looking into the code of PeriodicSequence (which is used by
> PeriodicImpulse) it seems it uses SDF, but does not update downstream
> watermark using WatermarkEstimator in the @ProcessElement method. That
> seems to produce outputs only after the Pipeline terminates (at least it
> seems to work this way on Flink runner). Is this bug in the
> PeriodicImpulse (would be fixed by adding ManualWatermarkEstimator,
> possibly), or in the Flink's SDF implementation? I suspect it is the
> first one, because I cannot imagine how would the runner infer the
> watermark without being explicitly told by the SDF.
>
> Should this be fixed in the PeriodicSequence?
>
>   Jan
>
>

Reply via email to