[ 
https://issues.apache.org/jira/browse/FLINK-1493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14316204#comment-14316204
 ] 

Gyula Fora commented on FLINK-1493:
-----------------------------------

Hey,
I implemented a simple indexed iterator from which you can get the channel 
index of the last record and also the total number of channels:
Please try it out at let me know if it is working properly I am not going to 
push to the master until then (I need to take a flight soon and can only get 
back to this on Friday).

Find the commit here:
https://github.com/mbalassi/flink/commit/ee3590ec4427aeca4bbde352199db972b47ee12d

The Jira issue:
https://issues.apache.org/jira/browse/FLINK-1517

On top of this you can implement the sorting operators I think by buffering up 
the inputs. Lets not push this logic down to the AbstractRecordReader and keep 
it at the operator. Thats the cleanest for now.


> Support for streaming jobs preserving global ordering of records
> ----------------------------------------------------------------
>
>                 Key: FLINK-1493
>                 URL: https://issues.apache.org/jira/browse/FLINK-1493
>             Project: Flink
>          Issue Type: New Feature
>          Components: Streaming
>            Reporter: Márton Balassi
>
> Distributed streaming jobs do not give total, global ordering guarantees for 
> records only partial ordering is provided by the system: records travelling 
> on the same exact route of the physical plan are ordered, but they aren't 
> between routes.
> It turns out that although this feature can only be implemented via "merge 
> sorting" in the input buffers on a timestamp field thus creating substantial 
> latency is still desired for a number of applications.
> Just a heads up for the implementation: the sorting introduces back pressure 
> in the buffers and might cause deadlocks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to