[ https://issues.apache.org/jira/browse/FLINK-2779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14947223#comment-14947223 ]
ASF GitHub Bot commented on FLINK-2779: --------------------------------------- Github user fhueske commented on the pull request: https://github.com/apache/flink/pull/1208#issuecomment-146264681 It's a bit painful to review because Github is not showing the diff :-( Here is what I found in `streaming_guide.ml` until but not including `Specifying Keys`: Intro: - What are regular programs? - "The data streams are initially created…" -> "A data stream can be produced by different sources such as …" - What are distributed files? - Flink program -> DataStream program Linking with Flink: - To write programs with Flink -> To implement a DataStream program - If you want to add Flink to an existing -> If you want to add Flink's DataStream API to an … Program Skeleton (Java and Scala): - Flink DataStream program -> DataStream Program - If you created a JAR file from you program -> If you created a JAR file from YOUR program - with your own custom transformation function -> with your own custom transformation FUNCTIONS - converting every String in the original set to an Integer. -> converting every String in the original STREAM to an Integer. DataStream abstraction: - unbounded immutable collection of data of a the same type -> unbounded immutable collection of data ITEMS of a the same type. ? Lazy Evaluation: - All Flink programs -> All DataStream programs Transformations (only checked Java, Scala probably the same…) - KeyBy - Logically partition a stream -> Logically PARTITIONS a stream - This transformations -> This transformation - Reduce + Fold - Combines the last element with the last reduced value and emits the new value. -> Combines the current element with the last reduced value and emits the new value. ? - Aggregations: - Rolling aggregations on keyed data stream. -> Rolling aggregations on A keyed data stream. - Window - Windows can be defined on already partitioned KeyedStreams -> Windows can be defined on partitioned KeyedStreams ? - Window Apply - Applies a general function to the window as a whole. -> Applies a general function to WINDOWS as a whole. ? - A function that manually sums the elements of a window -> Sentence is incomplete - Window Reduce / Aggregations - Point out difference to rolling reduce / aggregations? - Iterate - The resulting stream of an iteration is a DataStream again. The IterativeStream is only "within" the iteration, right? Shouldn't it be DataStream -> IteratativeStream -> DataStream then? - Extract Timestamps - Extract -> ExtractS - Flink windows -> windows > Update documentation to reflect new Stream/Window API > ----------------------------------------------------- > > Key: FLINK-2779 > URL: https://issues.apache.org/jira/browse/FLINK-2779 > Project: Flink > Issue Type: Sub-task > Components: Streaming > Reporter: Aljoscha Krettek > Assignee: Kostas Tzoumas > Fix For: 0.10 > > -- This message was sent by Atlassian JIRA (v6.3.4#6332)