Github user fhueske commented on the pull request: https://github.com/apache/flink/pull/1208#issuecomment-146264681 It's a bit painful to review because Github is not showing the diff :-( Here is what I found in `streaming_guide.ml` until but not including `Specifying Keys`: Intro: - What are regular programs? - "The data streams are initially createdâ¦" -> "A data stream can be produced by different sources such as â¦" - What are distributed files? - Flink program -> DataStream program Linking with Flink: - To write programs with Flink -> To implement a DataStream program - If you want to add Flink to an existing -> If you want to add Flink's DataStream API to an ⦠Program Skeleton (Java and Scala): - Flink DataStream program -> DataStream Program - If you created a JAR file from you program -> If you created a JAR file from YOUR program - with your own custom transformation function -> with your own custom transformation FUNCTIONS - converting every String in the original set to an Integer. -> converting every String in the original STREAM to an Integer. DataStream abstraction: - unbounded immutable collection of data of a the same type -> unbounded immutable collection of data ITEMS of a the same type. ? Lazy Evaluation: - All Flink programs -> All DataStream programs Transformations (only checked Java, Scala probably the sameâ¦) - KeyBy - Logically partition a stream -> Logically PARTITIONS a stream - This transformations -> This transformation - Reduce + Fold - Combines the last element with the last reduced value and emits the new value. -> Combines the current element with the last reduced value and emits the new value. ? - Aggregations: - Rolling aggregations on keyed data stream. -> Rolling aggregations on A keyed data stream. - Window - Windows can be defined on already partitioned KeyedStreams -> Windows can be defined on partitioned KeyedStreams ? - Window Apply - Applies a general function to the window as a whole. -> Applies a general function to WINDOWS as a whole. ? - A function that manually sums the elements of a window -> Sentence is incomplete - Window Reduce / Aggregations - Point out difference to rolling reduce / aggregations? - Iterate - The resulting stream of an iteration is a DataStream again. The IterativeStream is only "within" the iteration, right? Shouldn't it be DataStream -> IteratativeStream -> DataStream then? - Extract Timestamps - Extract -> ExtractS - Flink windows -> windows
--- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---