[jira] [Created] (FLINK-4197) Allow Kinesis Endpoint to be Overridden via Config

2016-07-11 Thread Scott Kidder (JIRA)
Scott Kidder created FLINK-4197: --- Summary: Allow Kinesis Endpoint to be Overridden via Config Key: FLINK-4197 URL: https://issues.apache.org/jira/browse/FLINK-4197 Project: Flink Issue Type: Im

[jira] [Created] (FLINK-4196) Remove "recoveryTimestamp"

2016-07-11 Thread Stephan Ewen (JIRA)
Stephan Ewen created FLINK-4196: --- Summary: Remove "recoveryTimestamp" Key: FLINK-4196 URL: https://issues.apache.org/jira/browse/FLINK-4196 Project: Flink Issue Type: Bug Components:

[jira] [Created] (FLINK-4195) Dedicated Configuration classes for Kinesis Consumer / Producer

2016-07-11 Thread Tzu-Li (Gordon) Tai (JIRA)
Tzu-Li (Gordon) Tai created FLINK-4195: -- Summary: Dedicated Configuration classes for Kinesis Consumer / Producer Key: FLINK-4195 URL: https://issues.apache.org/jira/browse/FLINK-4195 Project: Fl

[jira] [Created] (FLINK-4194) KinesisDeserializationSchema.isEndOfStream() is never called

2016-07-11 Thread Robert Metzger (JIRA)
Robert Metzger created FLINK-4194: - Summary: KinesisDeserializationSchema.isEndOfStream() is never called Key: FLINK-4194 URL: https://issues.apache.org/jira/browse/FLINK-4194 Project: Flink

Re: sampling function

2016-07-11 Thread Le Quoc Do
Hi all, Thank you all for your answers. By the way, I also recognized that Flink doesn't support "stratified sampling" function (only simple random sampling) for DataSet. It would be nice if someone can create a Jira for it, and assign the task to me so that I can work for it. Thank you, Do On

[jira] [Created] (FLINK-4193) Task manager JVM crashes while deploying cancelling jobs

2016-07-11 Thread Gyula Fora (JIRA)
Gyula Fora created FLINK-4193: - Summary: Task manager JVM crashes while deploying cancelling jobs Key: FLINK-4193 URL: https://issues.apache.org/jira/browse/FLINK-4193 Project: Flink Issue Type:

[jira] [Created] (FLINK-4192) Move Metrics API to separate module

2016-07-11 Thread Chesnay Schepler (JIRA)
Chesnay Schepler created FLINK-4192: --- Summary: Move Metrics API to separate module Key: FLINK-4192 URL: https://issues.apache.org/jira/browse/FLINK-4192 Project: Flink Issue Type: Improveme

[jira] [Created] (FLINK-4191) Expose shard information in KinesisDeserializationSchema

2016-07-11 Thread Robert Metzger (JIRA)
Robert Metzger created FLINK-4191: - Summary: Expose shard information in KinesisDeserializationSchema Key: FLINK-4191 URL: https://issues.apache.org/jira/browse/FLINK-4191 Project: Flink Issu

[jira] [Created] (FLINK-4190) Generalise RollingSink to work with arbitrary buckets

2016-07-11 Thread Josh Forman-Gornall (JIRA)
Josh Forman-Gornall created FLINK-4190: -- Summary: Generalise RollingSink to work with arbitrary buckets Key: FLINK-4190 URL: https://issues.apache.org/jira/browse/FLINK-4190 Project: Flink

Re: [DISCUSS] Move Documentation from Wiki to Documentation

2016-07-11 Thread Stephan Ewen
I would say that all user-relevant stuff should generally go to the docs. In that sense, I agree with you for most of the links you mentioned. The pages relevant to contributors only (like Hadoop Dependency shading) can be in the Wiki Greetings, Stephan On Fri, Jul 8, 2016 at 2:56 PM, Aljoscha

Re: sampling function

2016-07-11 Thread Vasiliki Kalavri
Hi Do, Paris and Martha worked on sampling techniques for data streams on Flink last year. If you want to implement your own samplers, you might find Martha's master thesis helpful [1]. -Vasia. [1]: http://kth.diva-portal.org/smash/get/diva2:910695/FULLTEXT01.pdf On 11 July 2016 at 11:31, Kosta

Re: sampling function

2016-07-11 Thread Kostas Kloudas
Hi Do, In DataStream you can always implement your own sampling function, hopefully without too much effort. Adding such functionality it to the API could be a good idea. But given that in sampling there is no “one-size-fits-all” solution (as not every use case needs random sampling and not al

[jira] [Created] (FLINK-4189) Introduce symbols for internal use

2016-07-11 Thread Timo Walther (JIRA)
Timo Walther created FLINK-4189: --- Summary: Introduce symbols for internal use Key: FLINK-4189 URL: https://issues.apache.org/jira/browse/FLINK-4189 Project: Flink Issue Type: Improvement