Re: Python API design doc?

2016-06-29 Thread Chesnay Schepler
Hello Daniel, there is neither Design Doc for the Python API nor for how it interacts the the DataSet API. If you have any questions feel free to contact me, I'll provide as much information as i can. Regards, Chesnay On 29.06.2016 22:53, Daniel Li wrote: I am evaluating the feasibility o

Re: IT Test

2016-06-29 Thread Vijay Srinivasaraghavan
Yes, I will take care of it as part other JIRA that I am working on. RegardsVijay On Wednesday, June 29, 2016 2:40 AM, Maximilian Michels wrote: Hi Vijay, Glad we solved the problem. Good catch with the FlinkYarnSessionCli. We should clean up the properties file after running the Yarn

Python API design doc?

2016-06-29 Thread Daniel Li
I am evaluating the feasibility of adding C# binding to Apache Flink. I wonder if anyone of you can share a design doc for Apache Python binding for Apache Flink? thx Daniel

Re: [Discussion] Query regarding Join and Windows

2016-06-29 Thread Vinay Patil
Hi, Yes , now I am getting clear with the concepts here. One last thing I want to try before going for custom trigger, I want to try Delta Trigger but I am not able to get the syntax right , this is how I am trying it : TypeInformation> typeInfo = TypeInformation.of(new TypeHint>() { }); // sourc

Web dashboard binaries and licensing

2016-06-29 Thread Till Rohrmann
Hi Flink community, while reviewing the LICENSE and NOTICE file of Apache Flink, I noticed that according to the LICENSE file Flink contains many java script files. However, tracking the corresponding files back was not so easy, because they are actually all merged into flink-runtime-web/web-dashb

Re: [Discuss] Why different job's tasks can run in the single process.

2016-06-29 Thread Aljoscha Krettek
Hi, yes, you are definitely right that allowing to run multiple user code tasks in the same TaskManager JVM is not good for stability. This mode is still there from the very early days of Flink where Yarn was not yet available. In a production environment I would now recommend to always run one Fli

Re: [Discussion] Query regarding Join and Windows

2016-06-29 Thread Aljoscha Krettek
Hi, you can use ingestion time if you don't care about the timestamps in your events, yes. If elements from the two streams happen to arrive at such times that they are not put into the same window then you won't get a match, correct. Regarding ingestion time and out-of-order events. I think this

[jira] [Created] (FLINK-4133) Reflect streaming file source changes in documentation

2016-06-29 Thread Robert Metzger (JIRA)
Robert Metzger created FLINK-4133: - Summary: Reflect streaming file source changes in documentation Key: FLINK-4133 URL: https://issues.apache.org/jira/browse/FLINK-4133 Project: Flink Issue

[jira] [Created] (FLINK-4132) Fix boxed comparison in CommunityDetection algorithm

2016-06-29 Thread Greg Hogan (JIRA)
Greg Hogan created FLINK-4132: - Summary: Fix boxed comparison in CommunityDetection algorithm Key: FLINK-4132 URL: https://issues.apache.org/jira/browse/FLINK-4132 Project: Flink Issue Type: Bug

Re: primitiveDefaultValue in CodeGenUtils in Table API

2016-06-29 Thread Cody Innowhere
Oh yes we've made a bit change on flink table API so that we can work on data streams like data sets, thus code-generation is called. On Wed, Jun 29, 2016 at 8:31 PM, Fabian Hueske wrote: > Hi Cody, > > > > Aggregations are currently not performed by code-generated user functions. > This would b

AW: primitiveDefaultValue in CodeGenUtils in Table API

2016-06-29 Thread Fabian Hueske
Hi Cody, Aggregations are currently not performed by code-generated user functions. This would be a good improvement though. Check the DataSetAggregate class to learn how aggregations are translated into Flink Dataset programs. Best, Fabian Von: Cody Innowhere

Re: [Discussion] Query regarding Join and Windows

2016-06-29 Thread Vinay Patil
Hi, Ok. Inside the checkAndGetNextWatermark(lastElement, extractedTimestamp) method both these parameters are coming same (timestamp value) , I was expecting last element timestamp value in the 1st param when I extract it. Lets say I decide to use IngestionTime (since I am getting accurate result

[jira] [Created] (FLINK-4131) Confusing error for out dated RequestPartitionState

2016-06-29 Thread Ufuk Celebi (JIRA)
Ufuk Celebi created FLINK-4131: -- Summary: Confusing error for out dated RequestPartitionState Key: FLINK-4131 URL: https://issues.apache.org/jira/browse/FLINK-4131 Project: Flink Issue Type: Imp

Re: [Discussion] Query regarding Join

2016-06-29 Thread Aljoscha Krettek
Hi, the element will be kept around indefinitely if no new watermark arrives. I think the same problem will persist for AssignerWithPunctuatedWatermarks since there you also might not get the required "last watermark" to trigger processing of the last window. Cheers, Aljoscha On Wed, 29 Jun 2016

Re: [Discussion] Query regarding Join

2016-06-29 Thread Vinay Patil
Hi Aljoscha, This clears a lot of doubts now. So now lets say the stream paused for a while or it stops completely on Friday , let us assume that the last message did not get processed and is kept in the internal buffers. So when the stream starts again on Monday , will it consider the last eleme

Re: IT Test

2016-06-29 Thread Maximilian Michels
Hi Vijay, Glad we solved the problem. Good catch with the FlinkYarnSessionCli. We should clean up the properties file after running the Yarn tests. If you want to fix that, you could open a pull request. Cheers, Max On Wed, Jun 29, 2016 at 1:34 AM, Vijay Srinivasaraghavan wrote: > Hi Max, > Yo

Re: [Discussion] Query regarding Join

2016-06-29 Thread Aljoscha Krettek
Hi, the reason why the last element might never be emitted is the way the ascending timestamp extractor works. I'll try and explain with an example. Let's say we have a window size of 2 milliseconds, elements arrive starting with timestamp 0, window begin timestamp is inclusive, end timestamp is e

[jira] [Created] (FLINK-4130) CallGenerator could generate illegal code when taking no operands

2016-06-29 Thread Cody (JIRA)
Cody created FLINK-4130: --- Summary: CallGenerator could generate illegal code when taking no operands Key: FLINK-4130 URL: https://issues.apache.org/jira/browse/FLINK-4130 Project: Flink Issue Type: Bu

Re: primitiveDefaultValue in CodeGenUtils in Table API

2016-06-29 Thread Cody Innowhere
@Fabian, given the example that a user wants to sum a column whose value might be null, with null check enabled. In this case, when the column value is null, -1 is set instead? Am I understanding correctly? On Wed, Jun 29, 2016 at 3:28 PM, Timo Walther wrote: > Hi Cody, > > default values are n

Re: primitiveDefaultValue in CodeGenUtils in Table API

2016-06-29 Thread Timo Walther
Hi Cody, default values are needed in cases where NULL values are not supported. This happens if the null check is disabled in TableConfig for efficiency reasons. Using 0 to DataType.MAX_VALUE for numeric types and -1 as a NULL equivalent in special cases seems more reasonable to me. Hope th

[Discuss] Why different job's tasks can run in the single process.

2016-06-29 Thread Longda Feng
hi , Sorry for asking the quest here? Any answer will be apprecated. Why different job's tasks can run in the single process. (There are some different job's tasks  in one TaskManager).It seems Flink-on-Yarn can let different job  run on different process. But for standalone mode, this problem s