Re: A small Project I've been working on

2015-04-01 Thread Aljoscha Krettek
Right now, runtime is roughly thrice that of equivalent Java programs. But I plan on bringing that to the same ballpark using code generation. On Wed, Apr 1, 2015 at 8:54 AM, Fabian Hueske wrote: > :-D > This is awesome! > > Do you have some performance numbers? > On Apr 1, 2015 8:43 AM, "Aljosch

Re: A small Project I've been working on

2015-04-01 Thread Márton Balassi
Woot! On Wed, Apr 1, 2015 at 9:01 AM, Aljoscha Krettek wrote: > Right now, runtime is roughly thrice that of equivalent Java programs. > But I plan on bringing that to the same ballpark using code > generation. > > On Wed, Apr 1, 2015 at 8:54 AM, Fabian Hueske wrote: > > :-D > > This is awesome

Re: A small Project I've been working on

2015-04-01 Thread Ufuk Celebi
Little side projects ftw. Very nice :-) Can you give some pointers on how this works internally? Is it making use of anything generic from the Python API pull request? On Wednesday, April 1, 2015, Márton Balassi wrote: > Woot! > > On Wed, Apr 1, 2015 at 9:01 AM, Aljoscha Krettek > > wrote: > > >

Re: A small Project I've been working on

2015-04-01 Thread Till Rohrmann
Where can I start contributing? I've got the feeling that it will be inherently faster than everything else out there. On Apr 1, 2015 9:18 AM, "Ufuk Celebi" wrote: > Little side projects ftw. Very nice :-) > > Can you give some pointers on how this works internally? Is it making use of > a

Re: A small Project I've been working on

2015-04-01 Thread fhueske
My Ruby skills are a bit rust(y) but I’d love to contribute. Can you point me to a repository that I can fork? From: Till Rohrmann Sent: Wednesday, 1. April, 2015 09:29 To: dev@flink.apache.org Where can I start contributing? I've got the feeling that it will be inherently fa

Re: A small Project I've been working on

2015-04-01 Thread Gyula Fóra
Amazing! :) On Wed, Apr 1, 2015 at 9:41 AM, wrote: > My Ruby skills are a bit rust(y) but I’d love to contribute. > > > Can you point me to a repository that I can fork? > > > > > > > From: Till Rohrmann > Sent: Wednesday, 1. April, 2015 09:29 > To: dev@flink.apache.org > > > > > > Whe

Re: A small Project I've been working on

2015-04-01 Thread Stephan Ewen
It's April 1st, right? On Wed, Apr 1, 2015 at 9:47 AM, Gyula Fóra wrote: > Amazing! :) > > On Wed, Apr 1, 2015 at 9:41 AM, wrote: > > > My Ruby skills are a bit rust(y) but I’d love to contribute. > > > > > > Can you point me to a repository that I can fork? > > > > > > > > > > > > > > From: Til

Re: A small Project I've been working on

2015-04-01 Thread Stephan Ewen
For the Ruby interpreter on Flink, I suggest implementing this in Rust. To implement a system that analyzes data at "web scale", what would be a better language than one used to implement the engine of a "web browser"? On Wed, Apr 1, 2015 at 10:05 AM, Stephan Ewen wrote: > It's April 1st, right?

Re: [DISCUSS] Inconsistent naming of intermediate results

2015-04-01 Thread Ufuk Celebi
To summarize so far: all are in favor of a rename. I agree with both of Henry's points regarding the docs. @Stephan: what would you suggest? I would trust your gut feeling on this one. ;) JobResult, ExecutionJobResult, ExecutionResult, etc.? On Tue, Mar 31, 2015 at 8:16 PM, Henry Saputra wrote:

Re: [DISCUSS] Gelly iteration abstractions

2015-04-01 Thread Stephan Ewen
Wow, Vasia, a GC limit error in the Optimizer (pre-flight), not the runtime. That cannot have been easy to produce ;-) The program does not really look that tricky; I need to take a closer look at this... BTW: Before this algorithm runs well, we really need pinning intermediate results in m

Re: Question about Infinite Streaming Job on Mini Cluster and ITCase

2015-04-01 Thread Márton Balassi
Hey Matthias, Thanks for reporting the exception; we had not prepared for this use case yet. Gyula and I fixed it, and he is pushing the fix right now: When the job is cancelled (for example due to shutting down the executor underneath) you should not see that InterruptedException as s

Re: A small Project I've been working on

2015-04-01 Thread Maximilian Michels
I'm in love with the API. So innovative, and it feels very Rubyesque, even though I'm a Python lover. Your WordCount shows some really nice features which we should port to the Flink Java and Scala APIs, perhaps by embedding a little Ruby interpreter. Great idea also to port it to Node.js. Together w

Make docs searchable

2015-04-01 Thread Ufuk Celebi
Hey all, I think our documentation has grown to a point where we need to think about how to make it more accessible. I would like to add a custom Google search limited to the docs *excluding* the API docs (otherwise the results are very noisy). Mocks: http://tinypic.com/view.php?pic=2qtlg1k&s=8

Re: Make docs searchable

2015-04-01 Thread Robert Metzger
Cool. I would like to have the ability to search the docs, so +1 for this idea! On Wed, Apr 1, 2015 at 12:10 PM, Ufuk Celebi wrote: > Hey all, > > I think our documentation has grown to a point where we need to think about > how to make it more accessible. > > I would like to add a custom Google

Re: Make docs searchable

2015-04-01 Thread Till Rohrmann
I also like the idea. +1 On Wed, Apr 1, 2015 at 12:20 PM, Robert Metzger wrote: > Cool. I would like to have the ability to search the docs, so +1 for this > idea! > > On Wed, Apr 1, 2015 at 12:10 PM, Ufuk Celebi wrote: > > > Hey all, > > > > I think our documentation has grown to a point where

Re: Make docs searchable

2015-04-01 Thread Stephan Ewen
big +1 On Wed, Apr 1, 2015 at 12:25 PM, Till Rohrmann wrote: > I also like the idea. +1 > > On Wed, Apr 1, 2015 at 12:20 PM, Robert Metzger > wrote: > > > Cool. I would like to have the ability to search the docs, so +1 for this > > idea! > > > > On Wed, Apr 1, 2015 at 12:10 PM, Ufuk Celebi wr

Re: Question about Infinite Streaming Job on Mini Cluster and ITCase

2015-04-01 Thread Matthias J. Sax
Hi, I will pull the fix and try it out. Thanks for the hint with the extra Thread. That should work for me. But you are actually right; my setup is Storm inspired. I think it's a very natural way to deploy and stop an infinite streaming job. Maybe you want to adopt it. The ITCase I am writi

Re: HBase TableOutputFormat fix (Flink 0.8.1)

2015-04-01 Thread Flavio Pompermaier
Any feedback about this? On Tue, Mar 31, 2015 at 7:07 PM, Flavio Pompermaier wrote: > Hi Flink devs, > this is my final report about the HBaseOutputFormat problem (with Flink > 0.8.1) and I hope you can suggest the best way to make a PR: > > 1) The following code produces the error reported

Re: Make docs searchable

2015-04-01 Thread Maximilian Michels
+1 Nice idea! We would just have to filter the /api/java and /api/scala path.
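
The filtering mentioned here can be pictured with a minimal sketch. It assumes the docs are addressed by URL path and that the two API doc trees live under exactly `/api/java` and `/api/scala`. The function name `is_searchable` and the sample paths are hypothetical and not part of any Flink site tooling.

```python
# Hypothetical sketch: decide which doc paths a site search should index.
# The two prefixes come from the message above; everything else is assumed.
EXCLUDED_PREFIXES = ("/api/java", "/api/scala")


def is_searchable(path: str) -> bool:
    """Return True unless the path is an excluded tree or lies below one."""
    return not any(
        path == prefix or path.startswith(prefix + "/")
        for prefix in EXCLUDED_PREFIXES
    )


# Made-up sample paths, for illustration only.
sample_paths = [
    "/setup/quickstart.html",
    "/api/java/index.html",
    "/api/scala/Foo.html",
    "/api/javascript.html",
]
searchable = [p for p in sample_paths if is_searchable(p)]
```

Matching on the prefix plus a trailing slash keeps an unrelated page such as `/api/javascript.html` in the results, which a bare `startswith("/api/java")` check would drop.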

Re: HBase TableOutputFormat fix (Flink 0.8.1)

2015-04-01 Thread Fabian Hueske
As I said before, I think the configure() method of the original HadoopOutputFormat should be called in the configure() method of the Flink HadoopOutputFormatBase. Flink calls configure() before open() and finalizeOnMaster(), so that should work. Have you checked if that fixes your problem? If yes

Re: HBase TableOutputFormat fix (Flink 0.8.1)

2015-04-01 Thread Maximilian Michels
Hi Flavio, Thanks for looking into this problem. Actually, it's a bit difficult to discuss your changes here because of the formatting/syntax highlighting and missing context of the classes. Usually, we do that in a pull request. Do you have a GitHub account? If so, push your changes to your forke

Re: Make docs searchable

2015-04-01 Thread Fabian Hueske
+1! Would also be good if we could make the documentation less monolithic. There are some really large pages which would benefit from an in-page search ;-) 2015-04-01 13:57 GMT+02:00 Maximilian Michels : > +1 Nice idea! > > We would just have to filter the /api/java and /api/scala path. >

Re: HBase TableOutputFormat fix (Flink 0.8.1)

2015-04-01 Thread Flavio Pompermaier
Ok..I'd like to have this fix in the next release. Should I branch Flink 0.8.1 or 0.9 or which version? On Wed, Apr 1, 2015 at 2:04 PM, Maximilian Michels wrote: > Hi Flavio, > > Thanks for looking into this problem. Actually, it's a bit difficult to > discuss your changes here because of the fo

Re: HBase TableOutputFormat fix (Flink 0.8.1)

2015-04-01 Thread Maximilian Michels
Just base your changes on the current master. On Wed, Apr 1, 2015 at 2:12 PM, Flavio Pompermaier wrote: > Ok..I'd like to have this fix in the next release. Should I branch Flink > 0.8.1 or 0.9 or which version? > > On Wed, Apr 1, 2015 at 2:04 PM, Maximilian Michels wrote: > > > Hi Flavio, > >

Re: HBase TableOutputFormat fix (Flink 0.8.1)

2015-04-01 Thread Fabian Hueske
Whatever works best for you. We can easily backport or forward-port the patch. 2015-04-01 14:12 GMT+02:00 Flavio Pompermaier : > Ok..I'd like to have this fix in the next release. Should I branch Flink > 0.8.1 or 0.9 or which version? > > On Wed, Apr 1, 2015 at 2:04 PM, Maximilian Michels wrote:

Re: Question about Infinite Streaming Job on Mini Cluster and ITCase

2015-04-01 Thread Stephan Ewen
As a follow-up - I think it would be a good thing to add a way to gracefully stop a streaming job. Something that sends "close" to the sources, and they quit. We can use this for graceful shutdown when re-partitioning / scaling in or out, ... On Wed, Apr 1, 2015 at 1:29 PM, Matthias J. Sax < mj..

[jira] [Created] (FLINK-1810) UpdateTask message contains unused information

2015-04-01 Thread Ufuk Celebi (JIRA)
Ufuk Celebi created FLINK-1810: Summary: UpdateTask message contains unused information Key: FLINK-1810 URL: https://issues.apache.org/jira/browse/FLINK-1810 Project: Flink Issue Type: Improvem

Re: Question about Infinite Streaming Job on Mini Cluster and ITCase

2015-04-01 Thread Matthias J. Sax
I agree. @Marton: The idea with the extra thread does not work, because the method JobClient.submitJobAndWait(...) does not return normally if ForkableFlinkMiniCluster.shutdown() is called -- instead an exception occurs: > Exception in thread "Thread-8" java.lang.RuntimeException: > org.apache.

[jira] [Created] (FLINK-1811) Allow users to pass custom buffer sizes and timeouts to PersistentKafkaSource

2015-04-01 Thread Robert Metzger (JIRA)
Robert Metzger created FLINK-1811: Summary: Allow users to pass custom buffer sizes and timeouts to PersistentKafkaSource Key: FLINK-1811 URL: https://issues.apache.org/jira/browse/FLINK-1811 Project

Re: [DISCUSS] Inconsistent naming of intermediate results

2015-04-01 Thread Maximilian Michels
+1 for the renaming proposed by Ufuk. @Stephan: At the moment, the IntermediateDataSet is tied to a JobVertex. So the renaming makes sense. In the future, it might be constructed differently. Only then, JobVertexResult wouldn't make sense anymore. I'm not sure if that will even happen. 4) Result

[jira] [Created] (FLINK-1812) TaskManager shell scripts restart JVMs on process failures

2015-04-01 Thread Stephan Ewen (JIRA)
Stephan Ewen created FLINK-1812: Summary: TaskManager shell scripts restart JVMs on process failures Key: FLINK-1812 URL: https://issues.apache.org/jira/browse/FLINK-1812 Project: Flink Issue

Re: [QUESTION] In DataExchangeMode.BATCH, how is the consumer notified to start consuming the data?

2015-04-01 Thread Henry Saputra
Ah cool. Thanks for the explanation, Ufuk. Really helpful. - Henry On Tuesday, March 31, 2015, Ufuk Celebi wrote: > Hey Henry, > > 1) There is no extra message, but this is piggy backed with the finished > state transition (see Execution#markAsFinished). There it is essentially > the same mecha

Re: Make docs searchable

2015-04-01 Thread Henry Saputra
+1 Superbly needed :) On Wednesday, April 1, 2015, Ufuk Celebi wrote: > Hey all, > > I think our documentation has grown to a point where we need to think about > how to make it more accessible. > > I would like to add a custom Google search limited to the docs *excluding* > the API docs (other

Re: [DISCUSS] Gelly iteration abstractions

2015-04-01 Thread Vasiliki Kalavri
Hey, > Wow, Vasia, a GC limit error in the Optimizer (pre-flight), not the runtime. > > That cannot have been easy to produce ;-) The program does not really > look that tricky, I need to take a closer look at this... > It was pretty easy actually :P The program is really simple: in each ite

[jira] [Created] (FLINK-1813) Barrier broadcast sometimes causes illegal state exception in source RecordWriter

2015-04-01 Thread Gyula Fora (JIRA)
Gyula Fora created FLINK-1813: Summary: Barrier broadcast sometimes causes illegal state exception in source RecordWriter Key: FLINK-1813 URL: https://issues.apache.org/jira/browse/FLINK-1813 Project: Fl

[jira] [Created] (FLINK-1814) Revisit the documentation to add new operator

2015-04-01 Thread Henry Saputra (JIRA)
Henry Saputra created FLINK-1814: Summary: Revisit the documentation to add new operator Key: FLINK-1814 URL: https://issues.apache.org/jira/browse/FLINK-1814 Project: Flink Issue Type: Bug