date:20150223

Re: Stale Synchronous Parallel iterations in Flink

2015-02-23 Thread Kostas Tzoumas

Hi Nam-Luc, Several of your observations in the blog post are to the point. Iterations are already pipelined, and the distributed state that the delta iterations access can be possibly lifted to a parameter server API. We need to work a bit through the details on how fault tolerance and terminati

Re: Stale Synchronous Parallel iterations in Flink

2015-02-23 Thread Stephan Ewen

Hey Tran Nam-Luc! Great post with some really cool thoughts. I just posted this answer to your LinkedIN post. Greetings, Stephan = Nice post, very cool idea! Your understanding of Flink in that respect is really good. I had not heard of SSP before, bu

Re: [DISCUSS] Gelly iteration abstractions

2015-02-23 Thread Stephan Ewen

As a workaround, it should always work to get the Edge and Vertex data set from the graph and use the regular Fink iteration operators? On Sun, Feb 22, 2015 at 4:53 PM, Vasiliki Kalavri wrote: > Hi, > > yes, I was referring to the parallel Boruvka algorithm. There are several > ways to implemen

[DISCUSS] Iterative streaming example

2015-02-23 Thread Szabó Péter

Does everyone know of a good, simple and realistic streaming iteration example? The current example tests a random generator, but it should be replaced by something deterministic in order to be testable. Peter

Re: [DISCUSS] Iterative streaming example

2015-02-23 Thread Stephan Ewen

I think that the Samoa people have quite a few nice examples along the lines of model training with feedback. @Paris: What would be the simplest example? On Mon, Feb 23, 2015 at 11:27 AM, Szabó Péter wrote: > Does everyone know of a good, simple and realistic streaming iteration > example? The

Re: [DISCUSS] Gelly iteration abstractions

2015-02-23 Thread Vasiliki Kalavri

Hi Stephan, yes, this would work for the cases where an algorithm only updates the vertex values or only updates the edge values. What we would like to also support is (a) algorithms where both vertices and edges are updated in one iteration (b) algorithms where the graph structure changes from o

Re: [DISCUSS] Gelly iteration abstractions

2015-02-23 Thread Stephan Ewen

Some things may not work well as "closed-loop" iterations. Is it possible to express those as for-loop iterations? On Mon, Feb 23, 2015 at 1:03 PM, Vasiliki Kalavri wrote: > Hi Stephan, > > yes, this would work for the cases where an algorithm only updates the > vertex values or only updates th

[jira] [Created] (FLINK-1598) Give better error messages when serializers run out of space.

2015-02-23 Thread Stephan Ewen (JIRA)

Stephan Ewen created FLINK-1598: --- Summary: Give better error messages when serializers run out of space. Key: FLINK-1598 URL: https://issues.apache.org/jira/browse/FLINK-1598 Project: Flink Is

Re: [DISCUSS] Iterative streaming example

2015-02-23 Thread Paris Carbone

Hello Peter, Streaming machine learning algorithms make use of iterations quite widely. One simple example is implementing distributed stream learners. There, in many cases you need some central model aggregator, distributed estimators to offload the central node and of course feedback loops to

Re: [DISCUSS] Iterative streaming example

2015-02-23 Thread Szabó Péter

Nice. Thank you guys! @Paris Are there any Flink implementations of this model? The GitHub doc is quite general. Peter 2015-02-23 14:05 GMT+01:00 Paris Carbone : > Hello Peter, > > Streaming machine learning algorithms make use of iterations quite widely. > One simple example is implementing di

[jira] [Created] (FLINK-1599) TypeComperator

2015-02-23 Thread Max Michels (JIRA)

Max Michels created FLINK-1599: -- Summary: TypeComperator Key: FLINK-1599 URL: https://issues.apache.org/jira/browse/FLINK-1599 Project: Flink Issue Type: Bug Components: Distributed Ru

Re: [DISCUSS] Gelly iteration abstractions

2015-02-23 Thread Vasiliki Kalavri

for-loop iterations could cover some cases, I guess, when the number of iterations is known beforehand. Are there currently any restrictions on what can be used inside a for-loop? How are they translated into execution plans? On 23 February 2015 at 13:08, Stephan Ewen wrote: > Some things may no

[jira] [Created] (FLINK-1600) Failure when submitting a job leaves user code libraries in the BLOB manager

2015-02-23 Thread Stephan Ewen (JIRA)

Stephan Ewen created FLINK-1600: --- Summary: Failure when submitting a job leaves user code libraries in the BLOB manager Key: FLINK-1600 URL: https://issues.apache.org/jira/browse/FLINK-1600 Project: Fli

Re: [DISCUSS] Gelly iteration abstractions

2015-02-23 Thread Stephan Ewen

For loops are basically rolled out - they yield long execution plans. On Mon, Feb 23, 2015 at 2:44 PM, Vasiliki Kalavri wrote: > for-loop iterations could cover some cases, I guess, when the number of > iterations is known beforehand. > Are there currently any restrictions on what can be used in

Re: [DISCUSS] Gelly iteration abstractions

2015-02-23 Thread Stephan Ewen

Closed-loop iterations are much more efficient right now. Long for loops suffer from memory fragmentation (an issue that is in the list to fix). Also, closed loops can be stateful (delta iterations) and do not require task re-deployment. On Mon, Feb 23, 2015 at 4:15 PM, Vasiliki Kalavri wrote:

Re: [DISCUSS] Gelly iteration abstractions

2015-02-23 Thread Vasiliki Kalavri

I see that's cool :-) So, what is the advantage of closed-loop versus for-loop iterations? Custom convergence criteria / aggregators and more efficient execution plans? On 23 February 2015 at 15:01, Stephan Ewen wrote: > For loops are basically rolled out - they yield long execution plans. > > O

[jira] [Created] (FLINK-1601) Sometimes the YARNSessionFIFOITCase fails on Travis

2015-02-23 Thread Till Rohrmann (JIRA)

Till Rohrmann created FLINK-1601: Summary: Sometimes the YARNSessionFIFOITCase fails on Travis Key: FLINK-1601 URL: https://issues.apache.org/jira/browse/FLINK-1601 Project: Flink Issue Type:

Re: Stale Synchronous Parallel iterations in Flink

2015-02-23 Thread Nam-Luc Tran

Hello guys, Thank you for your replies. >(1) How to checkpoint such computations in order to recover them upon failures. Approximate snapshots are possible with SSP. In this case, a snapshot (solution set and or working set, what is exactly snapshotted can be discussed) is triggered once a "clock

Could not build up connection to JobManager

2015-02-23 Thread Dulaj Viduranga

I’m getting "Could not build up connection to JobManager.” When i tried to run the wordCount example. Can anyone help? Dulaj

Re: [DISCUSS] Gelly iteration abstractions

2015-02-23 Thread Vasiliki Kalavri

I see, thanks a lot for the answers! To rephrase my original question, would it make sense to define a closed-loop iteration where the state is the whole graph? If you want to take a look at the current implementation of DMST using delta iteration, Andra has made a PR [1]. On a high-level, this al

Re: Could not build up connection to JobManager

2015-02-23 Thread Robert Metzger

Hi, you said in the other email thread that the error only occurs for Wordcount, not for Kmeans. Can you copy me the commands for both examples? I can not really believe that there is a difference between the two jobs. Can you also send us the contents of the jobmanager log file? Best, Robert O

Re: Could not build up connection to JobManager

2015-02-23 Thread Dulaj Viduranga

Yes. It seams it is not a problem with the arguments. I tried two days but different error occurs. It seams the web client can’t connect to the job manager although it is running Right now, I can’t even get the webclient to run. ./bin/start-webclient.sh executes fine but I cannot connect to loca

Re: Could not build up connection to JobManager

2015-02-23 Thread Robert Metzger

Thank you for the quick reply. The log you've send is from the webclient. Can you also send the log of the JobManager? On Mon, Feb 23, 2015 at 7:28 PM, Dulaj Viduranga wrote: > Yes. It seams it is not a problem with the arguments. I tried two days but > different error occurs. It seams the web

Question about scheduling in 0.7

2015-02-23 Thread Nico Scherer

Hi all, we’re currently playing/messing around with scheduling in flink 0.7. We found out that if we run a single job with a certain degree of parallelism, multiple tasks/vertices are executed within a single task manager at the same time (or at least before the prior stage is switched to finis

Re: Question about scheduling in 0.7

2015-02-23 Thread Fabian Hueske

Hi Nico, yes, Flink runs tasks in threads. Each TaskManager runs in its own JVM, everything within a TaskManager is parallelized in threads. Since a TM can offer multiple slots, also tasks across slots run in the same JVM and in different threads. Flink has a pipelined processing model, which mea

Re: Question about scheduling in 0.7

2015-02-23 Thread Stephan Ewen

In addition, to what Fabian wrote: Yes, one slot can run multiple Tasks. In the batch API, one slot can run concurrently one task of each operation (for example one source, one mapper, one reducer, one sink). On Mon, Feb 23, 2015 at 10:38 PM, Fabian Hueske wrote: > Hi Nico, > > yes, Flink runs

Re: Could not build up connection to JobManager

2015-02-23 Thread Stephan Ewen

Hi Dulaj, That error message indicates that the JobManager is not running. Are you sure that the JobManager runs properly? Anything in the JobManager logs? BTW: The 0.9 branch is under heavy development / changes. That is why it may behave a bit different on different days right now. I would reco

[DISCUSS] Discourage using the same class names even though in different packages

2015-02-23 Thread Henry Saputra

Hi All, I am seeing some same class names, even though in different package names, that could confuse new contributors. One of the attractiveness of Spark that it is the code structure is simple to follow than Hadoop (or Hive for that matter). For example we have IntermediateResultPartition in bo

[jira] [Created] (FLINK-1602) Remove 0.6-incubating from Flink website SVN

2015-02-23 Thread Henry Saputra (JIRA)

Henry Saputra created FLINK-1602: Summary: Remove 0.6-incubating from Flink website SVN Key: FLINK-1602 URL: https://issues.apache.org/jira/browse/FLINK-1602 Project: Flink Issue Type: Task

[jira] [Created] (FLINK-1603) Update how to contribute doc to include information to send Github PR instead of attaching patch

2015-02-23 Thread Henry Saputra (JIRA)

Henry Saputra created FLINK-1603: Summary: Update how to contribute doc to include information to send Github PR instead of attaching patch Key: FLINK-1603 URL: https://issues.apache.org/jira/browse/FLINK-1603

Re: [DISCUSS] Discourage using the same class names even though in different packages

2015-02-23 Thread Henry Saputra

Just to be clear that I was not advocating flink to simplify the code just for the sake of clarity :) Flink has a lot to offer by providing simple APIs by hiding complexity to achieve performance. Which I think is one of the key differentiator compare to other general distributed processing platfo

Re: Could not build up connection to JobManager

2015-02-23 Thread Dulaj Viduranga

The JobManager seems to run fine. I don't know. When I tried to run start-local.sh again, It shows the PID of the running JobManager and also :8081 runs fine. I want to contribute to the project and I could get a little boost if I could see the capabilities of FLINK. :) Will it be OK to use 0.

Re: Stale Synchronous Parallel iterations in Flink

Re: Stale Synchronous Parallel iterations in Flink

Re: [DISCUSS] Gelly iteration abstractions

[DISCUSS] Iterative streaming example

Re: [DISCUSS] Iterative streaming example

Re: [DISCUSS] Gelly iteration abstractions

Re: [DISCUSS] Gelly iteration abstractions

[jira] [Created] (FLINK-1598) Give better error messages when serializers run out of space.

Re: [DISCUSS] Iterative streaming example

Re: [DISCUSS] Iterative streaming example

[jira] [Created] (FLINK-1599) TypeComperator

Re: [DISCUSS] Gelly iteration abstractions

[jira] [Created] (FLINK-1600) Failure when submitting a job leaves user code libraries in the BLOB manager

Re: [DISCUSS] Gelly iteration abstractions

Re: [DISCUSS] Gelly iteration abstractions

Re: [DISCUSS] Gelly iteration abstractions

[jira] [Created] (FLINK-1601) Sometimes the YARNSessionFIFOITCase fails on Travis

Re: Stale Synchronous Parallel iterations in Flink

Could not build up connection to JobManager

Re: [DISCUSS] Gelly iteration abstractions

Re: Could not build up connection to JobManager

Re: Could not build up connection to JobManager

Re: Could not build up connection to JobManager

Question about scheduling in 0.7

Re: Question about scheduling in 0.7

Re: Question about scheduling in 0.7

Re: Could not build up connection to JobManager

[DISCUSS] Discourage using the same class names even though in different packages

[jira] [Created] (FLINK-1602) Remove 0.6-incubating from Flink website SVN

[jira] [Created] (FLINK-1603) Update how to contribute doc to include information to send Github PR instead of attaching patch

Re: [DISCUSS] Discourage using the same class names even though in different packages

Re: Could not build up connection to JobManager

32 matches

Site Navigation

Mail list logo

Footer information