Re: DeserializationSchema isEndOfStream usage?

2016-01-21 Thread David Kim
Hi Robert! Thanks for reaching out. I ran into an issue and wasn't sure if this was due to a misconfiguration on my end of if this is a real bug. I have one DataStream and I'm sinking to two different kafka sinks. When the job starts, I run into this error: org.apache.flink.runtime.client.JobExec

Re: Actual byte-streams in multiple-node pipelines

2016-01-21 Thread Tal Maoz
Thanks Stephan and Fabian! You make very valuable points! This really helps steer me in the right direction! It would take some more careful planning and implementing the components you suggested but hopefully it will work in the end... Thanks, Tal On Thu, Jan 21, 2016 at 11:20 AM, Fabian Huesk

Re: Results of testing Flink quickstart against 0.10-SNAPSHOT and 1.0-SNAPSHOT (re. Dependency on non-existent org.scalamacros:quasiquotes_2.11:)

2016-01-21 Thread Prez Cannady
Apologies for the late reply; been on the road. I’ve been blowing away my Maven and Gradle repos for each test to make sure I’m pulling down the latest snapshots. I’ve also looked into the source jars directly from the snapshot repo: 1.0-SNAPSHOT: https://repository.apache.org/content/reposit

Re: Cannot start FlinkMiniCluster.WebServer using different port in FlinkMiniCluster

2016-01-21 Thread Till Rohrmann
Great to hear :-) On Thu, Jan 21, 2016 at 4:55 PM, HungChang wrote: > After adding the dependency it totally works! Thank you a lot! > > > > -- > View this message in context: > http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Cannot-start-FlinkMiniCluster-WebServer-using-diff

Re: Cannot start FlinkMiniCluster.WebServer using different port in FlinkMiniCluster

2016-01-21 Thread HungChang
After adding the dependency it totally works! Thank you a lot! -- View this message in context: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Cannot-start-FlinkMiniCluster-WebServer-using-different-port-in-FlinkMiniCluster-tp4414p4455.html Sent from the Apache Flink User M

Re: Cannot start FlinkMiniCluster.WebServer using different port in FlinkMiniCluster

2016-01-21 Thread Till Rohrmann
Could you add flink-runtime-web to your dependencies of your project? It seems as if it is missing in your project. Cheers, Till ​ On Thu, Jan 21, 2016 at 4:45 PM, HungChang wrote: > The following message is obtained after putting > BasicConfigurator.configure() > in main(); > But I don't under

Re: Cannot start FlinkMiniCluster.WebServer using different port in FlinkMiniCluster

2016-01-21 Thread HungChang
The following message is obtained after putting BasicConfigurator.configure() in main(); But I don't understand the reason `flink-runtime-web is not in the classpath`. For me the strange part is using the scala version works well whereas my java version throws exception. 1413 [main] ERROR org.apa

Re: parallelism parameter and output relation

2016-01-21 Thread Serkan Taş
Hi Robert, I found the the real reason for the case. Sorry but missed that the example project was using 0.8.1. It is resolved after replacing with 0.10.1. > 20 Oca 2016 tarihinde 16:40 saatinde, Robert Metzger > şunları yazdı: > > Hi Serkan, > > yes, with parallelism=1, you'll get one fi

Re: Backpressure in the context of JDBCOutputFormat update

2016-01-21 Thread Maximilian Bode
Hi Robert, sorry, I should have been clearer in my initial mail. The two cases I was comparing are: 1) distinct() before Insert (which is necessary as we have a unique key constraint in our database), no distinct() before update 2) distinct() before insert AND distinct() before update The test

Re: Backpressure in the context of JDBCOutputFormat update

2016-01-21 Thread Robert Metzger
Hi Max, is the distinct() operation reducing the size of the DataSet? If so, I assume you have an idempotent update and the job is faster because fewer updates are done? if the distinct() operator is not changing anything, then, the job might be faster because the INSERT is done while Flink is sti

Backpressure in the context of JDBCOutputFormat update

2016-01-21 Thread Maximilian Bode
Hi everyone, in a Flink (0.10.1) job with two JDBCOutputFormat sinks, one of them (doing a database update) is performing slower than the other one (an insert). The job as a whole is also slow as upstream operators are slowed down due to backpressure. I am able to speed up the whole job by intr

Re: Cannot start FlinkMiniCluster.WebServer using different port in FlinkMiniCluster

2016-01-21 Thread HungChang
Thanks for your reply. Yea I'm not sure how to use WebMonitor. For me it's about to write the log into a file in disk that should go to the job manager originally at localhost:8081. Could you please give an brief example how to use it? Best, Sendoh -- View this message in context: http://

Re: Cannot start FlinkMiniCluster.WebServer using different port in FlinkMiniCluster

2016-01-21 Thread Till Rohrmann
leaderRetrievalService will retrieve the leading JobManager. Take a look at LeaderRetrievalUtils in order to see how it is created and what options are supported. actorSystem is the ActorSystem which is used to resolve the leader’s Akka URL into an ActorRef. You can simply create one or use an exis

Re: Unexpected out of bounds error in UnilateralSortMerger

2016-01-21 Thread Theodore Vasiloudis
And this is the one from running with a CSV input, this time I've verified that I'm using the correct version of Flink, according to Till's instructions: The program finished with the following exception: > > org.apache.flink.client.program.ProgramInvocationException: The program > execution fail

Re: Cannot start FlinkMiniCluster.WebServer using different port in FlinkMiniCluster

2016-01-21 Thread HungChang
Thanks for your suggestion. I have some questions to start WebRuntimeMonitor. In startWebRuntimeMonitor what should be called for - leaderRetrievalService: LeaderRetrievalService, - actorSystem: ActorSystem ? My ref: (https://ci.apache.org/projects/flink/flink-docs-master/api/java/org/

Re: Unexpected out of bounds error in UnilateralSortMerger

2016-01-21 Thread Theodore Vasiloudis
This is the stack trace from running with the patched branch: The program finished with the following exception: > > org.apache.flink.client.program.ProgramInvocationException: The program > execution failed: Job execution failed. > at > org.apache.flink.client.program.Client.runBlocking(

Re: Actual byte-streams in multiple-node pipelines

2016-01-21 Thread Fabian Hueske
Hi Tal, you said that most processing will be done in external processes. If these processes are stateful, this might be hard to integrate with Flink's fault-tolerance mechanism. In principle, Flink requires two things to achieve exactly-once processing: 1) A data source that can be replayed from

Re: Could not upload the jar files to the job manager IOException

2016-01-21 Thread Robert Metzger
Hi, this is the log file of your local client submitting the job to Flink's JobManager (master). As you can see from the log, the jar upload failed because of this issue: "Caused by: java.net.SocketException: Connection reset". The JobManager is the service at the other end receiving the file. I'm

Re: Could not upload the jar files to the job manager IOException

2016-01-21 Thread Ana M. Martinez
Hi Robert, Thanks for your answer. Do you mean the log file in (e.g.) flink-0.10.0/log/flink-hadoop-client-ip-172-31-10-193.log? Or you mean another log file? In this log, the error message is as follows: 08:16:03,437 INFO org.apache.flink.runtime.client.JobClient - Job