RE: cascading-flink 1.0 results

2016-03-31 Thread Ken Krugler
Hi Fabian, I figured you might be away - and sorry for interrupting your vacation. I've got ahead and opened issues for these two items. Regards, -- Ken > From: Fabian Hueske > Sent: March 31, 2016 3:44:07pm PDT > To: dev@flink.apache.org > Subject: Re: cascading-flink 1.0 results > > Hi Ken,

Re: cascading-flink 1.0 results

2016-03-31 Thread Fabian Hueske
Hi Ken, I'm currently on vacation and will be back in a week. Would you like to open an issue at the cascading-flink Github project a describe the Scheme.setNumSinkParts() problem? I'll try to fix it when I'm back. Thanks for checking with Chris the ComparePlatformsTest issue. I'll exclude that t

Re: Writing multiple streams to multiple kafka

2016-03-31 Thread Deepak Jha
It works... Thanks On Thu, Mar 31, 2016 at 2:23 AM, Aljoscha Krettek wrote: > Hi, > yes you can output the stages to several different Kafka Topics. If you > don't want to call addSink inside the run() method you somehow have to > return the handle to your stage3 DataStream, for example: > > pri

Re: Submission Problem

2016-03-31 Thread Till Rohrmann
The PR Ufuk referenced should fix the problem. It is a known issue. It's not merged yet, because I wanted to add an improved test case. In the documentation for version 1.1-Snapshot, I've already addressed the problem with the configuration parameter description. Cheers, Till On Mar 31, 2016 6:09

Re: Submission Problem

2016-03-31 Thread Stephan Ewen
I guess the fix is to put the entire ExecutionConfig into a "SerializedValue"? On Thu, Mar 31, 2016 at 5:34 PM, Matthias J. Sax wrote: > StormConfig is set a a global job parameter > > FlinkClient.java line 337ff > > > ExecutionConfig flinkConfig = > topology.getExecutionEnvironment().getConfig(

Re: Apache Flink <=> Apache Ignite integration

2016-03-31 Thread Saikat Maitra
Hi , I agree with Roman and Raul. https://issues.apache.org/jira/browse/IGNITE-813 allows injecting data to into cache via Data Streamer. Integrating with Ignite FileSystem for source and sink will allow for bidirectional connector. It will also allow easier implementation for DataStream transform

[VOTE] Release Apache Flink 1.0.1 (RC1)

2016-03-31 Thread Ufuk Celebi
Dear Flink community, Please vote on releasing the following candidate as Apache Flink version 1.0.1. The commit to be voted on: 4afa401ab3c2b53de115d17a3157e8b80431dd10 Branch: release-1.0.1-rc1 (see https://git1-us-west.apache.org/repos/asf/flink/?p=flink.git;a=shortlog;h=refs/heads/release-1.

Re: Submission Problem

2016-03-31 Thread Matthias J. Sax
StormConfig is set a a global job parameter FlinkClient.java line 337ff > ExecutionConfig flinkConfig = topology.getExecutionEnvironment().getConfig(); > flinkConfig.setGlobalJobParameters(new StormConfig(conf)); On 03/31/2016 05:05 PM, Stephan Ewen wrote: > Hmm, it is wrong that the JobManage

Re: Submission Problem

2016-03-31 Thread Ufuk Celebi
@Till: Isn't this an instance of the bug fixed in https://github.com/apache/flink/pull/1818? On Thu, Mar 31, 2016 at 5:05 PM, Stephan Ewen wrote: > Hmm, it is wrong that the JobManager tries to load that class directly from > the actor message. > All user code should be deserialized lazily. > > H

Re: Submission Problem

2016-03-31 Thread Stephan Ewen
Hmm, it is wrong that the JobManager tries to load that class directly from the actor message. All user code should be deserialized lazily. How is that class passed? Implicitly through some config? On Thu, Mar 31, 2016 at 4:51 PM, Matthias J. Sax wrote: > Here we go... > > StormConfig.class is

Re: Submission Problem

2016-03-31 Thread Matthias J. Sax
Here we go... StormConfig.class is contained in the user jar file. I guess I need to "register" it somehow? Or is it a class loading issue? > 2016-03-31 16:47:33,095 ERROR akka.remote.EndpointWriter > - AssociationError [akka.tcp://flink@127.0.0.1:6123] <- >

Re: Submission Problem

2016-03-31 Thread Till Rohrmann
Could you please rerun the whole job with debug log level and logging of Akka's lifecycle events turned on? Cheers, Till On Thu, Mar 31, 2016 at 4:21 PM, Matthias J. Sax wrote: > enclosed the logs.. maybe you can make some sense out if them. > > On 03/31/2016 02:52 PM, Till Rohrmann wrote: > >

Re: Submission Problem

2016-03-31 Thread Matthias J. Sax
enclosed the logs.. maybe you can make some sense out if them. On 03/31/2016 02:52 PM, Till Rohrmann wrote: > I would assume that something went wrong on the JobManager side. Could you > check the logs if they contain something suspicious? Additionally you could > turn on lifecycle event logging​

Re: Performing consecutive Action operators

2016-03-31 Thread Till Rohrmann
Hi Do, the easiest way is to avoid using methods which trigger an eager execution (collect, count, print) but to define sinks instead. Alternatively, you can persist intermediate results by writing them to disk and continue processing from there. That way, you won't re-calculate all parts of your

Performing consecutive Action operators

2016-03-31 Thread Le Quoc Do
Hi all, Right now, in Flink, if I call to 2 action operators (print, count, collect, ) consecutively, Flink will create 2 independent execution plans. A simple example: DataSet text = env.fromElements( "Some text ….", );

Re: Submission Problem

2016-03-31 Thread Till Rohrmann
I would assume that something went wrong on the JobManager side. Could you check the logs if they contain something suspicious? Additionally you could turn on lifecycle event logging​ for Akka. Cheers, Till ​

Re: a typical ML algorithm flow

2016-03-31 Thread Till Rohrmann
I agree that Flink’s concept of the closed loop iteration does not translate so easily to a more general distributed linear algebra DSL such as Samsara. There one usually writes loops using the for and while primitives. Unfortunately, it is not so trivial to automatically translate a for loop into

Submission Problem

2016-03-31 Thread Matthias J. Sax
Hi, I just tried to submit Flink's Storm-Topology example via command line: bin/flink run ~/workspace_flink/flink/flink-contrib/flink-storm-examples/target/WordCount-StormTopology.jar However, I get a timeout and the program is not submitted. I tracked the problem down to the following statemen

[jira] [Created] (FLINK-3685) Logical error in code for DateSerializer deserialize with reuse

2016-03-31 Thread ZhengBowen (JIRA)
ZhengBowen created FLINK-3685: - Summary: Logical error in code for DateSerializer deserialize with reuse Key: FLINK-3685 URL: https://issues.apache.org/jira/browse/FLINK-3685 Project: Flink Issu

[jira] [Created] (FLINK-3684) CEP operator does not forward watermarks properly

2016-03-31 Thread Till Rohrmann (JIRA)
Till Rohrmann created FLINK-3684: Summary: CEP operator does not forward watermarks properly Key: FLINK-3684 URL: https://issues.apache.org/jira/browse/FLINK-3684 Project: Flink Issue Type: B

Re: Writing multiple streams to multiple kafka

2016-03-31 Thread Aljoscha Krettek
Hi, yes you can output the stages to several different Kafka Topics. If you don't want to call addSink inside the run() method you somehow have to return the handle to your stage3 DataStream, for example: private val env = StreamExecutionEnvironment.getExecutionEnvironment private val src = env.ad

Re: Apache Flink <=> Apache Ignite integration

2016-03-31 Thread Aljoscha Krettek
Hi, it should already be possible to use the Ignite FileSystem to store state since we just use the HDFS FileSystem interface for that. Of course, one would have to properly set up the jars and paths and everything for Flink to pick up the IGFS classes. Cheers, Aljoscha On Wed, 30 Mar 2016 at 16:

Re: RichMapPartitionFunction - problems with collect

2016-03-31 Thread Till Rohrmann
Hi Sergio, could you please provide a complete example (including input data) to reproduce your problem. It is hard to tell what's going wrong when one only sees a fraction of the program. Cheers, Till On Tue, Mar 29, 2016 at 5:58 PM, Sergio Ramírez wrote: > Hi again, > > I've not been able to