Re: Connecting the channel failed: Connection refused

2015-06-24 Thread Aaron Jackson
That was it. host3 was showing localhost - looked a little further and it was missing an entry in /etc/hosts. Thanks for looking into this. Aaron On Wed, Jun 24, 2015 at 2:13 PM, Stephan Ewen wrote: > Aaron, > > Can you check how the TaskManagers register at the JobManager? When you > look at

Re: Pergem exception from web-client

2015-06-24 Thread Flavio Pompermaier
ok thanks Matthias On 24 Jun 2015 21:00, "Matthias J. Sax" wrote: > Hi, > > you need to increase JVM parameter "-XX:MaxPermSize=" > The default value should be something like "64m" > > Just add the flag to variable JVM_ARGS in "bin/webclient.sh" (line 33). > -> Compare "bin/jobmanager.sh" (line 3

Re: Connecting the channel failed: Connection refused

2015-06-24 Thread Stephan Ewen
Aaron, Can you check how the TaskManagers register at the JobManager? When you look at the 'TaskManagers' section in the JobManager's web Interface (at port 8081), what does it say as the TaskManager host names? Does it list "host1", "host2", "host3"...? Thanks, Stephan Am 24.06.2015 20:31 schr

Re: Pergem exception from web-client

2015-06-24 Thread Matthias J. Sax
Hi, you need to increase JVM parameter "-XX:MaxPermSize=" The default value should be something like "64m" Just add the flag to variable JVM_ARGS in "bin/webclient.sh" (line 33). -> Compare "bin/jobmanager.sh" (line 35) -Matthias On 06/24/2015 06:38 PM, Flavio Pompermaier wrote: > Hi to all, >

Re: Connecting the channel failed: Connection refused

2015-06-24 Thread Ufuk Celebi
On 24 Jun 2015, at 16:22, Aaron Jackson wrote: > Thanks. My setup is actually 3 task managers x 4 slots. I played with the > parallelism and found that at low values, the error did not occur. I can > only conclude that there is some form of data shuffling that is occurring > that is sensiti

Pergem exception from web-client

2015-06-24 Thread Flavio Pompermaier
Hi to all, I'm facing an "OutOfMemoryError: PermGen space" when running my job multiple times from the web client interface. Where do I need to increase it? The full stacktrace is: org.apache.flink.client.program.ProgramInvocationException: The program's entry point class '' caused an exception d

Re: using value that are not passed as parameter

2015-06-24 Thread Stephan Ewen
Hi Michele! This may help explain things: https://cwiki.apache.org/confluence/display/FLINK/Variables+Closures+vs.+Broadcast+Variables Greetings, Stephan On Wed, Jun 24, 2015 at 6:23 PM, Michele Bertoni < michele1.bert...@mail.polimi.it> wrote: > Hi everybody, > this question may sound stupid

Re: Datasets union CompilerException

2015-06-24 Thread Flavio Pompermaier
Unfortunately not in public. Moreover, this error seems to be non-deterministic: sometimes the program runs correctly :( On Wed, Jun 24, 2015 at 6:04 PM, Stephan Ewen wrote: > Hmm, that issue must have been introduced as part of a recent change. > > Can you share your program so we can reproduce

using value that are not passed as parameter

2015-06-24 Thread Michele Bertoni
Hi everybody, this question may sound stupid, but I would like to be clear about what happens if, inside a dataset transformation (e.g. a map), I use something that is declared somewhere else, like a variable or a dataset, and not passed as a broadcast dataset or as a parameter in the constructor of a r

Re: Datasets union CompilerException

2015-06-24 Thread Stephan Ewen
Hmm, that issue must have been introduced as part of a recent change. Can you share your program so we can reproduce and fix the bug? On 24.06.2015 16:56, "Flavio Pompermaier" wrote: > Hi to all, > > I'm running my flink-job on the new 0.9.0 version of Flink and now I get > this exception: > >

Datasets union CompilerException

2015-06-24 Thread Flavio Pompermaier
Hi to all, I'm running my flink-job on the new 0.9.0 version of Flink and now I get this exception: Exception in thread "main" org.apache.flink.optimizer.CompilerException: Bug: Plan generation for Unions picked a ship strategy between binary plan operators. at org.apache.flink.optimizer.traversa

Re: Connecting the channel failed: Connection refused

2015-06-24 Thread Aaron Jackson
Thanks. My setup is actually 3 task managers x 4 slots. I played with the parallelism and found that at low values, the error did not occur. I can only conclude that there is some form of data shuffling that is occurring that is sensitive to the data source. Yes, seems a little odd to me as wel

[ANNOUNCE] Apache Flink 0.9.0 released

2015-06-24 Thread Maximilian Michels
The Apache Flink community is pleased to announce the availability of the 0.9.0 release. Apache Flink is an open source platform for scalable batch and stream data processing. Flink's core consists of a streaming dataflow engine that provides data distribution, communication, and fault tolerance f

Re: Random Shuffling

2015-06-24 Thread Maximilian Alber
Thanks Sebastian! What do you mean by driver? Before submitting to the cluster? Knowing the dataset size is ok. On Wed, Jun 24, 2015 at 11:08 AM, Sebastian wrote: > A very simple way to achieve this is to generate a random variate on the driver > that describes a mapping of datapoints to samples.

Re: Connecting the channel failed: Connection refused

2015-06-24 Thread Ufuk Celebi
Hey Aaron, thanks for preparing the example. I've checked it out and tried it with a similar setup (12 task managers with 1 slot each, running the job with parallelism of 12). I couldn't reproduce the problem. What have you configured in the "slaves" file? I think Flink does not allow you to

Re: Random Shuffling

2015-06-24 Thread Sebastian
A very simple way to achieve this is to generate a random variate on the driver that describes a mapping of datapoints to samples. Then you simply join the dataset with this mapping to generate the samples. This approach requires you to know the size of the dataset in advance, but has the advantage
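
For readers of this thread, the idea described above (build a random mapping of data points to samples on the driver, then join the dataset with it) can be sketched in plain Python. This is only an illustration under assumptions, not Flink code: the names `make_split_mapping` and `join_with_mapping` are hypothetical, the "join" is a dictionary lookup on an index, and the data points are assumed to be numbered 0..n-1 already.

```python
import random


def make_split_mapping(n, ratios, seed=0):
    """Build, on the driver, a random mapping index -> sample id.

    The sample sizes follow `ratios` exactly; the last sample takes
    whatever remains after rounding the others down.
    """
    counts = [int(n * r) for r in ratios]
    counts[-1] = n - sum(counts[:-1])
    # One label per data point, e.g. [0, 0, 0, 0, 1, 1, 2, 2] before shuffling.
    labels = [sample for sample, c in enumerate(counts) for _ in range(c)]
    random.Random(seed).shuffle(labels)
    return dict(enumerate(labels))


def join_with_mapping(indexed_data, mapping):
    """Join (index, value) pairs with the mapping and group by sample id."""
    samples = {}
    for idx, value in indexed_data:
        samples.setdefault(mapping[idx], []).append(value)
    return samples


if __name__ == "__main__":
    data = list(enumerate("abcdefgh"))  # (index, value) pairs
    mapping = make_split_mapping(len(data), (0.5, 0.25, 0.25), seed=42)
    for sample_id, values in sorted(join_with_mapping(data, mapping).items()):
        print(sample_id, values)
```

Because the labels are generated with fixed counts and only then shuffled, the ratio is kept exactly, which is why the dataset size has to be known in advance.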

Re: Random Shuffling

2015-06-24 Thread Maximilian Alber
That's not the point. In Machine Learning one often divides a data set X into e.g. three sets, one for the training, one for the validation, one for the final testing. The sets are usually created randomly according to some ratio. Thus it would be important to keep the ratio and to do the whole pro

Re: Best way to write data to HDFS by Flink

2015-06-24 Thread Stephan Ewen
Hi Hawin! If you are creating code for such an output into different files/partitions, it would be amazing if you could contribute this code to Flink. It seems like a very common use case, so this functionality will be useful to other users as well! Greetings, Stephan On Tue, Jun 23, 2015 at 3:

Re: Documentation Error

2015-06-24 Thread Maximilian Michels
Hi Max, Thanks for noticing! Fixed on the master and for the 0.9.1 release. Cheers, Max On Tue, Jun 23, 2015 at 5:09 PM, Maximilian Alber < alber.maximil...@gmail.com> wrote: > Hi Flinksters, > > just some minor: > http://ci.apache.org/projects/flink/flink-docs-master/setup/yarn_setup.html > in

Re: Random Shuffling

2015-06-24 Thread Stephan Ewen
If you do "rebalance()", it will redistribute elements in round-robin fashion, which should give you very even partition sizes. On Tue, Jun 23, 2015 at 11:51 AM, Maximilian Alber < alber.maximil...@gmail.com> wrote: > Thank you! > > Still I cannot guarantee the size of each partition, or can I? > S
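
To illustrate why round-robin redistribution evens out partition sizes, here is a small plain-Python sketch. It is not Flink's implementation and does not call the Flink API; the function name `rebalance` is reused only for readability, and a single global counter is assumed for simplicity.

```python
def rebalance(partitions):
    """Redistribute all elements over the same number of partitions,
    handing them out one at a time in round-robin order."""
    n = len(partitions)
    out = [[] for _ in range(n)]
    i = 0
    for part in partitions:
        for elem in part:
            out[i % n].append(elem)
            i += 1
    return out


if __name__ == "__main__":
    skewed = [list(range(1, 11)), [], [11]]  # sizes 10, 0, 1
    print([len(p) for p in rebalance(skewed)])
```

With this scheme the resulting partition sizes differ by at most one element, regardless of how skewed the input partitions were.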