Re: PartitionedByHash input annotation?

2015-05-21 Thread Fabian Hueske
Hi Alex, did my previous mail answer these questions as well? Cheers, Fabian 2015-05-18 22:03 GMT+02:00 Alexander Alexandrov < alexander.s.alexand...@gmail.com>: > In the dawn of Flink when Flink Operators were still called PACTs (short > for Parallelization Contracts) the system used to suppor

Re: Read corpus documents sentence by sentence instead of linewise

2015-05-21 Thread Stephan Ewen
If you want the inputs to be chunked by sentence, you can try to split sentences at the period character. You can do this with the DelimitedInputFormat by setting the delimiter. The readAsText actually uses a special case of the delimited input format that splits at line breaks. Greetings, Stephan
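
The effect of choosing the period as the record delimiter can be sketched in plain Java. This is only an illustration of delimiter-based chunking and does not use the Flink API; the class name `SentenceChunker`, the method `splitRecords`, and the trimming of whitespace are assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;

class SentenceChunker {
    // Hypothetical helper: cuts the text into records at every occurrence of
    // the delimiter, similar in spirit to what a delimited input format does.
    static List<String> splitRecords(String text, String delimiter) {
        if (delimiter.isEmpty()) {
            throw new IllegalArgumentException("delimiter must not be empty");
        }
        List<String> records = new ArrayList<>();
        int start = 0;
        while (start < text.length()) {
            int end = text.indexOf(delimiter, start);
            if (end < 0) {
                end = text.length(); // last record has no trailing delimiter
            }
            // Trimming is a choice made for this sketch, not Flink behavior.
            String record = text.substring(start, end).trim();
            if (!record.isEmpty()) {
                records.add(record);
            }
            start = end + delimiter.length();
        }
        return records;
    }

    public static void main(String[] args) {
        String corpus = "Flink reads files.\nA sentence may span\ntwo lines. Done.";
        for (String sentence : splitRecords(corpus, ".")) {
            System.out.println("[" + sentence + "]");
        }
    }
}
```

With "." as the delimiter, a sentence that spans a line break stays in one record, whereas a line-break delimiter would cut it in two. A plain period delimiter also splits at abbreviations and decimal numbers, so real corpora may need extra handling.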

Re: Projection exception with "Thrifted" Parquet data

2015-05-21 Thread Stephan Ewen
Hi Flavio! The latest master should have a fix for the problem. The issue is tracked here: https://issues.apache.org/jira/browse/FLINK-2071 Greetings, Stephan On Thu, May 21, 2015 at 2:24 PM, Flavio Pompermaier wrote: > Great! Thanks for the great support Stephan > > On Thu, May 21, 2015 at

Re: Projection exception with "Thrifted" Parquet data

2015-05-21 Thread Flavio Pompermaier
Great! Thanks for the great support Stephan On Thu, May 21, 2015 at 2:22 PM, Stephan Ewen wrote: > Hi! > > It seems that the "project()" operator wants to write an empty instance > of type "Person" as part of the object, which is not possible for thrift. > > We can remove that instance, since i

Re: Projection exception with "Thrifted" Parquet data

2015-05-21 Thread Stephan Ewen
Hi! It seems that the "project()" operator wants to write an empty instance of type "Person" as part of the object, which is not possible for thrift. We can remove that instance, since it is not really needed. It was intended to be a reusable object instance, but it has become obsolete. Stephan

Re: k means - waiting for dataset

2015-05-21 Thread Till Rohrmann
Concerning your first problem that you only see one resulting centroid, your code looks good modulo the parts you haven't posted. However, your problem could simply be caused by a bad selection of initial centroids. If, for example, all centroids except for one don't get any points assigned, then
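
A small plain-Java sketch of how a bad choice of initial centroids can leave all but one centroid without points. It is one-dimensional for brevity and does not use Flink; the class name `CentroidAssignment`, the method `countAssignments`, and the sample values are assumptions made for the example.

```java
import java.util.Arrays;

class CentroidAssignment {
    // Hypothetical helper: for each centroid, counts how many points have it
    // as their nearest centroid (the assignment step of k-means).
    static int[] countAssignments(double[] points, double[] centroids) {
        int[] counts = new int[centroids.length];
        for (double p : points) {
            int nearest = 0;
            for (int c = 1; c < centroids.length; c++) {
                if (Math.abs(p - centroids[c]) < Math.abs(p - centroids[nearest])) {
                    nearest = c;
                }
            }
            counts[nearest]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        double[] points = {1.0, 2.0, 10.0, 11.0, 20.0, 21.0};
        double[] badStart = {5.0, 100.0, 200.0};  // two centroids far from all data
        double[] goodStart = {1.0, 10.0, 20.0};   // one centroid near each group
        System.out.println(Arrays.toString(countAssignments(points, badStart)));  // [6, 0, 0]
        System.out.println(Arrays.toString(countAssignments(points, goodStart))); // [2, 2, 2]
    }
}
```

If an implementation recomputes centroids only from the points assigned to them, the two empty centroids in the first case would plausibly drop out of the result, leaving a single centroid. Whether that is what happens in the code under discussion is an assumption, since the full program is not shown.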

Projection exception with "Thrifted" Parquet data

2015-05-21 Thread Flavio Pompermaier
Hi all, I'm using code that is basically derived from the Flink-Parquet example at https://github.com/FelixNeutatz/parquet-flinktacular/blob/master/java/thrift/flink/src/main/java/flink/parquet/ParquetThriftExample.java . If I do the following everything works fine: DataSet> input = read

Re: k means - waiting for dataset

2015-05-21 Thread Stephan Ewen
Hi! This problem should not depend on any user code. There are no user-code dependent actors in Flink. Is there more of the stack trace that you can send us? It looks like the core exception that is causing the issue is not part of the stack trace. Greetings, Stephan On Thu, May 21, 2015 a

Re: ClassReader could not be created

2015-05-21 Thread Stephan Ewen
This looks suspicious, but it should actually also be a consequence of a failure or disconnect between the TaskManager and the JobManager. Can you send us the whole log so that we can have a closer look? Thanks, Stephan On Thu, May 21, 2015 at 10:59 AM, Flavio Pompermaier wrote: > Could it be this the ma

Re: k means - waiting for dataset

2015-05-21 Thread Pa Rö
Hi, the exception occurs with version 0.9. With version 0.8.1 there is no exception, but the results are wrong. Here is my main: public static void main(String[] args) { //load properties Properties pro = new Properties(); try { pro.load(new FileInputStream("./resource

Re: k means - waiting for dataset

2015-05-21 Thread Till Rohrmann
Hi Paul, could you share your code with us so that we can see whether there is any error? Does this error also occur with 0.9-SNAPSHOT? Cheers, Till On Thu, May 21, 2015 at 11:11 AM, Pa Rö wrote: > hi flink community, > > i have implement k-means for clustering temporal geo data. i use the

k means - waiting for dataset

2015-05-21 Thread Pa Rö
Hi Flink community, I have implemented k-means for clustering temporal geo data. I use the following GitHub project and my own data structure: https://github.com/apache/flink/blob/master/flink-examples/flink-java-examples/src/main/java/org/apache/flink/examples/java/clustering/KMeans.java not i hav

Re: ClassReader could not be created

2015-05-21 Thread Flavio Pompermaier
Could this be the main reason for the failure? 09:45:58,650 WARN akka.remote.ReliableDeliverySupervisor - Association with remote system [akka.tcp://flink@192.168.234.83:6123] has failed, address is now gated for [5000] ms. Reason is: [Disassociated]. 09:45:58,831 WARN Remoting - Tried to a

Re: Spark and Flink

2015-05-21 Thread Pa Rö
Thanks a lot for your help. I have now split my project, and it works. 2015-05-19 15:44 GMT+02:00 Alexander Alexandrov < alexander.s.alexand...@gmail.com>: > Sorry, we're using a forked version which changed groupID. > > 2015-05-19 15:15 GMT+02:00 Till Rohrmann : > >> I guess it's a typo: "eu.stratosphere

Re: ClassReader could not be created

2015-05-21 Thread Stephan Ewen
Hi! Interruptions usually happen as part of cancelling. Has the job failed for some other reason (and that exception is only a follow-up)? Or is this the root cause of the failure? Stephan On Thu, May 21, 2015 at 9:55 AM, Flavio Pompermaier wrote: > Now I'm able to run my job but after a whi

Re: ClassReader could not be created

2015-05-21 Thread Flavio Pompermaier
Now I'm able to run my job but after a while I get this other exception: 09:43:49,383 INFO org.apache.flink.runtime.taskmanager.TaskManager - Unregistering task and sending final execution state FINISHED to JobManager for task CHAIN DataSource (at createInput(ExecutionEnvironment.java:490) (