[jira] [Created] (FLINK-4872) Type erasure problem exclusively on cluster execution

2016-10-20 Thread Martin Junghanns (JIRA)
Martin Junghanns created FLINK-4872: --- Summary: Type erasure problem exclusively on cluster execution Key: FLINK-4872 URL: https://issues.apache.org/jira/browse/FLINK-4872 Project: Flink

Re: Type erasure problem solely on cluster execution

2016-10-19 Thread Martin Junghanns
blem with the forwarding annotations in more detail? I would be interested in the error message and how the semantic annotation is provided (@ForwardFields or withForwardedFields()). Thanks, Fabian 2016-10-19 8:52 GMT+02:00 Martin Junghanns : Hi, I am running into a type erasure problem which

Type erasure problem solely on cluster execution

2016-10-18 Thread Martin Junghanns
Hi, I am running into a type erasure problem which only occurs when I execute the code using a Flink cluster (1.1.2). I created a Gist [1] which reproduces the problem. I also added a unit test to show that it does not fail in local and collection mode. Maybe it is also interesting to mentio
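
For readers unfamiliar with the term, here is a minimal plain-Java sketch of type erasure itself. It does not reproduce the Gist [1] or the cluster-only failure described above, it uses no Flink classes, and the name ErasureDemo is hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

public class ErasureDemo {
    // Compares the runtime classes of two lists. Type arguments are erased
    // at compile time, so only the raw class (e.g. ArrayList) is visible here.
    public static boolean sameRuntimeClass(List<?> a, List<?> b) {
        return a.getClass() == b.getClass();
    }

    public static void main(String[] args) {
        List<String> strings = new ArrayList<>();
        List<Integer> ints = new ArrayList<>();
        // Both lists report the same runtime class despite different type arguments.
        System.out.println(sameRuntimeClass(strings, ints));
    }
}
```

Because the element type is not available at runtime, a framework that needs it has to obtain it another way, for example from explicitly supplied type information.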

Re: Type problem in RichFlatMapFunction when using GenericArray type

2016-10-10 Thread Martin Junghanns
: Hello Martin, Could you include the error you are getting? Regards, Chesnay On 10.10.2016 13:31, Martin Junghanns wrote: Hi, I ran into a problem when using generic arrays in a tuple. I wrote a minimal program to reproduce the error [1]. The problem seems to be related to the order of tuple

Re: Type problem in RichFlatMapFunction when using GenericArray type

2016-10-10 Thread Martin Junghanns
:351) at org.apache.flink.runtime.taskmanager.Task.run(Task.java:584) at java.lang.Thread.run(Thread.java:745) On 10.10.2016 13:39, Chesnay Schepler wrote: Hello Martin, Could you include the error you are getting? Regards, Chesnay On 10.10.2016 13:31, Martin Junghanns wrote: Hi

Type problem in RichFlatMapFunction when using GenericArray type

2016-10-10 Thread Martin Junghanns
Hi, I ran into a problem when using generic arrays in a tuple. I wrote a minimal program to reproduce the error [1]. The problem seems to be related to the order of tuple fields. When I switch Tuple2 to Tuple2 and perform the join on field 0, everything works as expected. Using Flink 1.1.2.

Re: Gelly Library. Need an example

2016-09-17 Thread Martin Junghanns
Hi all, thanks for reporting the issue! I just looked into it. The VertexGroupReduce outputs two semantically different tuples: one for each vertex without the value (null) and one tuple representing the whole group including the value. As Till pointed out, this crashes if the value has no se

[jira] [Created] (FLINK-3618) Rename abstract UDF classes in Scatter-Gather implementation

2016-03-16 Thread Martin Junghanns (JIRA)
Martin Junghanns created FLINK-3618: --- Summary: Rename abstract UDF classes in Scatter-Gather implementation Key: FLINK-3618 URL: https://issues.apache.org/jira/browse/FLINK-3618 Project: Flink

[jira] [Created] (FLINK-3611) Wrong link in CONTRIBUTING.md

2016-03-13 Thread Martin Junghanns (JIRA)
Martin Junghanns created FLINK-3611: --- Summary: Wrong link in CONTRIBUTING.md Key: FLINK-3611 URL: https://issues.apache.org/jira/browse/FLINK-3611 Project: Flink Issue Type: Bug

[jira] [Created] (FLINK-3272) Generalize vertex value type in ConnectedComponents

2016-01-22 Thread Martin Junghanns (JIRA)
Martin Junghanns created FLINK-3272: --- Summary: Generalize vertex value type in ConnectedComponents Key: FLINK-3272 URL: https://issues.apache.org/jira/browse/FLINK-3272 Project: Flink

Re: [gelly] partition-centric iterations

2016-01-07 Thread Martin Junghanns
Hi, this would be a very nice addition! I had a quick look at the PC implementation and the two library algorithms, and once you get the idea, it is easy to follow what's happening. The benchmark results are also very promising. I have some questions about partitions: 1) I was wondering if the

[jira] [Created] (FLINK-3122) Generalize value type in LabelPropagation

2015-12-05 Thread Martin Junghanns (JIRA)
Martin Junghanns created FLINK-3122: --- Summary: Generalize value type in LabelPropagation Key: FLINK-3122 URL: https://issues.apache.org/jira/browse/FLINK-3122 Project: Flink Issue Type

[jira] [Created] (FLINK-3118) Check if MessageFunction implements ResultTypeQueryable

2015-12-04 Thread Martin Junghanns (JIRA)
Martin Junghanns created FLINK-3118: --- Summary: Check if MessageFunction implements ResultTypeQueryable Key: FLINK-3118 URL: https://issues.apache.org/jira/browse/FLINK-3118 Project: Flink

Re: Serialization problem in CollectionEnvironment

2015-11-27 Thread Martin Junghanns
ization as a fallback strategy. That’s a bug in these components. I’ll file an issue and open a PR for it. Thanks a lot for finding this problem Martin! Cheers, Till On Fri, Nov 27, 2015 at 11:59 AM, Martin Jun

Re: Serialization problem in CollectionEnvironment

2015-11-27 Thread Martin Junghanns
, Martin Junghanns wrote: Hi, While building IT tests which extend MultipleProgramsTestBase, I encountered a problem with serialization: I posted a minimal example here: https://gist.github.com/s1ck/566796df5f35ee1de6f9 This runs fine with LocalEnvironment. However, when executing this in

Serialization problem in CollectionEnvironment

2015-11-27 Thread Martin Junghanns
Hi, While building IT tests which extend MultipleProgramsTestBase, I encountered a problem with serialization: I posted a minimal example here: https://gist.github.com/s1ck/566796df5f35ee1de6f9 This runs fine with LocalEnvironment. However, when executing this in CollectionEnvironment, it le

Re: Null Pointer Exception in tests but only in COLLECTION mode

2015-11-24 Thread Martin Junghanns
Hi Max, fixed in https://github.com/apache/flink/pull/1396 Best, Martin On 24.11.2015 13:46, Maximilian Michels wrote: Hi André, hi Martin, This looks very much like a bug. Martin, I would be happy if you opened a JIRA issue. Thanks, Max On Sun, Nov 22, 2015 at 12:27 PM, Martin Junghanns

[jira] [Created] (FLINK-3064) Missing size check in GroupReduceOperatorBase leads to NPE

2015-11-23 Thread Martin Junghanns (JIRA)
Martin Junghanns created FLINK-3064: --- Summary: Missing size check in GroupReduceOperatorBase leads to NPE Key: FLINK-3064 URL: https://issues.apache.org/jira/browse/FLINK-3064 Project: Flink

Re: Null Pointer Exception in tests but only in COLLECTION mode

2015-11-22 Thread Martin Junghanns
Hi, What he meant was MultipleProgramsTestBase, not FlinkTestBase. I debugged this a bit. The NPE is thrown in https://github.com/apache/flink/blob/master/flink-java/src/main/java/org/apache/flink/api/java/operators/AggregateOperator.java#L296 since current can be null if the input iterator i
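
A plain-Java sketch of the kind of guard this points at, assuming the cut-off sentence refers to an input iterator with no elements. The class and method names are hypothetical; this is not the code from AggregateOperator and not the actual fix.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Iterator;
import java.util.List;

public class EmptyGroupGuard {
    // Sums one group and emits a single result; emits nothing for an empty group.
    // Assumes the iterator yields no null elements.
    public static List<Integer> sumGroup(Iterator<Integer> values) {
        List<Integer> out = new ArrayList<>();
        Integer current = null;
        int sum = 0;
        while (values.hasNext()) {
            current = values.next();
            sum += current;
        }
        // `current` is still null if the loop never ran. Checking it here avoids
        // emitting (or dereferencing) a value that was never read.
        if (current != null) {
            out.add(sum);
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(sumGroup(Collections.<Integer>emptyIterator()));
        System.out.println(sumGroup(Arrays.asList(1, 2, 3).iterator()));
    }
}
```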

Re: SETUP ISSUES

2015-11-19 Thread Martin Junghanns
Hi John, Flink has nice documentation for all APIs [1]. To get you started with the correct Maven dependencies for the DataSet API, please have a look at "Linking With Flink" [2]. On that site, you will also find a lot of Java and Scala examples. Please let us know if you need help. Best, Ma

Re: neo4j - Flink connector

2015-11-12 Thread Martin Junghanns
o use neo4j. I think I can help you much better as soon as I have your current pom file + code. [1] http://www.apache.org/legal/resolved.html#category-a [2] https://github.com/s1ck/flink-neo4j On Wed, Nov 11, 2015 at 7:38 PM, Martin Junghanns wrote: Hi, I am a bit stuck with that dependency pro

Re: neo4j - Flink connector

2015-11-11 Thread Martin Junghanns
Hi, I am a bit stuck with that dependency problem. Any help would be appreciated as I would like to continue working on the formats. Thanks! Best, Martin On 07.11.2015 17:28, Martin Junghanns wrote: Hi Robert, Thank you for the hints. I tried to narrow down the error: Flink version: 0.10

Re: neo4j - Flink connector

2015-11-07 Thread Martin Junghanns
format. On Sat, Oct 31, 2015 at 9:51 AM, Martin Junghanns < m.jungha...@mailbox.org> wrote: Hi, I wanted to give you a little update. I created a non-parallel InputFormat which reads Cypher results from Neo4j into Tuples [1]. It can be used like the JDBCInputFormat: String q = "MA

Re: neo4j - Flink connector

2015-11-06 Thread Martin Junghanns
ay be valuable for an output format. On Sat, Oct 31, 2015 at 9:51 AM, Martin Junghanns wrote: Hi, I wanted to give you a little update. I created a non-parallel InputFormat which reads Cypher results from Neo4j into Tuples [1].

Re: Error when building the docs

2015-11-05 Thread Martin Junghanns
remove Jekyll 3.x and install Jekyll 2.x, you can build docs. I’m using Jekyll 2.5.3 to build the docs. Regards, Chiwan Park On November 6, 2015 at 4:58:34 AM, Martin Junghanns (m.jungha...@mailbox.org) wrote: Hi, not sure if that's an issue or just a misconfiguration (not familiar with

[jira] [Created] (FLINK-2981) Update README for building docs

2015-11-05 Thread Martin Junghanns (JIRA)
Martin Junghanns created FLINK-2981: --- Summary: Update README for building docs Key: FLINK-2981 URL: https://issues.apache.org/jira/browse/FLINK-2981 Project: Flink Issue Type: Improvement

Error when building the docs

2015-11-05 Thread Martin Junghanns
Hi, not sure if that's an issue or just a misconfiguration (not familiar with Ruby). I followed the docs/README.md and ran into: s1ck@s1ck-T450s:~/Devel/Java/flink$ docs/build_docs.sh Configuration file: /home/s1ck/Devel/Java/flink/docs/_config.yml /home/s1ck/Devel/Java/flink/docs/_plugins/remo

Re: neo4j - Flink connector

2015-10-31 Thread Martin Junghanns
Hi, I wanted to give you a little update. I created a non-parallel InputFormat which reads Cypher results from Neo4j into Tuples [1]. It can be used like the JDBCInputFormat: String q = "MATCH (p1:Page)-[:Link]->(p2) RETURN id(p1), id(p2)"; Neo4jInputFormat> neoInput = Neo4jInputFormat.buildNe

Fast strategy for intersect

2015-10-28 Thread Martin Junghanns
Hi all! While working on FLINK-2905, I was wondering what a good (and fast) way to compute the intersect between two data sets (Gelly vertices in my case) with unknown size would be. I came up with three ways to solve this: Consider two sets: DataSet> verticesLeft = this.getVertices(); Dat
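
The three approaches are cut off in this excerpt. As a reference point only, intersecting two collections of IDs in plain Java can be sketched with a hash set. This is not one of the approaches from the message and not a Flink DataSet program; the names IntersectSketch and intersect are hypothetical.

```java
import java.util.Arrays;
import java.util.Collection;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.Set;

public class IntersectSketch {
    // Returns the elements present in both collections, without duplicates,
    // in the order they first appear in `left`.
    public static <T> Set<T> intersect(Collection<T> left, Collection<T> right) {
        Set<T> result = new LinkedHashSet<>(left);
        // Copy `right` into a HashSet so each membership check is O(1) on average.
        result.retainAll(new HashSet<>(right));
        return result;
    }

    public static void main(String[] args) {
        Set<Long> common = intersect(Arrays.asList(1L, 2L, 3L), Arrays.asList(2L, 3L, 4L));
        System.out.println(common);
    }
}
```

This in-memory version requires both inputs to fit on one machine, which is exactly what cannot be assumed for data sets of unknown size.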

Re: [gelly] Spargel model rework

2015-10-27 Thread Martin Junghanns
Hi, In our group, we also moved several algorithms from Giraph to Gelly and ran into some confusing issues (first in understanding, second during implementation) caused by the conceptual differences you described. If there are no concrete advantages (performance mainly) in the Spargel impl

[jira] [Created] (FLINK-2910) Combine tests for binary graph operators

2015-10-24 Thread Martin Junghanns (JIRA)
Martin Junghanns created FLINK-2910: --- Summary: Combine tests for binary graph operators Key: FLINK-2910 URL: https://issues.apache.org/jira/browse/FLINK-2910 Project: Flink Issue Type

[jira] [Created] (FLINK-2905) Add intersect method to Graph class

2015-10-23 Thread Martin Junghanns (JIRA)
Martin Junghanns created FLINK-2905: --- Summary: Add intersect method to Graph class Key: FLINK-2905 URL: https://issues.apache.org/jira/browse/FLINK-2905 Project: Flink Issue Type: New

Re: Parsing JSON

2015-10-20 Thread Martin Junghanns
An example can be found here: Code to read JSON from line: https://github.com/dbs-leipzig/gradoop/blob/master/gradoop-flink/src/main/java/org/gradoop/io/json/JsonReader.java#L57 How it is called: https://github.com/dbs-leipzig/gradoop/blob/master/gradoop-flink/src/main/java/org/gradoop/model/im

[jira] [Created] (FLINK-2590) DataSetUtils.zipWithUniqueID creates duplicate IDs

2015-08-28 Thread Martin Junghanns (JIRA)
Martin Junghanns created FLINK-2590: --- Summary: DataSetUtils.zipWithUniqueID creates duplicate IDs Key: FLINK-2590 URL: https://issues.apache.org/jira/browse/FLINK-2590 Project: Flink Issue

Re: Gelly Blog Post

2015-08-23 Thread Martin Junghanns
Hi, this is a very nice blog post! I added some minor comments. I am really excited about the future work on partition-centric computation and graph partitioning! And thanks for guiding me to DataSetUtils.zipWithUniqueIds()! I should switch to 0.10-SNAPSHOT :) Best, Martin On 22.08.2015

Re: Join hints for the Gelly functions

2015-08-22 Thread Martin Junghanns
Hi, I guess enforcing a Join Strategy by default is not the best option since you can't assume what the user did before actually calling the Gelly functions or what the data looks like (maybe it's one of the 1% graphs where the relation is the other way around or the vertex data set is very la

[jira] [Created] (FLINK-2411) Add basic graph summarization algorithm

2015-07-27 Thread Martin Junghanns (JIRA)
Martin Junghanns created FLINK-2411: --- Summary: Add basic graph summarization algorithm Key: FLINK-2411 URL: https://issues.apache.org/jira/browse/FLINK-2411 Project: Flink Issue Type: New