Re: Containment Join Support

2015-07-18 Thread Vasiliki Kalavri
Hi Martin, I'm really glad to see that you've started using Gelly :) I think that a graph summarization library method would be a great addition! Let me know if you need help and if you want to discuss ideas or other methods. Cheers, Vasia. On 17 July 2015 at 12:25, Martin Junghanns wrote: >

Re: Containment Join Support

2015-07-17 Thread Martin Junghanns
Hi Fabian, hi Stephen, thanks for answering my question. Good hint with the list replication, I will benchmark this vs. cross + filter. Best, Martin Am 17.07.2015 um 11:17 schrieb Stephan Ewen: I would rewrite this to replicate the list into tuples: "foreach x in list: emit (x, list)" Then

Re: Containment Join Support

2015-07-17 Thread Stephan Ewen
I would rewrite this to replicate the list into tuples: "foreach x in list: emit (x, list)" Then join on fields 0. This replicates the lists, but makes the join very efficient. On Fri, Jul 17, 2015 at 12:26 AM, Fabian Hueske wrote: > Hi Martin, > > good to hear that you like Flink :-) > AFAIK,

Re: Containment Join Support

2015-07-16 Thread Fabian Hueske
Hi Martin, good to hear that you like Flink :-) AFAIK, there are no plans to add a containment join. The Flink community is currently working on adding support for outer joins. Regarding a containment join, I am not sure about the number of use cases. I would rather try to implement it on top of F

Containment Join Support

2015-07-16 Thread Martin Junghanns
Hi everyone, at first, thanks for building this great framework! We are using Flink and especially Gelly for building a graph analytics stack (gradoop.com). I was wondering if there is a [planned] support for a containment join operator. Consider the following example: DataSet> left := {[0, 1],