[jira] [Created] (FLINK-1939) Add Parquet Documentation to Wiki

2015-04-26 Thread Felix Neutatz (JIRA)
Felix Neutatz created FLINK-1939: Summary: Add Parquet Documentation to Wiki Key: FLINK-1939 URL: https://issues.apache.org/jira/browse/FLINK-1939 Project: Flink Issue Type: Task Co

Adding a new operator

2015-04-26 Thread Andra Lungu
Hey guys, I am trying to add a new runtime operator; To this end, I am following the guide here: http://ci.apache.org/projects/flink/flink-docs-master/internals/add_operator.html and the code itself. >From what I understood, the run() in ReduceDriver, for instance, should be called every time a

Re: Adding a new operator

2015-04-26 Thread Markus Holzemer
Hey Andrea, perhaps you are looking at the wrong ReduceDriver? As you can see in the DriverStrategy enum there is several different ReduceDrivers depending on the strategy the optimizer chooses. best, Markus 2015-04-26 12:26 GMT+02:00 Andra Lungu : > Hey guys, > > I am trying to add a new runtim

[jira] [Created] (FLINK-1940) StockPrice example cannot be visualized

2015-04-26 Thread Gyula Fora (JIRA)
Gyula Fora created FLINK-1940: - Summary: StockPrice example cannot be visualized Key: FLINK-1940 URL: https://issues.apache.org/jira/browse/FLINK-1940 Project: Flink Issue Type: Bug Com

Re: Adding a new operator

2015-04-26 Thread Andra Lungu
Yes Markus, ds.reduce() -> AllReduceDriver ds.groupBy().reduce() -> ReduceDriver It's very intuitive ;) On Sun, Apr 26, 2015 at 12:34 PM, Markus Holzemer < holzemer.mar...@googlemail.com> wrote: > Hey Andrea, > perhaps you are looking at the wrong ReduceDriver? > As you can see in the DriverStr

[jira] [Created] (FLINK-1941) Add documentation for Gelly-GSA

2015-04-26 Thread Vasia Kalavri (JIRA)
Vasia Kalavri created FLINK-1941: Summary: Add documentation for Gelly-GSA Key: FLINK-1941 URL: https://issues.apache.org/jira/browse/FLINK-1941 Project: Flink Issue Type: Task Comp

[jira] [Created] (FLINK-1942) Add configuration options to Gelly-GSA

2015-04-26 Thread Vasia Kalavri (JIRA)
Vasia Kalavri created FLINK-1942: Summary: Add configuration options to Gelly-GSA Key: FLINK-1942 URL: https://issues.apache.org/jira/browse/FLINK-1942 Project: Flink Issue Type: Improvement

[jira] [Created] (FLINK-1943) Add Gelly-GSA compiler and translation tests

2015-04-26 Thread Vasia Kalavri (JIRA)
Vasia Kalavri created FLINK-1943: Summary: Add Gelly-GSA compiler and translation tests Key: FLINK-1943 URL: https://issues.apache.org/jira/browse/FLINK-1943 Project: Flink Issue Type: Test

[jira] [Created] (FLINK-1944) Add a Gelly-GSA PageRank example

2015-04-26 Thread Vasia Kalavri (JIRA)
Vasia Kalavri created FLINK-1944: Summary: Add a Gelly-GSA PageRank example Key: FLINK-1944 URL: https://issues.apache.org/jira/browse/FLINK-1944 Project: Flink Issue Type: Task Affects V

how can handle left outer join for any two dataset this dataset inlcude any filed number

2015-04-26 Thread hager sallah
how can handle left outer join for any two dataset this dataset inlcude any filed number example on two dataset data set one ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment(); DataSet> customer=env.readCsvFile("/home/hadoop/Desktop/Dataset/customer.csv") .fieldDelimiter('

Re: how can handle left outer join for any two dataset this dataset inlcude any filed number

2015-04-26 Thread Alexander Alexandrov
Hey there, Please use the user mailing list for user-related questions (this list is for Flink internals only). At the moment outer joins are not directly supported in Flink, but there are good indications that this will change in the next 4-8 weeks. For the time being, you can use a CoGroup with

Re: Join with a custom predicate

2015-04-26 Thread Alexander Alexandrov
I thought about your problem over the weekend. Unfortunately the algorithm that you describe does not fit "regular" equi-join semantics, but I think it could be "fitted" with a more complex dataflow. To achieve that, I would partition the (active) domain of the two datasets on fine-granular interv