Reducing runtime of Flink planner

Niklas Teichmann Mon, 07 Jan 2019 05:06:03 -0800

Hi everybody,

I have a question concerning the planner for the Flink Table / Batch API.

At the moment I am trying to use a library called Cypher for Apache Flink (CAPF, https://github.com/soerenreichardt/cypher-for-apache-flink), a project that aims to implement the graph database query language Cypher on Apache Flink.

The problem is that the planner seemingly takes a very long time to plan and optimize the job created by CAPF. This example job in JSON format


https://pastebin.com/J84grsjc

takes about 20 minutes to plan and about 5 minutes to run on a 24 GB data set. That seems very long for a job of this size.


Do you have any idea why this is the case?
Is there a way to give the planner hints to reduce the planning time?

Thanks in advance!
Niklas