Hi everyone,
as you may know, a first minimal version of FLIP-24 [1] for the upcoming
Flink SQL Client has been merged to master. We also merged support for
discovering and configuring table sources without a single line of code,
using string-based properties [2] and Java service provider discovery.
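For illustration, here is a minimal sketch of how such service provider
discovery works in plain Java. The factory interface, method, and
property keys below are made up for this example and are not the actual
Flink API:

import java.util.Map;
import java.util.ServiceLoader;

// Hypothetical factory interface for illustration; the real Flink
// interface and property keys differ.
interface TableSourceFactory {
    // Whether this factory can handle the given string-based properties,
    // e.g. "connector.type" -> "kafka".
    boolean matches(Map<String, String> properties);
}

public class FactoryDiscovery {
    public static TableSourceFactory find(Map<String, String> properties) {
        // ServiceLoader reads META-INF/services entries from the classpath,
        // so adding a connector JAR is enough to make its factory visible.
        for (TableSourceFactory f : ServiceLoader.load(TableSourceFactory.class)) {
            if (f.matches(properties)) {
                return f;
            }
        }
        throw new IllegalArgumentException("No factory found for: " + properties);
    }
}

With this mechanism, a connector JAR only needs to ship the factory
implementation and a matching META-INF/services entry.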
We are now facing the question of how to manage dependencies in this
new environment. It differs from how regular Flink projects are created
(by setting up a new Maven project and building a jar or fat jar).
Ideally, a user should be able to select from a set of prepared
connectors, catalogs, and formats. E.g., if a Kafka connector and the
Avro format are needed, all that should be required is to move a
"flink-kafka.jar" and a "flink-avro.jar" into a "sql_lib" directory that
is shipped to the Flink cluster together with the SQL query.
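To make the idea concrete, here is a rough sketch of how the client
could pick up such JARs at startup. The directory handling and wiring
are my assumptions, not the merged implementation:

import java.io.File;
import java.net.URL;
import java.net.URLClassLoader;
import java.util.ArrayList;
import java.util.List;

public class SqlLibLoader {
    // Collects all JARs from the (hypothetical) sql_lib directory and
    // exposes them via a child class loader, so that ServiceLoader-based
    // discovery also sees factories shipped in those JARs.
    public static ClassLoader forDirectory(File sqlLibDir) throws Exception {
        List<URL> urls = new ArrayList<>();
        File[] jars = sqlLibDir.listFiles((dir, name) -> name.endsWith(".jar"));
        if (jars != null) {
            for (File jar : jars) {
                urls.add(jar.toURI().toURL());
            }
        }
        return new URLClassLoader(urls.toArray(new URL[0]),
                SqlLibLoader.class.getClassLoader());
    }
}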
The question is: how do we want to offer those JAR files in the future?
We see two options:
1) We prepare Maven build profiles for all offered modules and provide
a shell script for building fat jars. A script call could look like
"./sql-client-dependency.sh kafka 0.10"; it would automatically download
what is needed and place the JAR file in the library folder. This
approach would keep our development effort low, but it would require
Maven to be present and the builds to pass in different environments
(e.g., Windows).
2) We build fat jars for these modules with every Flink release and
host them somewhere (e.g., on Apache infrastructure, but not on Maven
Central). This would make it very easy to add a dependency by
downloading the prepared JAR files. However, it would require building
and hosting large fat jars for every connector (and version) with every
major and minor Flink release, and the size of such a repository might
grow quickly.
What do you think? Do you see other options to make adding dependencies
as easy as possible?
Regards,
Timo
[1] https://cwiki.apache.org/confluence/display/FLINK/FLIP-24+-+SQL+Client
[2] https://issues.apache.org/jira/browse/FLINK-8240