Re: Sqoop-like module in Flink

2016-04-25 Thread Fabian Hueske
Hi Flavio, sorry for not replying earlier. I think there is definitely need to improve the JdbcInputFormat. All your points wrt to the current JdbcInputFormat are valid and fixing them would be a big improvement and highly welcome contribution, IMO. I am not so sure about adding a flink-sqoop mod

Re: Sqoop-like module in Flink

2016-04-15 Thread Stefano Bortoli
Hi Flavio, I think this can be very handy when you have to run jobs Sqoop-like but you need to run the process with few resources. As for Cascading, Flink could do the heavy-lifting and make the scan of large relational databases more robust. Of course to make it work in real world, the JDBC Input

Sqoop-like module in Flink

2016-04-13 Thread Flavio Pompermaier
Hi to all, we've recently migrated our sqoop[1] import process to a Flink job, using an improved version of the Flink JDBC Input Format[2] that is able to exploit the parallelism of the cluster (the current Flink version implements NonParallelInput). Still need to improve the mapping part of sql t