For MySQL you would either want to use Debezium's connector (which can
handle bulk dump + incremental CDC, but requires direct access to the
binlog) or the JDBC connector (does an initial bulk dump + incremental
queries, but has limitations compared to a "true" CDC solution).
Sqoop and the JDBC co
I'm looking for ways to bulk/incremental import from MySQL database to
HDFS. Currently I got Sqoop that does the bulk import creating a Hive
table.
Wanted to know the pros/cons of using JDBC connector instead of Sqoop and
are there any MySQL config changes expected (like binlog configuration in
th