Jingsong Lee created FLINK-19121: ------------------------------------ Summary: Avoid access HDFS in HiveBulkWriterFactory Key: FLINK-19121 URL: https://issues.apache.org/jira/browse/FLINK-19121 Project: Flink Issue Type: New Feature Components: Connectors / Hive Affects Versions: 1.11.1, 1.12.0 Reporter: Jingsong Lee
In HadoopPathBasedBulkWriter, getSize will invoke `FileSystem.exists` and `FileSystem.getFileStatus`, but it is invoked per record. There will be lots of visits to HDFS, may make HDFS pressure too high. -- This message was sent by Atlassian Jira (v8.3.4#803005)