Hi all,

We're interested in being able to filter files using the new FileSource API
<https://nightlies.apache.org/flink/flink-docs-release-1.14/api/java/org/apache/flink/connector/file/src/FileSource.html>.
Are there plans to add file filtering? If there is existing work, we would
be happy to help push it forward through contributions.

It seems like things are almost there. FileSource encapsulates filtering
functionality into FileEnumerator
<https://nightlies.apache.org/flink/flink-docs-release-1.14/api/java/org/apache/flink/connector/file/src/FileSource.html>.
However, the FileEnumerator cannot be supplied by the user; it is currently hard-coded
<https://github.com/apache/flink/blob/498ee85f970cb25fb905374cb6c577e9984adc30/flink-connectors/flink-connector-files/src/main/java/org/apache/flink/connector/file/src/FileSource.java#L213-L221>.
One potential way to enable file filtering would be to let users pass in a
custom FileEnumerator.
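
To make the idea concrete, here is a rough sketch of the shape we have in
mind. It deliberately does not use Flink's classes: SketchFileEnumerator,
FilteringEnumerator, SketchSourceBuilder and its setFileEnumerator method are
all hypothetical stand-ins invented for this example, and the real
FileEnumerator interface has a different signature.

```java
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical stand-in for an enumerator abstraction; NOT Flink's FileEnumerator.
interface SketchFileEnumerator {
    List<Path> enumerate(List<Path> candidates);
}

// Enumerator that keeps only the candidates accepted by a user-supplied predicate.
final class FilteringEnumerator implements SketchFileEnumerator {
    private final Predicate<Path> fileFilter;

    FilteringEnumerator(Predicate<Path> fileFilter) {
        this.fileFilter = fileFilter;
    }

    @Override
    public List<Path> enumerate(List<Path> candidates) {
        List<Path> accepted = new ArrayList<>();
        for (Path candidate : candidates) {
            if (fileFilter.test(candidate)) {
                accepted.add(candidate);
            }
        }
        return accepted;
    }
}

// Hypothetical builder showing where a custom enumerator could be plugged in.
final class SketchSourceBuilder {
    // Default behaviour in this sketch: accept every candidate file.
    private SketchFileEnumerator enumerator = candidates -> candidates;

    SketchSourceBuilder setFileEnumerator(SketchFileEnumerator enumerator) {
        this.enumerator = enumerator;
        return this;
    }

    // Returns the files the source would read after enumeration.
    List<Path> plan(List<Path> candidates) {
        return enumerator.enumerate(candidates);
    }
}
```

With something like this, a user could write
new SketchSourceBuilder().setFileEnumerator(new FilteringEnumerator(p -> p.getFileName().toString().endsWith(".csv")))
to read only CSV files, while the default would remain unchanged for everyone else.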

Thanks in advance,
Kevin
