Re: BucketingSink broken in flink 1.4.0 ?

Chesnay Schepler Wed, 10 Jan 2018 05:39:38 -0800

Your analysis looks correct, the code in question will never properlydetect hadoop file systems. I'll open a jira.

Your suggestion to replace it with getUnguardedFileSystem() was my firstinstinct as well.


Good job debugging this.

On 10.01.2018 14:17, jelmer wrote:

Hi I am trying to convert some jobs from flink 1.3.2 to flink 1.4.0
But i am running into the issue that the bucketing sink will alwaystry and connect to hdfs://localhost:12345/ instead of the hfds url ihave specified in the constructor
If i look at the code at
https://github.com/apache/flink/blob/master/flink-connectors/flink-connector-filesystem/src/main/java/org/apache/flink/streaming/connectors/fs/bucketing/BucketingSink.java#L1125
It tries to create the hadoop filesystem like this
final org.apache.flink.core.fs.FileSystem flinkFs =org.apache.flink.core.fs.FileSystem.get(path.toUri());
final FileSystem hadoopFs = (flinkFs instanceof HadoopFileSystem) ?
((HadoopFileSystem) flinkFs).getHadoopFileSystem() : null;

But FileSystem.getUnguardedFileSystem will always return a
But FileSystem.get will always return a SafetyNetWrapperFileSystem sothe instanceof check will never indicate that its a hadoop filesystem
Am i missing something or is this a bug and if so what would be thecorrect fix ? I guess replacing FileSystem.get withFileSystem.getUnguardedFileSystem would fix it but I am afraid I lackthe context to know if that would be safe

Re: BucketingSink broken in flink 1.4.0 ?

Reply via email to