Re: UUID in part files

2021-02-07 Thread Yun Gao
Hi Dan The SQL add the uuid by default is for the case that users want execute multiple bounded sql and append to the same directory (hive table), thus a uuid is attached to avoid overriding the previous output. The datastream could be viewed as providing the low-level api and thus it does not ad

UUID in part files

2021-02-07 Thread Dan Hill
Hi. *Context* I'm migrating my Flink SQL job to DataStream. When switching to StreamingFileSink, I noticed that the part files now do not have a uuid in them. "part-0-0" vs "part-{uuid string}-0-0". This is easy to add with OutputFileConfig. *Question* Is there a reason why the base OutputFile