Re: Compress DataSink Output

2016-08-19 Thread Wesley Kerr
That looks good. Thanks! On Fri, Aug 19, 2016 at 6:15 AM Robert Metzger wrote: > Hi Wes, > > Flink's own OutputFormats don't support compression, but we have some > tools to use Hadoop's OutputFormats with Flink [1], and those support > compression: > https://hadoop.apache.org/docs/stable/api/o

Re: Compress DataSink Output

2016-08-19 Thread Robert Metzger
Hi Wes, Flink's own OutputFormats don't support compression, but we have some tools to use Hadoop's OutputFormats with Flink [1], and those support compression: https://hadoop.apache.org/docs/stable/api/org/apache/hadoop/mapreduce/lib/output/FileOutputFormat.html Let me know if you need more info

Compress DataSink Output

2016-08-17 Thread Wesley Kerr
Hello - Forgive me if this has been asked before, but I'm trying to determine the best way to add compression to DataSink Outputs (starting with TextOutputFormat). Realistically I would like each partition file (based on parallelism) to be compressed independently with gzip, but am open to other