Hello, I am looking to contribute a ProtoParquetWriter support which can be used in Bulk format for the StreamingFileSink api. There has been earlier discussions on this in the user mailing list: https://goo.gl/ya2StL and thought it would be a good addition to have.
For implementation, looking at the current APIs present at ProtoParquetWriter with the parguet project (http://tinyurl.com/y378be42), it looks like there is some different in the interface between Avro and Proto writes (ProtoParquetWriter does not have a builder class as well as not interface with Outputfile). Due to this, I was looking at directly extending the ParquetWriter within Flink to define the Builder static class and have newer interfaces. This is needed as the bulk writer takes a builder to crate the ParquetWriter in the bulkWriter.Factory. ( http://tinyurl.com/yyg9cn9b) Any thoughts if this is a reasonable approach? Thanks Kailash