Hi guys I'm opening this thread to discuss supporting the Apache Arrow format in Flink. Arrow is a language-independent columnar memory format that has become widely used in different systems, and It can also serve as an inter-exchange format between other systems. So, using it directly in the Flink system will be nice. We also received some requests from slack[1][2] and jira[3]. In our company's internal usage, we have used flink-python moudle's ArrowReader and ArrowWriter to support this work. But it still can not integrate with the current flink-formats framework closely. So, I'd like to introduce the flink-arrow formats module to support the arrow format naturally. Looking forward to some suggestions.
Best, Aitozi [1]: https://apache-flink.slack.com/archives/C03GV7L3G2C/p1677915016551629 [2]: https://apache-flink.slack.com/archives/C03GV7L3G2C/p1666326443152789 [3]: https://issues.apache.org/jira/browse/FLINK-10929