I'm in favor of the confusingly-named version='2.0' default. I note that such decisions are hampered by our lack of integration / compatibility testing with other Parquet consumers to know whether they will understand all of the data that we write.
On Tue, Dec 15, 2020 at 10:50 AM Antoine Pitrou <anto...@python.org> wrote: > > > Le 15/12/2020 à 17:46, Joris Van den Bossche a écrit : > > > > No, I actually mean 2.2. (I don't think a 1.2 version exists, at least > > according to the git tags) > > You can compare the thrift file for version 2.1 ( > > https://github.com/apache/parquet-format/blob/parquet-format-2.1.0/src/thrift/parquet.thrift) > > vs 2.2 ( > > https://github.com/apache/parquet-format/blob/apache-parquet-format-2.2.0/src/thrift/parquet.thrift), > > in which a set of ConvertedTypes where added. > > I don't understand. In which version did logical types appear then? > > (frankly, Parquet features are a mess to navigate) > > Regards > > Antoine.