Jingsong Lee created FLINK-22202: ------------------------------------ Summary: Thread safety in ParquetColumnarRowInputFormat Key: FLINK-22202 URL: https://issues.apache.org/jira/browse/FLINK-22202 Project: Flink Issue Type: Bug Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile) Reporter: Jingsong Lee Assignee: Jingsong Lee Fix For: 1.13.0
In a {{VectorizedColumnBatch}}, the dictionary will be lazied deserialized. If there are multiple batches at the same time, there may be thread safety problems, because the deserialization of the dictionary depends on some internal structures. We need set numBatchesToCirculate to 1 for ParquetColumnarRowInputFormat. -- This message was sent by Atlassian Jira (v8.3.4#803005)