Re: Parquet data stream group converter error

2020-07-04 Thread Jesse Lord
@flink.apache.org Subject: Re: Parquet data stream group converter error Hi, > MessageType schema = reader.getFooter().getFileMetaData().getSchema(); The first thing I'd suggest is to verify that the file contains a valid schema and can be read by some other program, e.g. parquet-tools s

Re: Parquet data stream group converter error

2020-07-03 Thread Khachatryan Roman
Hi, > MessageType schema = reader.getFooter().getFileMetaData().getSchema(); The first thing I'd suggest is to verify that the file contains a valid schema and can be read by some other program, e.g. parquet-tools schema or cat [1]. Regards, Roman On Thu, Jul 2, 2020 at 11:36 PM Jesse Lord wro

Parquet data stream group converter error

2020-07-02 Thread Jesse Lord
I am trying to read a parquet file into a datastream and then register that stream as a temporary table. This file is created by spark 2.4 in HDFS on AWS EMR. I am using flink version 1.10.0 with EMR 5.30. I am getting the following error: Caused by: org.apache.flink.streaming.runtime.tasks.Asy