Hi Caizhi, Could you tell me more details about streaming joins that you suggested? Did you mean putting the Hive table data into a Kafka/Kinesis and joining the main stream with the hive table data streaming with a very long watermark?
In my use case, the hive table is an account dimension table and I wanted to join an event stream with the account dimension in Flink. I thought a lookup table source would work for my use case, but I got a performance problem as mentioned above. If there's a good solution, I'm open to it. I just need to confirm if Flink is not happy with a large Hive table data. Jason. On Sun, Feb 6, 2022 at 7:01 PM Caizhi Weng <tsreape...@gmail.com> wrote: > Hi! > > Each parallelism of the lookup operation will load all data from the > lookup table source, so you're loading 10GB of data to each parallelism and > storing them in JVM memory. That is not only slow but also very > memory-consuming. > > Have you tried joining your main stream with the hive table directly (that > is, using streaming joins instead of lookup joins)? Does that meet your > need or why do you have to use lookup joins? > > Jason Yi <93t...@gmail.com> 于2022年2月5日周六 08:01写道: > >> Hello, >> >> I created external tables on Hive with data in s3 and wanted to use those >> tables as a lookup table in Flink. >> >> When I used an external table containing a small size of data as a lookup >> table, Flink quickly loaded the data into TM memory and did a Temporal join >> to an event stream. But, when I put an external table containing ~10GB of >> data, Flink took so long to load the data and finally returned a timeout >> error. (I set the heartbeat.timeout to 200000) >> >> Is there a way to make Flink read Hive data faster? Or is this normal? >> MySQL lookup tables would be recommended when we have a large size of >> dimension data? >> >> Here's the test environment: >> - 1.14.0 Flink >> - EMR 6.5 >> - Hive 3.1.2 installed on EMR >> - Hive with a default MetaStore on EMR used. (Not MySQL or Glue >> Metastore) >> - Parquet source data in s3 for the external table on Hive >> >> Below is part of the Flink log produced while loading the Hive table >> data. Flink seemed to open one parquet file multiple times and moved to >> another parquet file to open. I wonder if this is normal. Why Flink didn't >> read data from multiple files in parallel. I'm not sure if this is a >> problem caused by the default Hive Metastore. >> >> ...... >> 2022-02-04 22:42:54,839 INFO >> org.apache.flink.table.filesystem.FileSystemLookupFunction [] - >> Populating lookup join cache >> 2022-02-04 22:42:54,839 INFO >> org.apache.flink.table.filesystem.FileSystemLookupFunction [] - >> Populating lookup join cache >> 2022-02-04 22:42:55,083 INFO org.apache.hadoop.mapred.FileInputFormat >> [] - Total input files to process : 12 >> 2022-02-04 22:42:55,084 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:42:55,096 INFO org.apache.hadoop.mapred.FileInputFormat >> [] - Total input files to process : 12 >> 2022-02-04 22:42:55,097 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:42:55,105 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:42:55,116 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:42:55,169 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:42:55,172 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:42:57,782 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:42:57,783 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:42:57,799 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:42:57,801 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:42:57,851 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:42:57,851 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:42:57,853 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:42:57,897 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:42:57,898 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:42:57,899 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:42:57,908 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:42:57,950 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:01,581 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:01,582 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:01,592 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:01,594 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:01,678 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:01,679 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:01,680 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:01,682 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:01,682 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:01,684 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:01,727 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:01,732 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:03,210 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:03,211 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:03,268 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:03,268 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:03,269 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:03,313 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:03,315 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:03,316 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:03,377 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:03,377 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:03,378 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:03,427 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:07,845 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:07,846 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:07,908 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:07,909 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:07,910 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:07,942 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:07,943 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:07,999 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:08,002 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:08,003 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:08,004 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:08,053 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:10,404 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:10,406 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:11,908 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:11,910 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:11,945 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:11,945 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:11,946 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:11,963 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:11,964 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:11,964 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:11,996 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:12,013 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:15,101 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:15,102 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:15,117 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:15,118 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:15,168 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:15,175 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:18,337 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:18,338 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:18,410 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:18,467 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:18,468 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:18,523 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00000-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:18,651 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:18,676 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00001-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:18,722 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00001-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:18,778 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:18,779 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00001-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:18,835 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00001-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:19,903 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:19,904 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00001-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:19,952 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00001-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:19,952 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:19,953 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00001-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:19,996 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00001-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:23,308 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:23,309 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00001-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:23,363 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00001-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:23,364 INFO >> org.apache.flink.connectors.hive.read.HiveTableInputFormat [] - Use >> flink parquet ColumnarRowData reader. >> 2022-02-04 22:43:23,364 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00001-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> 2022-02-04 22:43:23,406 INFO >> com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem [] - Opening >> 's3://bucket/path/to/files/part-00001-55f0ff62-bf83-4eac-8ce8-308bd9efda24-c000.snappy.parquet' >> for reading >> ...... >> >