Hi, > on some days parquet was created by hive 2.1.1 and on some days it was > created by using glue … > After some drill down i saw schema of columns inside both type of parquet > file using parquet tool and found different data types for some column ... > optional int32 action_date (DATE); > optional binary action_date (UTF8);
Those two column types aren't convertible implicitly between each other, which is probably the problem. You'll have to create 2 different external tables and separate the files into different directories, create tables with appropriate schema and write a view to do UNION ALL. If your goal is to have good performance for Hive, it is best to write the data from Hive & maintain the schema on write. Cheers, Gopal