Hi,

> on some days parquet was created by hive 2.1.1 and on some days it was 
> created by using glue
…
> After some drill down i saw schema of columns inside both type of parquet 
> file using parquet tool and found different data types for some column
...
> optional int32 action_date (DATE);
> optional binary action_date (UTF8);

Those two column types aren't convertible implicitly between each other, which 
is probably the problem.

You'll have to create 2 different external tables and separate the files into 
different directories, create tables with appropriate schema and write a view 
to do UNION ALL.

If your goal is to have good performance for Hive, it is best to write the data 
from Hive & maintain the schema on write.

Cheers,
Gopal


Reply via email to