sivabalan narayanan created HUDI-3391:
-----------------------------------------

             Summary: presto and hive beeline fails to read MOR table w/ 2 or 
more array fields
                 Key: HUDI-3391
                 URL: https://issues.apache.org/jira/browse/HUDI-3391
             Project: Apache Hudi
          Issue Type: Task
          Components: reader-core
            Reporter: sivabalan narayanan


We have an issue reported by user 
[here|[https://github.com/apache/hudi/issues/2657].] Looks like w/ 0.10.0 or 
later, spark datasource read works, but hive beeline does not work. Even 
spark.sql (hive table) querying works as well. 

Another related ticket: 
[https://github.com/apache/hudi/issues/3834#issuecomment-997307677]

 

Steps that I tried:

[https://gist.github.com/nsivabalan/fdb8794104181f93b9268380c7f7f079]

>From beeline, you will encounter below exception
{code:java}
Failed with exception 
java.io.IOException:org.apache.hudi.org.apache.avro.SchemaParseException: Can't 
redefine: array {code}
All linked ticket states that upgrading parquet to 1.11.0 or greater should 
work. We need to try it out w/ latest master and go from there. 

 

 

 

 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to