dongwoo.kim created FLINK-32760:
-----------------------------------

             Summary: Version Conflict in flink-sql-connector-hive for 
shaded.parquet prefix packages
                 Key: FLINK-32760
                 URL: https://issues.apache.org/jira/browse/FLINK-32760
             Project: Flink
          Issue Type: Bug
          Components: Connectors / Hive
            Reporter: dongwoo.kim
         Attachments: image-2023-08-05-14-50-47-806.png

h2. Summary

FLINK-23074 (https://issues.apache.org/jira/browse/FLINK-23074) appears to have 
shaded the Parquet dependency that comes from *hive-exec*. 

However, that shading is incomplete. It causes errors like the one below when I 
try to read a Parquet file using sql-gateway, which requires both *flink-parquet* 
and *flink-sql-connector-hive* as dependencies.

!image-2023-08-05-14-50-47-806.png|width=1392,height=909!

 
h2. Cause

The Parquet dependency includes not only {{org.apache.parquet}} classes but also 
dependencies relocated under the {{shaded.parquet}} prefix 
([ref|https://github.com/apache/parquet-mr/blob/515734c373f69b5250e8b63eb3d1c973da893b63/pom.xml#L72]).

- flink-parquet depends on Parquet 1.12.3 with shaded Thrift 0.16.0 
(prefix: {{shaded.parquet}})

- flink-sql-connector-hive depends on hive-exec 3.1.3 with Parquet 1.10.0 and 
shaded Thrift 0.9.3 (prefix: {{shaded.parquet}})

- Both Thrift copies live under the same {{shaded.parquet}} package names, so code 
compiled against Thrift 0.16.0 attempts to run against 0.9.3, causing the error.

h2. Proposed solution

Add a new relocation rule to the shading configuration of the 
flink-sql-connector-hive project.

I have confirmed that adding the rule below resolves the error above.
{code:xml}
<relocation>
     <pattern>shaded.parquet</pattern>
     <shadedPattern>shaded.parquet.flink.hive.shaded</shadedPattern>
</relocation>{code}
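
For context, here is a minimal sketch of where such a rule would sit in a maven-shade-plugin configuration. The surrounding layout is the standard maven-shade-plugin structure and is an assumption here, not a copy of the actual flink-sql-connector-hive pom.xml, which may declare other executions, filters, and relocations.
{code:xml}
<plugin>
    <groupId>org.apache.maven.plugins</groupId>
    <artifactId>maven-shade-plugin</artifactId>
    <executions>
        <execution>
            <!-- Assumed execution settings; the real module may differ. -->
            <phase>package</phase>
            <goals>
                <goal>shade</goal>
            </goals>
            <configuration>
                <relocations>
                    <!-- Move the shaded.parquet classes bundled via hive-exec
                         (Parquet 1.10.0, Thrift 0.9.3) to a connector-specific
                         prefix so they no longer collide with the classes
                         shipped by flink-parquet. -->
                    <relocation>
                        <pattern>shaded.parquet</pattern>
                        <shadedPattern>shaded.parquet.flink.hive.shaded</shadedPattern>
                    </relocation>
                </relocations>
            </configuration>
        </execution>
    </executions>
</plugin>{code}
Relocation also rewrites references inside the bundled bytecode. The classes packaged in the Hive connector jar should therefore keep resolving their own Thrift 0.9.3 copy under the new prefix, while flink-parquet continues to use {{shaded.parquet}} with Thrift 0.16.0.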
 

I would be happy to implement this if the proposal is accepted. Thanks.

 



