[ 
https://issues.apache.org/jira/browse/IMPALA-14700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18062381#comment-18062381
 ] 

Balazs Hevele commented on IMPALA-14700:
----------------------------------------

Patch: [https://gerrit.cloudera.org/#/c/24059/]
Note: this change requires a newer Parquet version than the one we currently
use (1.12.3), because the LZ4_RAW enum value is needed.

> Add support for Parquet's LZ4_RAW compression
> ---------------------------------------------
>
>                 Key: IMPALA-14700
>                 URL: https://issues.apache.org/jira/browse/IMPALA-14700
>             Project: IMPALA
>          Issue Type: Task
>          Components: Backend
>    Affects Versions: Impala 5.0.0
>            Reporter: Joe McDonnell
>            Assignee: Balazs Hevele
>            Priority: Major
>
> Parquet's current LZ4 compression wraps the compressed data in a framing
> mechanism from Hadoop. Parquet decided to deprecate this codec and instead
> introduced LZ4_RAW compression, which omits the Hadoop framing. See
> https://issues.apache.org/jira/browse/PARQUET-1996 /
> https://issues.apache.org/jira/browse/PARQUET-2032
> We should add support for reading and writing LZ4_RAW. This should be fairly
> simple, as LZ4_RAW just uses the LZ4 block compression directly. It should
> correspond to Lz4Compressor rather than Lz4BlockCompressor.



