[ 
https://issues.apache.org/jira/browse/IMPALA-12108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joe McDonnell reassigned IMPALA-12108:
--------------------------------------

    Assignee: Joe McDonnell

> Add support for writing data with LZ4's high compression mode
> -------------------------------------------------------------
>
>                 Key: IMPALA-12108
>                 URL: https://issues.apache.org/jira/browse/IMPALA-12108
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>    Affects Versions: Impala 4.3.0
>            Reporter: Joe McDonnell
>            Assignee: Joe McDonnell
>            Priority: Major
>              Labels: ramp-up
>
> LZ4 has a high compression mode that gets higher compression ratios than 
> Snappy while maintaining high decompression speeds. The tradeoff is that 
> compression is very slow. We should add support for writing data with LZ4 
> high compression mode. This would let us get a sense of the performance for 
> writing and reading.
> See this benchmark on the LZ4 page:
> https://github.com/lz4/lz4#benchmarks
> In my hand tests, Parquet/LZ4 is about 13% smaller than Parquet/Snappy, but 
> it retains the fast decompression.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to