BELUGA BEHR created HIVE-16663:
----------------------------------
Summary: String Caching For Rows
Key: HIVE-16663
URL: https://issues.apache.org/jira/browse/HIVE-16663
Project: Hive
Issue Type: Improvement
Components: Beeline
Affects Versions: 2.0.1
Reporter: BELUGA BEHR
Priority: Minor
Query result sets very commonly contain many repeated values. Beeline
currently makes no attempt to cache these values, so each repeated value is
stored separately and memory consumption is high.
Adding a string cache may save a lot of memory. Some organizations use
Beeline for ETL processing, exporting result sets to CSV, and this change
would better support them.
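
As a rough illustration of the idea only, a string cache could look like the
sketch below. This is not Beeline code: the class name RowStringCache, the
dedupe method, and the maxEntries bound are all hypothetical, and the sketch
assumes column values arrive as java.lang.String objects that are buffered in
memory before being written out.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of Beeline: hands back one shared String
// instance for each distinct value it has seen, up to a fixed number of entries.
final class RowStringCache {
    private final Map<String, String> cache = new HashMap<>();
    private final int maxEntries;

    RowStringCache(int maxEntries) {
        this.maxEntries = maxEntries;
    }

    // Returns the cached instance equal to value if one exists; otherwise
    // caches value (while there is room) and returns it unchanged.
    String dedupe(String value) {
        if (value == null) {
            return null;
        }
        String cached = cache.get(value);
        if (cached != null) {
            return cached;
        }
        if (cache.size() < maxEntries) {
            cache.put(value, value);
        }
        return value;
    }

    int size() {
        return cache.size();
    }
}
```

Passing every column value through dedupe while rows are buffered would let
equal values share a single object. The entry limit is one possible way to
keep high-cardinality columns from growing the cache without bound.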
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)