[ 
https://issues.apache.org/jira/browse/HIVE-3153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13425553#comment-13425553
 ] 

Ashutosh Chauhan edited comment on HIVE-3153 at 7/31/12 6:12 AM:
-----------------------------------------------------------------

Nopes, this doesn't change format of file in any way, thus no backward 
compatibility issue. It enables lazy initialization of codecs from init() to 
flushRecords(), where they are actually needed, thus decreasing the memory 
footprint. 
                
      was (Author: ashutoshc):
    Nopes, this doesn't change format of file in any way. It enables lazy 
initialization of codecs from init() to flushRecords(), where they are actually 
needed, thus decreasing the memory footprint. 
                  
> Release codecs and output streams between flushes of RCFile
> -----------------------------------------------------------
>
>                 Key: HIVE-3153
>                 URL: https://issues.apache.org/jira/browse/HIVE-3153
>             Project: Hive
>          Issue Type: Improvement
>          Components: Compression
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>         Attachments: hive-3153.patch
>
>
> Currently, RCFile writer holds a compression codec per a file and a 
> compression output stream per a column. Especially for queries that use 
> dynamic partitions this quickly consumes a lot of memory.
> I'd like flushRecords to get a codec from the pool and create the compression 
> output stream in flushRecords.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to