[ https://issues.apache.org/jira/browse/HIVE-12995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15197014#comment-15197014 ]
Lefty Leverenz commented on HIVE-12995: --------------------------------------- Doc note: This adds configuration parameter *hive.orc.splits.allow.synthetic.fileid* to HiveConf.java, so it will need to be documented in the ORC section of Configuration Properties for release 2.1.0. * [Configuration Properties -- ORC File Format | https://cwiki.apache.org/confluence/display/Hive/Configuration+Properties#ConfigurationProperties-ORCFileFormat] Should it also be mentioned in the llap documentation? (The parameter description doesn't say anything about llap.) * [LLAP design document | https://issues.apache.org/jira/secure/attachment/12665704/LLAPdesigndocument.pdf] attached to HIVE-7926 > LLAP: Synthetic file ids need collision checks > ---------------------------------------------- > > Key: HIVE-12995 > URL: https://issues.apache.org/jira/browse/HIVE-12995 > Project: Hive > Issue Type: Bug > Components: llap > Affects Versions: 2.1.0 > Reporter: Gopal V > Assignee: Sergey Shelukhin > Labels: TODOC2.1 > Fix For: 2.1.0 > > Attachments: HIVE-12995.01.patch, HIVE-12995.02.patch, > HIVE-12995.03.patch, HIVE-12995.04.patch, HIVE-12995.patch > > > LLAP synthetic file ids do not have any way of checking whether a collision > occurs other than a data-error. > Synthetic file-ids have only been used with unit tests so far - but they will > be needed to add cache mechanisms to non-HDFS filesystems. > In case of Synthetic file-ids, it is recommended that we track the full-tuple > (path, mtime, len) in the cache so that a cache-hit for the synthetic file-id > can be compared against the parameters & only accepted if those match. -- This message was sent by Atlassian JIRA (v6.3.4#6332)