[ 
https://issues.apache.org/jira/browse/HIVE-18340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16356426#comment-16356426
 ] 

Ke Jia commented on HIVE-18340:
-------------------------------

[~stakiar]:
{quote}Hive-on-Tez's has an implementation of DynamicValueRegistry that uses 
some special Tez APIs such as ProcessorContext#waitForAllInputsReady, how are 
we simulating this in HoS?
{quote}
[~kellyzly], yes. For HoS, I flush the runtime filter info (min/max and bloom 
filter) to HDFS in the SparkRuntimeFilterPruningSinkOperator and read it back 
from HDFS in SparkRuntimeFilterPruner. This mirrors how 
SparkPartitionPruningSinkOperator and SparkDynamicPartitionPruner work in 
Spark DPP.
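
As a rough illustration of this flush-then-read flow (not the code in the 
patch), the sketch below writes a min/max range and a toy bloom filter to a 
file and loads it back on the reading side. All names here 
(RuntimeFilterSketch, RuntimeFilter, flush, load, roundTrip) are hypothetical, 
a local temp file stands in for HDFS, and the two-hash bloom filter is a 
simplification rather than Hive's own BloomFilter class.

```java
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.BitSet;

public class RuntimeFilterSketch {

    /** Hypothetical runtime filter: a min/max range plus a toy bloom filter over long keys. */
    public static final class RuntimeFilter {
        static final int NUM_BITS = 1024;
        long min = Long.MAX_VALUE;
        long max = Long.MIN_VALUE;
        final BitSet bits = new BitSet(NUM_BITS);

        public void add(long key) {
            min = Math.min(min, key);
            max = Math.max(max, key);
            bits.set(hash1(key));
            bits.set(hash2(key));
        }

        /** False means the key is definitely absent; true means it may be present. */
        public boolean mightContain(long key) {
            if (key < min || key > max) {
                return false; // rejected by the min/max range alone
            }
            return bits.get(hash1(key)) && bits.get(hash2(key));
        }

        private static int hash1(long key) {
            return Math.floorMod(Long.hashCode(key * 0x9E3779B97F4A7C15L), NUM_BITS);
        }

        private static int hash2(long key) {
            return Math.floorMod(Long.hashCode(key * 0xC2B2AE3D27D4EB4FL) >>> 7, NUM_BITS);
        }
    }

    /** Sink side: serialize the filter to a file (the temp file stands in for HDFS). */
    public static void flush(RuntimeFilter filter, Path file) {
        try (DataOutputStream out = new DataOutputStream(Files.newOutputStream(file))) {
            out.writeLong(filter.min);
            out.writeLong(filter.max);
            byte[] bloom = filter.bits.toByteArray();
            out.writeInt(bloom.length);
            out.write(bloom);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Pruner side: read the filter back before scanning the probe-side table. */
    public static RuntimeFilter load(Path file) {
        try (DataInputStream in = new DataInputStream(Files.newInputStream(file))) {
            RuntimeFilter filter = new RuntimeFilter();
            filter.min = in.readLong();
            filter.max = in.readLong();
            byte[] bloom = new byte[in.readInt()];
            in.readFully(bloom);
            filter.bits.or(BitSet.valueOf(bloom));
            return filter;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Builds a filter from the given keys, flushes it, and loads it back. */
    public static RuntimeFilter roundTrip(long... keys) {
        RuntimeFilter built = new RuntimeFilter();
        for (long key : keys) {
            built.add(key);
        }
        try {
            Path file = Files.createTempFile("runtime-filter", ".bin");
            try {
                flush(built, file);
                return load(file);
            } finally {
                Files.deleteIfExists(file);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        RuntimeFilter filter = roundTrip(10L, 20L, 30L);
        System.out.println("20 may match: " + filter.mightContain(20L));
        System.out.println("5 may match:  " + filter.mightContain(5L));
    }
}
```

Keys outside the min/max range are always rejected. Keys inside the range that 
were never added can still pass as bloom filter false positives, so a filter 
like this can only skip rows safely, never prove a match.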

{quote}It would be nice to have some qtests to help visualize what the explain 
plan with RF would look like
{quote}

I have uploaded HIVE-18340.2.patch, which adds the qtest 
"spark_runtime_filter_pruning.q" and its expected output 
"spark_runtime_filter_pruning.q.out". 

Thanks [~stakiar], [~kellyzly] for your review!

 

> Dynamic Min-Max/BloomFilter runtime-filtering in HoS
> ----------------------------------------------------
>
>                 Key: HIVE-18340
>                 URL: https://issues.apache.org/jira/browse/HIVE-18340
>             Project: Hive
>          Issue Type: New Feature
>          Components: Spark
>    Affects Versions: 3.0.0
>            Reporter: Ke Jia
>            Assignee: Ke Jia
>            Priority: Major
>         Attachments: HIVE-18340.1.patch, HIVE-18340.2.patch
>
>
> Tez implemented Dynamic Min-Max/BloomFilter runtime-filtering in HIVE-15269 
> and we should implement the same in HoS.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
