alamb commented on code in PR #14245:
URL: https://github.com/apache/datafusion/pull/14245#discussion_r1932087589


##########
datafusion/sqllogictest/test_files/joins.slt:
##########
@@ -4264,8 +4266,10 @@ logical_plan
 physical_plan
 01)GlobalLimitExec: skip=0, fetch=2
 02)--NestedLoopJoinExec: join_type=Full, filter=c2@0 >= c2@1
-03)----MemoryExec: partitions=1, partition_sizes=[1]
-04)----MemoryExec: partitions=1, partition_sizes=[1]
+03)----GlobalLimitExec: skip=0, fetch=2

Review Comment:
   This plans is incorrect due to 
   - https://github.com/apache/datafusion/issues/14335
   
   (not this PR)



##########
datafusion/physical-optimizer/src/limit_pushdown.rs:
##########
@@ -247,7 +246,14 @@ pub fn pushdown_limit_helper(
             }
         } else {
             // Add fetch or a `LimitExec`:
-            global_state.satisfied = true;
+            // If the plan's children have limit, we shouldn't change the 
global state to true,
+            // because the children limit will be overridden if the global 
state is changed.
+            if pushdown_plan.children().iter().any(|child| {

Review Comment:
   It looks to me like most of the rest of this rule is implemented in terms of 
`LimitExec` (defined in this file) to abstract away directly looking for 
`GlobalLimitExec` and `LocalLimitExec`
   
   I think you could do something like this instead to be more consistent with 
the rest of the codebase
   
   ```rust
               if pushdown_plan.children().iter().any(|&child| {
                   extract_limit(child).is_some()
               }) {
                   global_state.satisfied = false;
               }
   ```
   
   However, this works too and fixes the bug so I think it is fine as is. 



##########
datafusion/sqllogictest/test_files/limit.slt:
##########
@@ -711,3 +711,37 @@ OFFSET 3 LIMIT 2;
 
 statement ok
 drop table ordered_table;
+
+# Test limit pushdown with subquery
+statement ok
+create table testSubQueryLimit (a int, b int) as values (1,2), (2,3), (3,4);
+
+query IIII
+select * from testSubQueryLimit as t1 join (select * from testSubQueryLimit 
limit 1) limit 10;
+----
+1 2 1 2
+2 3 1 2
+3 4 1 2
+
+query TT
+explain select * from testSubQueryLimit as t1 join (select * from 
testSubQueryLimit limit 1) limit 10;
+----
+logical_plan
+01)Limit: skip=0, fetch=10
+02)--Cross Join: 
+03)----SubqueryAlias: t1
+04)------Limit: skip=0, fetch=10
+05)--------TableScan: testsubquerylimit projection=[a, b], fetch=10
+06)----Limit: skip=0, fetch=1

Review Comment:
   this plan looks good to me -- the Limit 1 is still here



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to