mertak-synnada commented on code in PR #14499:
URL: https://github.com/apache/datafusion/pull/14499#discussion_r1942597562


##########
datafusion/sqllogictest/test_files/aggregate.slt:
##########
@@ -5032,18 +5032,17 @@ logical_plan
 03)----Aggregate: groupBy=[[aggregate_test_100.c3]], 
aggr=[[min(aggregate_test_100.c1)]]
 04)------TableScan: aggregate_test_100 projection=[c1, c3]
 physical_plan
-01)GlobalLimitExec: skip=0, fetch=5

Review Comment:
   Yes, as @zhuqi-lucas  mentioned, only Limit operators support skip, and 
limit_pushdown is adding a Limit operator if skip exists, so this is only 
affecting plans without skip.
   
   Here's the query result with skip:
   
   ```
   query TT
   EXPLAIN SELECT DISTINCT c3, min(c1) FROM aggregate_test_100 group by c3 
limit 5 offset 3;
   ----
   logical_plan
   01)Limit: skip=3, fetch=5
   02)--Aggregate: groupBy=[[aggregate_test_100.c3, 
min(aggregate_test_100.c1)]], aggr=[[]]
   03)----Aggregate: groupBy=[[aggregate_test_100.c3]], 
aggr=[[min(aggregate_test_100.c1)]]
   04)------TableScan: aggregate_test_100 projection=[c1, c3]
   physical_plan
   01)GlobalLimitExec: skip=3, fetch=5
   02)--CoalescePartitionsExec: fetch=8
   03)----AggregateExec: mode=FinalPartitioned, gby=[c3@0 as c3, 
min(aggregate_test_100.c1)@1 as min(aggregate_test_100.c1)], aggr=[], lim=[8]
   04)------CoalesceBatchesExec: target_batch_size=8192
   05)--------RepartitionExec: partitioning=Hash([c3@0, 
min(aggregate_test_100.c1)@1], 4), input_partitions=4
   06)----------AggregateExec: mode=Partial, gby=[c3@0 as c3, 
min(aggregate_test_100.c1)@1 as min(aggregate_test_100.c1)], aggr=[], lim=[8]
   07)------------AggregateExec: mode=FinalPartitioned, gby=[c3@0 as c3], 
aggr=[min(aggregate_test_100.c1)]
   08)--------------CoalesceBatchesExec: target_batch_size=8192
   09)----------------RepartitionExec: partitioning=Hash([c3@0], 4), 
input_partitions=4
   10)------------------AggregateExec: mode=Partial, gby=[c3@1 as c3], 
aggr=[min(aggregate_test_100.c1)]
   11)--------------------RepartitionExec: partitioning=RoundRobinBatch(4), 
input_partitions=1
   12)----------------------CsvExec: file_groups={1 group: 
[[WORKSPACE_ROOT/testing/data/csv/aggregate_test_100.csv]]}, projection=[c1, 
c3], has_header=true
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to