danielhumanmod commented on PR #1652:
URL: 
https://github.com/apache/datafusion-ballista/pull/1652#issuecomment-4368061240

   > At the same time I am validating this change via TPC-H query
   
   Run TPC-H Q3 against SF1 with target_partitions=8 on a single-executor local 
cluster:
   ```
   SET datafusion.execution.target_partitions = 8;
   
   EXPLAIN ANALYZE
   select l_orderkey, sum(l_extendedprice * (1 - l_discount)) as revenue, 
o_orderdate, o_shippriority
   from customer, orders, lineitem
   where c_mktsegment = 'BUILDING' and c_custkey = o_custkey and l_orderkey = 
o_orderkey
     and o_orderdate < date '1995-03-15' and l_shipdate > date '1995-03-15'
   group by l_orderkey, o_orderdate, o_shippriority
   order by revenue desc, o_orderdate
   limit 10;
   ```
   
   Before
   ```
    ShuffleReaderExec: partitioning: Hash([l_orderkey@0], 8), 
metrics=[output_rows=24.08 M, elapsed_compute=478.20ms, output_bytes=0.0 B, 
output_batches=0]        
   ```
   
   After:
   ```
    ShuffleReaderExec: partitioning: Hash([l_orderkey@0], 8), 
metrics=[output_rows=3.24 M, elapsed_compute=75.88ms, output_bytes=0.0 B, 
output_batches=0]        
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to