danielhumanmod commented on PR #1652:
URL:
https://github.com/apache/datafusion-ballista/pull/1652#issuecomment-4368061240
> At the same time I am validating this change via TPC-H query
Run TPC-H Q3 against SF1 with target_partitions=8 on a single-executor local
cluster:
```
SET datafusion.execution.target_partitions = 8;
EXPLAIN ANALYZE
select l_orderkey, sum(l_extendedprice * (1 - l_discount)) as revenue,
o_orderdate, o_shippriority
from customer, orders, lineitem
where c_mktsegment = 'BUILDING' and c_custkey = o_custkey and l_orderkey =
o_orderkey
and o_orderdate < date '1995-03-15' and l_shipdate > date '1995-03-15'
group by l_orderkey, o_orderdate, o_shippriority
order by revenue desc, o_orderdate
limit 10;
```
Before
```
ShuffleReaderExec: partitioning: Hash([l_orderkey@0], 8),
metrics=[output_rows=24.08 M, elapsed_compute=478.20ms, output_bytes=0.0 B,
output_batches=0]
```
After:
```
ShuffleReaderExec: partitioning: Hash([l_orderkey@0], 8),
metrics=[output_rows=3.24 M, elapsed_compute=75.88ms, output_bytes=0.0 B,
output_batches=0]
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]