please provide the detail way to generate the tpcds data in TPCDSQueryBenchmark

2023-05-15 Thread zhangliyun
hi i want to set up a tpcds benchmark to test some performance of some spark feature i saw in TPCDSQueryBenchmark , it need send the --data-location to the class, my question is how to generate the tpcds data in this benchmark ``` /** * Benchmark to measure TPCDS query performance. * To

Re: please provide the detail way to generate the tpcds data in TPCDSQueryBenchmark

2023-05-15 Thread Manu Zhang
Hi Kelly, You may follow the steps in the benchmark GitHub workflow https://github.com/apache/spark/blob/master/.github/workflows/benchmark.yml Regards, Manu On Mon, May 15, 2023 at 5:49 PM zhangliyun wrote: > hi > > i want to set up a tpcds benchmark to test some performance of some > spark