2010YOUY01 commented on PR #14902: URL: https://github.com/apache/datafusion/pull/14902#issuecomment-2687143242
> > The data generation will take a long time for big data.
>
> How bad is it? I can try to dig into the problem and try to improve it on the side of `falsa` (generation library).

I tried generating the 4 join tables in the largest dataset; it took around 8 minutes on my MacBook with an M4 Pro chip. I think this is definitely not a problem for our benchmarking use case.