paleolimbot commented on PR #24: URL: https://github.com/apache/sedona-db/pull/24#issuecomment-3276193755
I can see how there are two types of benchmarks that are equally valuable: - Integration-style benchmarks that use the defaults and read from Parquet (e.g., check the perceived speed of a relatively realistic query) - Unit-style benchmarks that are just a way to check if our particular implementation of a scalar function/iteration overhead is reasonable. Running these from memory on one thread (or the same number of threads) is possibly more comparable but forcing a single thread is maybe unrealistic because in practice some of our per-batch and per-item overhead is amortized over multiple threads and is possibly not something we should spend time optimizing yet. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
