Re: [PR] Run queries in python benchmarks using only one thread [sedona-db]

via GitHub Wed, 10 Sep 2025 12:09:00 -0700


paleolimbot commented on PR #24:
URL: https://github.com/apache/sedona-db/pull/24#issuecomment-3276193755


   I can see how there are two types of benchmarks that are equally valuable:
   
   - Integration-style benchmarks that use the defaults and read from Parquet 
(e.g., check the perceived speed of a relatively realistic query)
   - Unit-style benchmarks that are just a way to check if our particular 
implementation of a scalar function/iteration overhead is reasonable. Running 
these from memory on one thread (or the same number of threads) is possibly 
more comparable but forcing a single thread is maybe unrealistic because in 
practice some of our per-batch and per-item overhead is amortized over multiple 
threads and is possibly not something we should spend time optimizing yet.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [PR] Run queries in python benchmarks using only one thread [sedona-db]

Reply via email to