tobixdev commented on issue #16799: URL: https://github.com/apache/datafusion/issues/16799#issuecomment-3270869325
Tested with [RDF Fusion](https://github.com/tobixdev/rdf-fusion). Anyone else observing significant performance regressions with the new NL Join? DataFusion 49: 4.2 ms <img width="1633" height="646" alt="Image" src="https://github.com/user-attachments/assets/f21a1d1d-8876-484d-b000-107bbe49481d" /> DataFusion 50: 190.2 ms <img width="1633" height="646" alt="Image" src="https://github.com/user-attachments/assets/8b2e247d-174d-4de1-995e-450b811872bd" /> What I can take away from the flamegraphs is that `ScalarValue::to_array_of_size` may require some optimization for `UnionArray`s and that I think something is different regarding the evaluation of physical expressions. We did not observe any changes to the execution plan for any query so I think this is due to the NL Join. I don't think that this necessarily prevents a release, as we are using DF with "unusual" data. However, the issue may warrant a follow-up ticket. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org