Re: [I] ExternalSorter Fails to Spill Dictionaries [datafusion]

2025-03-03 Thread via GitHub
comphead closed issue #4658: ExternalSorter Fails to Spill Dictionaries URL: https://github.com/apache/datafusion/issues/4658 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [I] ExternalSorter Fails to Spill Dictionaries [datafusion]

2025-02-24 Thread via GitHub
davidhewitt commented on issue #4658: URL: https://github.com/apache/datafusion/issues/4658#issuecomment-2680773931 Ok great, I'll work on that today 👍 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

Re: [I] ExternalSorter Fails to Spill Dictionaries [datafusion]

2025-02-24 Thread via GitHub
tustvold commented on issue #4658: URL: https://github.com/apache/datafusion/issues/4658#issuecomment-2679171994 > I assume by row format you mean [arrow-row](https://arrow.apache.org/rust/arrow_row/index.html), however it's not clear to me if there's a standard way to serialize these to a

Re: [I] ExternalSorter Fails to Spill Dictionaries [datafusion]

2025-02-24 Thread via GitHub
davidhewitt commented on issue #4658: URL: https://github.com/apache/datafusion/issues/4658#issuecomment-2678889088 It looks like an alternative to using the row format in datafusion might be to support delta-encoded dictionaries in `arrow-rs`. https://github.com/apache/arrow-rs/issues/6783

Re: [I] ExternalSorter Fails to Spill Dictionaries [datafusion]

2025-02-24 Thread via GitHub
davidhewitt commented on issue #4658: URL: https://github.com/apache/datafusion/issues/4658#issuecomment-2678487900 @tustvold we're hitting this issue in present-day at Pydantic. It's possibly been made more relevant by the [recent fixes to spill heuristics](https://github.com/apache/datafu