github
Thread
Date
Later messages
Messages by Date
2026/06/18
Re: [PR] IN LIST: reinterpret small-width types for bitmap filters [datafusion]
via GitHub
2026/06/18
Re: [PR] IN LIST: add direct-probe hash filter for large primitive lists [datafusion]
via GitHub
2026/06/18
[PR] IN LIST: unify bitmap filter implementations [datafusion]
via GitHub
2026/06/18
Re: [I] Avoid concatenating record batches in joins to alleviate memory pressure [datafusion]
via GitHub
2026/06/18
Re: [PR] IN LIST: add branchless filter for small primitive lists [datafusion]
via GitHub
2026/06/18
[PR] refactor: centralize date_bin per-row mapping [datafusion]
via GitHub
2026/06/18
Re: [PR] feat(unparser): support binary literals [datafusion]
via GitHub
2026/06/18
Re: [I] Unparser: support unparsing binary scalars [datafusion]
via GitHub
2026/06/18
[PR] chore(deps): bump actions/checkout from 6.0.3 to 7.0.0 [datafusion-ballista]
via GitHub
2026/06/18
Re: [I] [Spark] SparkWidthBucket return_type is Int32, should be Int64 to match Spark [datafusion]
via GitHub
2026/06/18
Re: [PR] bugfix: changed return type of spark's width_bucket to i64 [datafusion]
via GitHub
2026/06/18
Re: [I] pref: Use builtin compression for arrow ipc writer [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat(unparser): support binary literals [datafusion]
via GitHub
2026/06/18
Re: [PR] bugfix: changed return type of spark's width_bucket to i64 [datafusion]
via GitHub
2026/06/18
Re: [I] array_union result ordering versus DataFusion is unverified [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] chore: fix `ConstantFolding` rule exclusion for benchmarks [datafusion-comet]
via GitHub
2026/06/18
Re: [I] chore: Remove invalid `spark.sql.optimizer.constantFolding.enabled` configuration from java benchmarks. [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] chore: add ordering tests for `array_union` [datafusion-comet]
via GitHub
2026/06/18
[PR] feat: add docker-compose.quick.yml and fix onboarding docs [datafusion-ballista]
via GitHub
2026/06/18
[PR] fix: round large UInt64 values without narrowing [datafusion]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [PR] fix: Correct array_contains behavior for Spark-style null semantics [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] perf: do not build parquet pruning predicates if no page index [datafusion]
via GitHub
2026/06/18
Re: [PR] Io dynamic [datafusion]
via GitHub
2026/06/18
Re: [PR] Add DecomposeAggregate optimizer to rewrite AVG as SUM/COUNT [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: initialize TopK dynamic filter threshold from parquet statistics [datafusion]
via GitHub
2026/06/18
[PR] Join avoid concat [datafusion]
via GitHub
2026/06/18
[I] Avoid concatenating record batches in joins to alleviate memory pressure [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: add datafusion-json crate with json_get_str scaffolding [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: add datafusion-json crate with json_get_str scaffolding [datafusion]
via GitHub
2026/06/18
Re: [I] Natively support time-window grouping expressions: window, session_window, window_time [datafusion-comet]
via GitHub
2026/06/18
Re: [I] CI is broken [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat: logical plan protobuf representation for range repartitioning [datafusion]
via GitHub
2026/06/18
Re: [I] Support logical protobuf serialization for range repartitioning [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: logical plan protobuf representation for range repartitioning [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: logical plan protobuf representation for range repartitioning [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: logical plan protobuf representation for range repartitioning [datafusion]
via GitHub
2026/06/18
[PR] feat: logical plan protobuf representation for range repartitioning [datafusion]
via GitHub
2026/06/18
Re: [I] Snapshot tests in physical_optimizer are not deterministic across CPU-count environments [datafusion]
via GitHub
2026/06/18
Re: [I] Snapshot tests in physical_optimizer are not deterministic across CPU-count environments [datafusion]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [I] OpenLineage support [datafusion]
via GitHub
2026/06/18
[PR] chore: fix `ConstantFolding` rule exclusion [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/06/18
Re: [PR] chore: tweak CI execution memory params [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] Add optional native Lance scan support [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat: Stage based fallback [datafusion-comet]
via GitHub
2026/06/18
[PR] chore: add ordering tests for `array_union` [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] chore: add array tests with NaN handling [datafusion-comet]
via GitHub
2026/06/18
Re: [I] [Bug] array_distinct / array_union / array_except do not canonicalize NaN like Spark [datafusion-comet]
via GitHub
2026/06/18
Re: [I] [Bug] array_max and array_min disagree with Spark on NaN ordering [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] refactor: move array expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat: Implement map-to-string casting [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] fix: exclude release scratch dirs from RAT and license skill docs [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/18
Re: [PR] IN LIST: add two-stage filter for Utf8 and LargeUtf8 [datafusion]
via GitHub
2026/06/18
Re: [PR] IN LIST: add two-stage filter for Utf8 and LargeUtf8 [datafusion]
via GitHub
2026/06/18
Re: [PR] [physical-plan]: remove deprecated spill_record_batch_by_size [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/18
[PR] [physical-plan]: remove deprecated spill_record_batch_by_size [datafusion]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/18
Re: [PR] refactor: Simplify `approx_distinct` (-200 LoC) [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/18
Re: [I] Unify LRU memory-limiting caches into one generic cache [datafusion]
via GitHub
2026/06/18
Re: [I] Unify LRU memory-limiting caches into one generic cache [datafusion]
via GitHub
2026/06/18
[PR] chore: tweak CI execution memory params [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat: Native Parquet Iceberg Data File Writes In Comet [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] IN LIST: reinterpret FixedSizeBinary for primitive fast paths [datafusion]
via GitHub
2026/06/18
Re: [PR] IN LIST: reinterpret FixedSizeBinary for primitive fast paths [datafusion]
via GitHub
2026/06/18
Re: [PR] IN LIST: reinterpret FixedSizeBinary for primitive fast paths [datafusion]
via GitHub
2026/06/18
Re: [PR] Add StatisticsContext parameter to partition_statistics [datafusion]
via GitHub
2026/06/18
Re: [I] Let partition_statistics accept pre-computed children statistics [datafusion]
via GitHub
2026/06/18
Re: [PR] Unify LRU memory-limiting caches into one generic cache [datafusion]
via GitHub
2026/06/18
Re: [PR] IN LIST: reinterpret FixedSizeBinary for primitive fast paths [datafusion]
via GitHub
2026/06/18
Re: [PR] Add StatisticsContext parameter to partition_statistics [datafusion]
via GitHub
2026/06/18
Re: [PR] Unify LRU memory-limiting caches into one generic cache [datafusion]
via GitHub
2026/06/18
Re: [PR] Unify LRU memory-limiting caches into one generic cache [datafusion]
via GitHub
2026/06/18
[PR] chore: add array tests with NaN handling [datafusion-comet]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [PR] fix: Consider column names' case when aliasing tables [datafusion]
via GitHub
2026/06/18
Re: [PR] refactor: move aggregate expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] refactor: move arithmetic and math support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] Rich t kid/implement multi dictionary aggr [datafusion]
via GitHub
2026/06/18
Re: [PR] IN LIST: reinterpret FixedSizeBinary for primitive fast paths [datafusion]
via GitHub
2026/06/18
Re: [PR] IN LIST: reinterpret FixedSizeBinary for primitive fast paths [datafusion]
via GitHub
2026/06/18
Re: [PR] Rich t kid/implement multi dictionary aggr [datafusion]
via GitHub
2026/06/18
Re: [PR] Unify LRU memory-limiting caches into one generic cache [datafusion]
via GitHub
2026/06/18
Re: [PR] fix: isolate anonymous file statistics cache [datafusion]
via GitHub
2026/06/18
Re: [I] [EPIC] Implement Range Partitioning [datafusion]
via GitHub
2026/06/18
[PR] Prune implicit FD group keys in SQL aggregates [datafusion]
via GitHub
2026/06/18
[I] TPC-DS q39 regression after adding primary key constraints: aggregate GROUP BY includes many unreferenced FD columns [datafusion]
via GitHub
2026/06/18
Re: [PR] Rich t kid/implement multi dictionary aggr [datafusion]
via GitHub
2026/06/18
[PR] fix: exclude release scratch dirs from RAT and sync bash rat excludes [datafusion-comet]
via GitHub
2026/06/18
[PR] Parallel bounded RANGE-frame window functions without PARTITION BY (draft) [datafusion]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [I] Update ClickBench benchmarks with DataFusion 54.0.0 (when released) [datafusion]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [D] DISCUSSION: DataFusion Meetup in Asia and China 2026 [datafusion]
via GitHub
2026/06/18
[PR] docs: Add Shanghai Apache DataFusion Meetup to events page [datafusion]
via GitHub
2026/06/18
Re: [PR] refactor: move string expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] refactor: move string expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/18
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/18
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/18
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/18
Re: [I] Reduce Github Action Usage [datafusion]
via GitHub
2026/06/18
Re: [PR] refactor: move aggregate expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] refactor: move aggregate expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] refactor: move aggregate expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] refactor: move array expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
[I] Support `MapType` for `ElementAt` [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] refactor: move array expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] refactor: move string expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] refactor: move string expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] refactor: move arithmetic and math support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [PR] perf: preserve dictionary encoding for lower/upper to avoid materializing low-cardinality columns [datafusion]
via GitHub
2026/06/18
[PR] Empty2null [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] Fix shared TopK early exit with shared prefix threshold [datafusion]
via GitHub
2026/06/18
Re: [I] Cleanup: Name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/18
[PR] refactor: name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/18
Re: [PR] Fix DuckDB unparse for optimized join projections [datafusion]
via GitHub
2026/06/18
[I] [Bug] unix_timestamp support level disagrees with documented TimestampNTZ tz-conversion divergence [datafusion-comet]
via GitHub
2026/06/18
[I] array_union result ordering versus DataFusion is unverified [datafusion-comet]
via GitHub
2026/06/18
[I] [Bug] map_from_arrays / map_from_entries do not enforce null-key rejection or spark.sql.mapKeyDedupPolicy [datafusion-comet]
via GitHub
2026/06/18
[I] [Bug] make_timestamp does not throw under spark.sql.ansi.enabled=true [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] perf: preserve dictionary encoding for lower/upper to avoid materializing low-cardinality columns [datafusion]
via GitHub
2026/06/18
[PR] ClickHouse: Support unparenthesized IN right-hand side [datafusion-sqlparser-rs]
via GitHub
2026/06/18
Re: [PR] perf: preserve dictionary encoding for lower/upper to avoid materializing low-cardinality columns [datafusion]
via GitHub
2026/06/18
Re: [PR] perf: preserve dictionary encoding for lower/upper to avoid materializing low-cardinality columns [datafusion]
via GitHub
2026/06/18
Re: [PR] refactor: name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: support file-level parquet row selections [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: use aligned slice access during bulk append in SparkUnsafeArray [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat: support file-level parquet row selections [datafusion]
via GitHub
2026/06/18
[PR] refactor: name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: Native Parquet Iceberg Data File Writes In Comet [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat: support native Comet scan of plain Delta Lake tables [datafusion-comet]
via GitHub
2026/06/18
[I] `IN` operator rejects a ClickHouse query-parameter placeholder as its right-hand side without parenthesis [datafusion-sqlparser-rs]
via GitHub
2026/06/18
[PR] refactor: move aggregate expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [I] Improve stage encoding size by removing unrelated partition location [datafusion-ballista]
via GitHub
2026/06/18
Re: [PR] feat: Native Parquet Iceberg Data File Writes In Comet [datafusion-comet]
via GitHub
2026/06/18
Re: [I] Improve stage encoding size by removing unrelated partition location [datafusion-ballista]
via GitHub
2026/06/18
[I] Improve stage encoding size by removing unrelated partition location [datafusion-ballista]
via GitHub
2026/06/18
Re: [I] Improve stage encoding size by removing unrelated partition location [datafusion-ballista]
via GitHub
2026/06/18
[PR] refactor: move array expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [PR] fix: CAST(MapType AS MapType) falls back even though native ... [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] fix: CAST(MapType AS MapType) falls back even though native ... [datafusion-comet]
via GitHub
2026/06/18
Re: [I] [Feature] Support Spark expression: create_map [datafusion-comet]
via GitHub
2026/06/18
Re: [I] EPIC: Support `Literal` with nested types [datafusion-comet]
via GitHub
2026/06/18
[PR] refactor: move string expression support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] Optimize Dictionary groupings [datafusion]
via GitHub
2026/06/18
Re: [PR] refactor(shim): add new shim infra with `enableIfVer` macro to avoid shims duplication [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] [#21878] extensive test for multi-dictionary column group bys [datafusion]
via GitHub
2026/06/18
[PR] fix: propagate nested cast errors [datafusion-comet]
via GitHub
2026/06/18
Re: [I] Call for Presentations: Community Showcase: Regular series for sharing what you're building with DataFusion [datafusion]
via GitHub
2026/06/18
Re: [I] Physical plan does not support logical expression Exists [datafusion]
via GitHub
2026/06/18
[I] Physical plan does not support logical expression Exists [datafusion]
via GitHub
2026/06/18
[PR] refactor: move arithmetic and math support checks to getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
[I] Move static support decisions from serde convert into getSupportLevel [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] minor: remove redundant scheduler info loggers [datafusion-ballista]
via GitHub
2026/06/18
Re: [PR] refactor: Simplify `approx_distinct` (-200 LoC) [datafusion]
via GitHub
2026/06/18
Re: [PR] minor: remove redundant scheduler info loggers [datafusion-ballista]
via GitHub
2026/06/18
Re: [PR] feat: use aligned slice access during bulk append in SparkUnsafeArray [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat: use aligned slice access during bulk append in SparkUnsafeArray [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat: Implement TimeType support - Infrastructure - shuffle [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] fix: materialize ConstantColumnVector on Comet's serialize/export paths [datafusion-comet]
via GitHub
2026/06/18
Re: [I] Centralize `approx_distinct` grouped HLL dispatch [datafusion]
via GitHub
2026/06/18
Re: [I] Centralize `approx_distinct` grouped HLL dispatch [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: informational message channel + generic native-available hint [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] Intermediate result blocked approach to aggregation memory management [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/18
Re: [PR] fix: decline CreateArray with struct-nullability-divergent children [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] fix: decline CreateArray with struct-nullability-divergent children [datafusion-comet]
via GitHub
2026/06/18
[PR] minor: remove redundant scheduler info loggers [datafusion-ballista]
via GitHub
2026/06/18
Re: [I] Native scan file-read failures should surface as Spark's FAILED_READ_FILE.NO_HINT [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] feat: surface native parquet read failures as FAILED_READ_FILE [datafusion-comet]
via GitHub
2026/06/18
Re: [PR] perf: optimize object store requests when reading CSV [datafusion]
via GitHub
2026/06/18
Re: [I] Cleanup: Name build-row and matchable-map presence checks in hash join [datafusion]
via GitHub
2026/06/18
Re: [PR] implement map_agg [datafusion]
via GitHub
2026/06/18
Re: [PR] implement map_agg [datafusion]
via GitHub
2026/06/18
Re: [PR] implement map_agg [datafusion]
via GitHub
2026/06/18
Re: [PR] implement map_agg [datafusion]
via GitHub
2026/06/18
Re: [PR] implement map_agg [datafusion]
via GitHub
2026/06/18
Re: [D] DISCUSSION: DataFusion Meetup in Asia and China 2026 [datafusion]
via GitHub
2026/06/18
Re: [D] DISCUSSION: DataFusion Meetup in Asia and China 2026 [datafusion]
via GitHub
2026/06/18
Re: [PR] Unify LRU memory-limiting caches into one generic cache [datafusion]
via GitHub
2026/06/18
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/18
Re: [PR] feat: decouple partition location from executor metadata [datafusion-ballista]
via GitHub
2026/06/18
Re: [PR] refactor: Unify LRU memory-limiting caches into one generic cache [datafusion]
via GitHub
2026/06/18
Re: [PR] refactor: Unify LRU memory-limiting caches into one generic cache [datafusion]
via GitHub
Later messages