github
Messages by Date
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
[PR] perf: avoid excessive timer calls [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
[I] perf: avoid excessive timer calls in `ScanExec::get_next_batch` [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] fix: optimize_projections failure with struct-field join keys [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] fix: optimize_projections failure with struct-field join keys [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] feat: add datafusion-json crate with json_get_str scaffolding [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] feat: add datafusion-json crate with json_get_str scaffolding [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
[PR] perf: coalesce single-column sort runs to cut merge fan-in [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] IN LIST: add UInt16 bitmap filter [datafusion]
via GitHub
2026/06/26
Re: [PR] feat: add `poll_now_notify` to `poll_loop` and `on_work_available` callback [datafusion-ballista]
via GitHub
2026/06/26
Re: [PR] Add range partitioning sqllogictest fixture [datafusion]
via GitHub
2026/06/26
[PR] chore(deps): bump object_store from 0.13.2 to 0.14.0 in /native [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] feat: Support Decimal type in `approx_distinct` [datafusion]
via GitHub
2026/06/26
[PR] chore(deps): bump the all-other-cargo-deps group in /native with 3 updates [datafusion-comet]
via GitHub
2026/06/26
[PR] chore(deps): bump actions/cache from 5 to 6 [datafusion-comet]
via GitHub
2026/06/26
Re: [I] Add AQE to DataFusion [datafusion]
via GitHub
2026/06/26
Re: [PR] chore(ci): Deploy WebTUI to nightlies.a.o [datafusion-ballista]
via GitHub
2026/06/26
Re: [I] http://scheduler_host:port/ should redirect to https://nightlies.apache.org/ballista/<ballista version> [datafusion-ballista]
via GitHub
2026/06/26
Re: [I] Comet produces bloated results in comparison with Spark [datafusion-comet]
via GitHub
2026/06/26
Re: [I] Comet produces bloated results in comparison with Spark [datafusion-comet]
via GitHub
2026/06/26
Re: [PR] Add range partitioning sqllogictest fixture [datafusion]
via GitHub
2026/06/26
[PR] refactor: make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/26
Re: [PR] feat: Support Decimal type in `approx_distinct` [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix/python cargo lock drift [datafusion-ballista]
via GitHub
2026/06/25
[PR] Fix/python cargo lock drift [datafusion-ballista]
via GitHub
2026/06/25
Re: [PR] feat: Support Decimal type in `approx_distinct` [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: Support Decimal type in `approx_distinct` [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
[PR] feat: release tokio runtime on driver/executor exit [datafusion-comet]
via GitHub
2026/06/25
Re: [PR] chore(deps): bump taiki-e/install-action from 2.82.3 to 2.82.4 [datafusion-ballista]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [PR] chore(deps): bump env_logger from 0.11.10 to 0.11.11 [datafusion-ballista]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [PR] [#21878] extensive test for multi-dictionary column group bys [datafusion]
via GitHub
2026/06/25
Re: [PR] [9195] optimize group value bytes [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [PR] [codex POC] Add query fusion optimizer [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [I] [EPIC] Improve window function performance for large windows [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: release interning state before terminal output to fix hash aggregate regression (#23178) [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: release interning state before terminal output to fix hash aggregate regression (#23178) [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
[PR] chore(deps): bump env_logger from 0.11.10 to 0.11.11 [datafusion-ballista]
via GitHub
2026/06/25
[PR] chore(deps): bump taiki-e/install-action from 2.82.3 to 2.82.4 [datafusion-ballista]
via GitHub
2026/06/25
Re: [PR] add benchmarks for single column group-values traits [datafusion]
via GitHub
2026/06/25
Re: [PR] add benchmarks for single column group-values traits [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: Fix peak memory display in `EXPLAIN ANALYZE` for multiple operators [datafusion]
via GitHub
2026/06/25
Re: [I] Make file-statistics cache keys schema-aware [datafusion]
via GitHub
2026/06/25
[PR] doc: More comments on GroupedHashAggregateStream refactor [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: Fix peak memory display in `EXPLAIN ANALYZE` for multiple operators [datafusion]
via GitHub
2026/06/25
Re: [PR] [9195] optimize group value bytes [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: disable migration aggregate by default [datafusion]
via GitHub
2026/06/25
Re: [I] Improve internal worker parallelism support [datafusion]
via GitHub
2026/06/25
Re: [PR] Validate coerce int96 config 17498 [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: multiple columns in count distinct [datafusion]
via GitHub
2026/06/25
Re: [PR] bench: add key-only payload sort benchmarks [datafusion]
via GitHub
2026/06/25
Re: [PR] Adds dynamic filter support for NestedLoopJoinExec [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: substrait support of `Decimal32` & `Decimal64` [datafusion]
via GitHub
2026/06/25
Re: [PR] Fix final hash aggregate output regression by materializing once [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: add Spark-compatible arrays_zip function [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: NOT IN with NULL subquery returns wrong results under SortMergeJoin [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: NOT IN with NULL subquery returns wrong results under SortMergeJoin [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: keep null-aware anti-join NULLs in the pushed dynamic filter [datafusion]
via GitHub
2026/06/25
Re: [PR] fix(enforce_sorting): remap sort requirement through ProjectionExec on pushdown [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: graceful error for deeply nested expressions instead of stack overflow [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: graceful error for deeply nested expressions instead of stack overflow [datafusion]
via GitHub
2026/06/25
Re: [PR] chore: use `Vec` instead of `OffsetBuilder` [datafusion]
via GitHub
2026/06/25
Re: [I] Support filter pushdown through `SortMergeJoinExec` [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: Create dynamic filters in SortMergeJoin [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: Create dynamic filters in SortMergeJoin [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: add Spark-compatible arrays_zip function [datafusion]
via GitHub
2026/06/25
[PR] fix(enforce_sorting): remap sort requirement through ProjectionExec on pushdown [datafusion]
via GitHub
2026/06/25
Re: [PR] [9195] optimize group value bytes [datafusion]
via GitHub
2026/06/25
Re: [PR] chore: use `Vec` instead of `OffsetBuilder` [datafusion]
via GitHub
2026/06/25
Re: [PR] chore: use `Vec` instead of `OffsetBuilder` [datafusion]
via GitHub
2026/06/25
[PR] fix: guard `test_stack_overflow` against a deep-recursion stack overflow [datafusion]
via GitHub
2026/06/25
Re: [PR] chore: use `Vec` instead of `OffsetBuilder` [datafusion]
via GitHub
2026/06/25
Re: [PR] chore: use `Vec` instead of `OffsetBuilder` [datafusion]
via GitHub
2026/06/25
Re: [PR] chore: use `Vec` instead of `OffsetBuilder` [datafusion]
via GitHub
2026/06/25
Re: [PR] chore: use `Vec` instead of `OffsetBuilder` [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: disable migration aggregate by default [datafusion]
via GitHub
2026/06/25
Re: [PR] chore: use `Vec` instead of `OffsetBuilder` [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: materialize ConstantColumnVector on Comet's serialize/export paths [datafusion-comet]
via GitHub
2026/06/25
Re: [PR] fix: `array_compact` handle edge case with NULLs [datafusion]
via GitHub
2026/06/25
Re: [I] Use Vectorized Partition Kernel for Window Frame Calculation [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/25
Re: [I] Use Vectorized Partition Kernel for Window Frame Calculation [datafusion]
via GitHub
2026/06/25
Re: [I] Remove usage of Accumulator from window functions [datafusion]
via GitHub
2026/06/25
Re: [I] Remove usage of Accumulator from window functions [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/25
Re: [PR] chore: surface DataFusion 54 PruningMetrics and Ratio in CometNativeScan metrics [datafusion-comet]
via GitHub
2026/06/25
Re: [PR] feat: Add SQL planner, physical planner, and TableProvider hook for MERGE INTO [datafusion]
via GitHub
2026/06/25
Re: [I] Add AQE to DataFusion [datafusion]
via GitHub
2026/06/25
Re: [PR] Parallel bounded RANGE-frame window functions without PARTITION BY (draft) [datafusion]
via GitHub
2026/06/25
Re: [PR] chore: surface DataFusion 54 PruningMetrics and Ratio in CometNativeScan metrics [datafusion-comet]
via GitHub
2026/06/25
[I] [EPIC] Improve the per-window performance for large windows [datafusion]
via GitHub
2026/06/25
[PR] chore: surface DataFusion 54 PruningMetrics and Ratio in CometNativeScan metrics [datafusion-comet]
via GitHub
2026/06/25
Re: [PR] IN LIST: add UInt16 bitmap filter [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: fall back for decimal SUM/AVG over sliding window frames (window audit) [datafusion-comet]
via GitHub
2026/06/25
Re: [I] [Bug] SUM(decimal) over a sliding window frame returns wrapped out-of-range value on overflow instead of NULL [datafusion-comet]
via GitHub
2026/06/25
Re: [I] Optimize how table partitions are pruned [datafusion]
via GitHub
2026/06/25
Re: [I] Optimize how table partitions are pruned [datafusion]
via GitHub
2026/06/25
Re: [PR] chore: use `Vec` instead of `OffsetBuilder` [datafusion]
via GitHub
2026/06/25
Re: [I] [EPIC] Merge EnforceDistribution + EnforceSorting into a single EnsureRequirements rule for correctness and idempotency [datafusion]
via GitHub
2026/06/25
Re: [I] Window aggregates output order broken due to hash repartitioning [datafusion]
via GitHub
2026/06/25
Re: [I] Window aggregates output order broken due to hash repartitioning [datafusion]
via GitHub
2026/06/25
[PR] [branch-54] backport #23192 `array_compact` handle edge case with NULLs [datafusion]
via GitHub
2026/06/25
[PR] chore: use `Vec` instead of `OffsetBuilder` [datafusion]
via GitHub
2026/06/25
Re: [PR] perf: unwrap identity casts in schema adapter to enable Parquet stats pruning [datafusion-comet]
via GitHub
2026/06/25
Re: [PR] perf: unwrap identity casts in schema adapter to enable Parquet stats pruning [datafusion-comet]
via GitHub
2026/06/25
Re: [I] [EPIC] Fix performance regressions when enabling parquet filter pushdown (late materialization) [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: add datafusion-json crate with json_get_str scaffolding [datafusion]
via GitHub
2026/06/25
[I] Add AQE to DataFusion [datafusion]
via GitHub
2026/06/25
Re: [PR] [PoC/Proposal] AQE-lite: change plan properties at runtime based on stats from pipeline breakers [datafusion]
via GitHub
2026/06/25
Re: [I] [EPIC] Fix performance regressions when enabling parquet filter pushdown (late materialization) [datafusion]
via GitHub
2026/06/25
Re: [PR] docs: comet docs design overhaul- phase 1 [datafusion-comet]
via GitHub
2026/06/25
Re: [PR] [PoC/Proposal] AQE-lite: change plan properties at runtime based on stats from pipeline breakers [datafusion]
via GitHub
2026/06/25
Re: [PR] variant: Integrate datafusion-variant into Datafusion [datafusion]
via GitHub
2026/06/25
Re: [PR] variant: Integrate datafusion-variant into Datafusion [datafusion]
via GitHub
2026/06/25
[PR] fix: fall back for decimal SUM/AVG over sliding window frames (window audit) [datafusion-comet]
via GitHub
2026/06/25
[I] [Bug] AVG(decimal) over a window always falls back to Spark on Spark 4.x (AvgDecimal window branch is dead) [datafusion-comet]
via GitHub
2026/06/25
[PR] unwrap identity casts in schema adapter for filters [datafusion-comet]
via GitHub
2026/06/25
Re: [PR] fix: `array_compact` handle edge case with NULLs [datafusion]
via GitHub
2026/06/25
Re: [I] `HashJoinExec` / `NestedLoopJoinExec` projection `Some(vec![])` becomes `None` after ser/de [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: preserve empty projection when ser/de `HashJoinExec` and `NestedLoopJoinExec` [datafusion]
via GitHub
2026/06/25
Re: [PR] Migrate case conversion and substr_index to fallible string view builder APIs [datafusion]
via GitHub
2026/06/25
[PR] Bump version to 54.0.0 [datafusion-python]
via GitHub
2026/06/25
Re: [PR] fix: `array_compact` handle edge case with NULLs [datafusion]
via GitHub
2026/06/25
[I] [Bug] SUM(decimal) over a sliding window frame returns wrapped out-of-range value on overflow instead of NULL [datafusion-comet]
via GitHub
2026/06/25
Re: [PR] chore: remove 3.13 freethreaded builds [datafusion-python]
via GitHub
2026/06/25
Re: [PR] fix: `array_compact` handle edge case with NULLs [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: `array_compact` handle edge case with NULLs [datafusion]
via GitHub
2026/06/25
Re: [PR] IN LIST: add UInt16 bitmap filter [datafusion]
via GitHub
2026/06/25
Re: [I] Enable `spark.comet.sparkToColumnar.enabled` when running Spark SQL tests [datafusion-comet]
via GitHub
2026/06/25
Re: [I] Enable `spark.comet.sparkToColumnar.enabled` when running Spark SQL tests [datafusion-comet]
via GitHub
2026/06/25
Re: [I] chore: Publish specific documentation for each supported Spark version [datafusion-comet]
via GitHub
2026/06/25
Re: [I] chore: Publish specific documentation for each supported Spark version [datafusion-comet]
via GitHub
2026/06/25
Re: [PR] IN LIST: add UInt16 bitmap filter [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: Add native collect_list aggregate support [datafusion-comet]
via GitHub
2026/06/25
Re: [I] [EPIC] Fix performance regressions when enabling parquet filter pushdown (late materialization) [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: introduce pluggable SpillFile trait and TempFileFactory for custom spill backends [datafusion]
via GitHub
2026/06/25
Re: [PR] chore: remove 3.13 freethreaded builds [datafusion-python]
via GitHub
2026/06/25
[PR] feat: route Unsupported through codegen dispatch for opt-in serdes [datafusion-comet]
via GitHub
2026/06/25
Re: [PR] fix: `array_compact` handle edge case with NULLs [datafusion]
via GitHub
2026/06/25
Re: [PR] feat: Stage based fallback [datafusion-comet]
via GitHub
2026/06/25
[PR] chore: remove 3.13 freethreaded builds [datafusion-python]
via GitHub
2026/06/25
Re: [PR] chore: resolve audits after DF54.0.0 update to skill [datafusion-python]
via GitHub
2026/06/25
Re: [I] Improve ArrowWriter performance for fixed-length vectors [datafusion-comet]
via GitHub
2026/06/25
[PR] [9195] optimize group value bytes [datafusion]
via GitHub
2026/06/25
Re: [PR] fix: materialize ConstantColumnVector on Comet's serialize/export paths [datafusion-comet]
via GitHub
2026/06/25
Re: [PR] fix: materialize ConstantColumnVector on Comet's serialize/export paths [datafusion-comet]
via GitHub
2026/06/25
Re: [I] Release DataFusion `54.1.0` (minor/patch) Release [datafusion]
via GitHub
2026/06/25
Re: [PR] Support co-partitioned range hash joins [datafusion]
via GitHub