Messages by Date
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [I] Add documentation for benchmarking Comet in AWS with S3 data source [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] minor: Organize fields inside SortMergeJoinStream [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [I] Organize fields inside `SortMergeJoinStream` [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Blog post about user defined window functions [datafusion-site]
via GitHub
-
2025/04/04
Re: [I] Extend TopK early termination to partially sorted inputs [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Fix clippy lint on rust 1.86 [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [PR] Test: configuration fuzzer for (external) sort queries [datafusion]
via GitHub
-
2025/04/04
Re: [I] Blog post about user defined window functions [datafusion]
via GitHub
-
2025/04/04
Re: [PR] STRING_AGG missing functionality [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Test: configuration fuzzer for (external) sort queries [datafusion]
via GitHub
-
2025/04/04
Re: [I] Add SQL examples to window functions: `nth_value`, etc [datafusion]
via GitHub
-
2025/04/04
Re: [I] A complete solution for stable and safe sort with spill [datafusion]
via GitHub
-
2025/04/04
Re: [I] Apache DataFusion Google Summer of Code (GSoC) 2025 Application Guidelines [datafusion]
via GitHub
-
2025/04/04
Re: [PR] perf: Introduce sort prefix computation for early TopK exit optimization on partially sorted input [datafusion]
via GitHub
-
2025/04/04
[PR] Support additional DuckDB integer types such as HUGEINT, UHUGEINT, etc [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
[PR] Blog post about user defined window functions [datafusion-site]
via GitHub
-
2025/04/04
Re: [PR] perf: Add TopK benchmarks as variation over the `sort_tpch` benchmarks [datafusion]
via GitHub
-
2025/04/04
Re: [PR] chore: update clickbench [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Docs : Added Sql examples for window Functions : `nth_val` , etc [datafusion]
via GitHub
-
2025/04/04
[PR] Minor: rm session downcast [datafusion]
via GitHub
-
2025/04/03
Re: [PR] feat: add test to check for `ctx.read_json()` [datafusion-ballista]
via GitHub
-
2025/04/03
Re: [PR] feat: Improve fetch partition performance, support skip validation arrow ipc files [datafusion-ballista]
via GitHub
-
2025/04/03
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/03
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/03
Re: [I] Blog post about user defined window functions [datafusion]
via GitHub
-
2025/04/03
Re: [I] Optimize repartitioning logic in ShuffleWriterExec using interleave_record_batch [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] Add coerce int96 option for Parquet to support different TimeUnits, test int96_from_spark.parquet from parquet-testing [datafusion]
via GitHub
-
2025/04/03
Re: [I] [comet-parquet-exec] Track remaining test failures in POC 1 & 2 [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part5 [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Docs : Added Sql examples for window Functions : `nth_val` , etc [datafusion]
via GitHub
-
2025/04/03
Re: [I] Audit-check fails in main branch [datafusion]
via GitHub
-
2025/04/03
Re: [I] Internal error: PhysicalExpr Column references bound error, Failure in spilling for `AggregateMode::Single` [datafusion]
via GitHub
-
2025/04/03
Re: [I] Remove record_batch! macro once upstream updates [datafusion]
via GitHub
-
2025/04/03
Re: [I] Blog post about user defined window functions [datafusion]
via GitHub
-
2025/04/03
Re: [I] Audit-check fails in main branch [datafusion]
via GitHub
-
2025/04/03
Re: [I] Run all benchmarks on merge to main branch [datafusion]
via GitHub
-
2025/04/03
Re: [I] limit max disk usage for spilling queries [datafusion]
via GitHub
-
2025/04/03
Re: [I] A complete solution for stable and safe sort with spill [datafusion]
via GitHub
-
2025/04/03
Re: [I] A complete solution for stable and safe sort with spill [datafusion]
via GitHub
-
2025/04/03
Re: [I] Erroneous warning on unset options during FFI table operation [datafusion]
via GitHub
-
2025/04/03
[PR] chore: update clickbench [datafusion]
via GitHub
-
2025/04/03
Re: [I] `count` fails for FFI Table Providers [datafusion]
via GitHub
-
2025/04/03
Re: [I] `cargo audit` is failing on main [datafusion]
via GitHub
-
2025/04/03
Re: [PR] fix: Queries similar to `count-bug` produce incorrect results [datafusion]
via GitHub
-
2025/04/03
Re: [PR] ExecutionPlan: add APIs for filter pushdown & optimizer rule to apply them [datafusion]
via GitHub
-
2025/04/03
Re: [I] [comet-parquet-exec] Track remaining test failures in POC 1 & 2 [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] fix: Queries similar to `count-bug` produce incorrect results [datafusion]
via GitHub
-
2025/04/03
Re: [I] Similar to the "count-bug" case that produces incorrect results [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Minor: add Arc for statistics in FileGroup [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Enable repartitioning on MemTable. [datafusion]
via GitHub
-
2025/04/03
Re: [I] Table function supports non-literal args [datafusion]
via GitHub
-
2025/04/03
Re: [I] Trivial WHERE filter not eliminated when combined with CTE [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add short circuit evaluation for `AND` and `OR` [datafusion]
via GitHub
-
2025/04/03
Re: [I] `cargo audit` is failing on main [datafusion]
via GitHub
-
2025/04/03
Re: [I] `cargo audit` is failing on main [datafusion]
via GitHub
-
2025/04/03
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/03
Re: [I] Include Apple macOS support in jars in Maven central [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part5 [datafusion]
via GitHub
-
2025/04/03
Re: [I] 【TPCH】Comet do not show performance advantages over native Spark? [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] TPCH DataGen Not working [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] Add more developer documentation [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] 【TPCH】Comet do not show performance advantages over native Spark? [datafusion-comet]
via GitHub
-
2025/04/03
[PR] Fix clippy lint on rust 1.86 [datafusion-sqlparser-rs]
via GitHub
-
2025/04/03
Re: [PR] Docs : Added Sql examples for window Functions : `nth_val` , etc [datafusion]
via GitHub
-
2025/04/03
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/03
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Fix Possible Congestion Scenario in `SortPreservingMergeExec` [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part5 [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part5 [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part5 [datafusion]
via GitHub
-
2025/04/03
Re: [I] [EPIC] A collection of tickets for improving sorting larger than memory datasets / spilling sorts [datafusion]
via GitHub
-
2025/04/03
Re: [I] [comet-parquet-exec] Track remaining test failures in POC 1 & 2 [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] [comet-parquet-exec] Track remaining test failures in POC 1 & 2 [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part5 [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part5 [datafusion]
via GitHub
-
2025/04/03
Re: [PR] perf: replace `merge` `uninitiated_partitions` `VecDeque<usize>` with custom fixed size queue [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part5 [datafusion]
via GitHub
-
2025/04/03
Re: [PR] perf: replace `merge` `uninitiated_partitions` `VecDeque<usize>` with custom fixed size queue [datafusion]
via GitHub
-
2025/04/03
[I] `cargo audit` is failing on main [datafusion]
via GitHub
-
2025/04/03
Re: [I] Remove unwraps in `hash_array_small_decimal` [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part5 [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Update concepts-readings-events.md [datafusion]
via GitHub
-
2025/04/03
Re: [I] [substrait] Build basic test suite to validate produced Substrait plans [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add topk information into tree explain plans [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add topk information into tree explain plans [datafusion]
via GitHub
-
2025/04/03
Re: [I] Add `topk` information into `tree` explain plans [datafusion]
via GitHub
-
2025/04/03
Re: [I] Blog post about user defined window functions [datafusion]
via GitHub
-
2025/04/03
Re: [I] Reduce number of tokio blocking threads in SortExec spill [datafusion]
via GitHub
-
2025/04/03
Re: [PR] tpcbench.py add --query support to run custom query [datafusion-ray]
via GitHub
-
2025/04/03
Re: [PR] Minor: add Arc for statistics in FileGroup [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Fix Possible Congestion Scenario in `SortPreservingMergeExec` [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part5 [datafusion]
via GitHub
-
2025/04/03
Re: [PR] perf: replace `merge` `uninitiated_partitions` `VecDeque<usize>` with custom fixed size queue [datafusion]
via GitHub
-
2025/04/03
[I] Improve time for SortPreservingMerge stream / uninitiated_partitions VecDeque<usize> [datafusion]
via GitHub
-
2025/04/03
Re: [I] Explore integration with Delta Lake [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] perf: replace `merge` `uninitiated_partitions` `VecDeque<usize>` with custom fixed size queue [datafusion]
via GitHub
-
2025/04/03
Re: [PR] chore: return `404` for api requests if path does not exist [datafusion-ballista]
via GitHub
-
2025/04/03
[I] Extend benchmarking to "TopK" queries [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add dynamic pruning filters from TopK state [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part5 [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add topk information into tree explain plans [datafusion]
via GitHub
-
2025/04/03
Re: [PR] fix: update group by columns for merge phase after spill [datafusion]
via GitHub
-
2025/04/03
Re: [I] Internal error: PhysicalExpr Column references bound error, Failure in spilling for `AggregateMode::Single` [datafusion]
via GitHub
-
2025/04/03
Re: [PR] fix: update group by columns for merge phase after spill [datafusion]
via GitHub
-
2025/04/03
Re: [PR] feat: Add config `max_temp_directory_size` to limit max disk usage for spilling queries [datafusion]
via GitHub
-
2025/04/03
Re: [PR] feat: Add config `max_temp_directory_size` to limit max disk usage for spilling queries [datafusion]
via GitHub
-
2025/04/03
Re: [I] address failure caused by method signature change in SPARK-48791 [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] Dynamic pruning filters from TopK state (optimize `ORDER BY LIMIT` queries) [datafusion]
via GitHub
-
2025/04/03
Re: [I] Dynamic pruning filters from TopK state (optimize `ORDER BY LIMIT` queries) [datafusion]
via GitHub
-
2025/04/03
[PR] Introduce DynamicFilterSource and DynamicPhysicalExpr [datafusion]
via GitHub
-
2025/04/03
[PR] chore: return `404` for api requests if path does not exist [datafusion-ballista]
via GitHub
-
2025/04/03
Re: [I] Reduce number of tokio blocking threads in SortExec spill [datafusion]
via GitHub
-
2025/04/03
[PR] chore: fix clippy issues after update to rust 1.86 [datafusion-ballista]
via GitHub
-
2025/04/03
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Fix duplicate unqualified Field name (schema error) on join queries [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Fix duplicate unqualified Field name (schema error) on join queries [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Respect ignore_nulls in array_agg [datafusion]
via GitHub
-
2025/04/03
Re: [I] Native scan panic with native_iceberg_compat on hdfs [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] fix: avoid panic caused by close null handle of parquet reader [datafusion-comet]
via GitHub
-
2025/04/03
[PR] ExecutionPlan: add APIs for filter pushdown & optimizer rule to apply them [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part4 [datafusion]
via GitHub
-
2025/04/03
Re: [I] Decorrelate scalar subqueries with more complex filter expressions [datafusion]
via GitHub
-
2025/04/03
[I] Erroneous warning on unset options during FFI table operation [datafusion]
via GitHub
-
2025/04/03
Re: [I] Different semantics of casting from int64 to timestamp between Comet and Spark [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] Different semantics of casting from int64 to timestamp between Comet and Spark [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] chore: Remove some unwraps in hashing code [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] Add more developer documentation [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] Docs : Added Sql examples for window Functions : nth_val , etc [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part4 [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add `statistics_by_partition API` to ExecutionPlan [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Format `Date32` to string given timestamp specifiers [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part4 [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Fix duplicate unqualified Field name (schema error) on join queries [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add `statistics_by_partition API` to ExecutionPlan [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add `statistics_by_partition API` to ExecutionPlan [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add `statistics_by_partition API` to ExecutionPlan [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add `statistics_by_partition API` to ExecutionPlan [datafusion]
via GitHub
-
2025/04/03
Re: [I] Spark executor fail to start occasionally with SIGILL [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] Spark executor fail to start occasionally with SIGILL [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] Introduce load-balanced `split_groups_by_statistics` method [datafusion]
via GitHub
-
2025/04/03
[I] Making comet native operators write spill files to spark local dir [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] fix: update group by columns for merge phase after spill [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Remove redundant statistics from FileScanConfig [datafusion]
via GitHub
-
2025/04/03
[PR] Minor: add Arc for statistics in FileGroup [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add `statistics_by_partition API` to ExecutionPlan [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add `statistics_by_partition API` to ExecutionPlan [datafusion]
via GitHub
-
2025/04/03
Re: [PR] chore: Remove some unwraps in hashing code [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] Implement cast from Long to Timestamp [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] fix: Make AQE capable of converting Comet shuffled joins to Comet broadcast hash joins [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part4 [datafusion]
via GitHub
-
2025/04/03
Re: [I] Implement cast from Long to Timestamp [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] Different semantics of casting from int64 to timestamp between Comet and Spark [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] minor: Fix clippy warnings [datafusion-comet]
via GitHub
-
2025/04/03
[PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add `statistics_by_partition API` to ExecutionPlan [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add `statistics_by_partition API` to ExecutionPlan [datafusion]
via GitHub
-
2025/04/03
Re: [I] Organize fields inside `SortMergeJoinStream` [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part4 [datafusion]
via GitHub
-
2025/04/03
Re: [I] Spark executor fail to start occasionally with SIGILL [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] Add `statistics_by_partition API` to ExecutionPlan [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Migrate datafusion/sql tests to insta, part4 [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Fix: after repartitioning, statistics should be inexact [datafusion]
via GitHub
-
2025/04/03
Re: [PR] perf: Add TopK benchmarks as variation over the `sort_tpch` benchmarks [datafusion]
via GitHub
-
2025/04/03
Re: [PR] perf: Add TopK benchmarks as variation over the `sort_tpch` benchmarks [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add all missing table options to be handled in any order [datafusion-sqlparser-rs]
via GitHub
-
2025/04/03
[PR] perf: Add TopK benchmarks as variation over the `sort_tpch` benchmarks [datafusion]
via GitHub
-
2025/04/03
Re: [I] with datafusion comet,no performance improvement. [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] Running Spark Shell with Comet throws Exception [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] Add `statistics_by_partition API` to ExecutionPlan [datafusion]
via GitHub
-
2025/04/03
Re: [I] Update supported Spark and Java versions in installation guide [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] address failure caused by method signature change in SPARK-48791 [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] Format `Date32` to string given timestamp specifiers [datafusion]
via GitHub
-
2025/04/03
Re: [I] Add more developer documentation [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] Explore integration with Delta Lake [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] Support native parquet read and hdfs read? [datafusion-comet]
via GitHub
-
2025/04/03
Re: [I] Extend benchmarking to "TopK" queries [datafusion]
via GitHub
-
2025/04/03
Re: [I] Support native parquet read and hdfs read? [datafusion-comet]
via GitHub
-
2025/04/03
Re: [PR] fix!: incorrect coercion when comparing with string literals [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Docs : Added Sql examples for window Functions : nth_val , etc [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Fix: after repartitioning, statistics should be inexact [datafusion]
via GitHub
-
2025/04/03
Re: [PR] minor: Organize fields inside SortMergeJoinStream [datafusion]
via GitHub
-
2025/04/03
Re: [PR] Add short circuit evaluation for `AND` and `OR` [datafusion]
via GitHub