Messages by Date
-
2025/04/05
Re: [PR] Enhance: simplify x=x [datafusion]
via GitHub
-
2025/04/05
Re: [PR] Enhance: simplify x=x [datafusion]
via GitHub
-
2025/04/05
Re: [I] Enable `split_file_groups_by_statistics` by default [datafusion]
via GitHub
-
2025/04/05
[PR] chore: rm duplicated `JoinOn` type [datafusion]
via GitHub
-
2025/04/05
[PR] Enhance: simplify x=x [datafusion]
via GitHub
-
2025/04/05
Re: [I] Set DataFusion runtime configurations through SQL interface [datafusion]
via GitHub
-
2025/04/05
[PR] fix decimal precision issue in simplify expression optimize rule [datafusion]
via GitHub
-
2025/04/05
Re: [I] Ballista: Partition columns are duplicated in protobuf decoding. [datafusion-ballista]
via GitHub
-
2025/04/04
Re: [PR] MSSQL: Add support for functionality `MERGE` output clause [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [I] Trivial WHERE filter not eliminated when combined with CTE [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Add support for MSSQL IF/ELSE statements. [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [PR] Add support for 'IN <SetExpression>' [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Add all missing table options to be handled in any order [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [PR] Use `any` instead of `for_each` [datafusion]
via GitHub
-
2025/04/04
Re: [PR] chore: Attach Diagnostic to "incompatible type in unary expression" error [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Use `any` instead of `for_each` [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Add support for MSSQL IF/ELSE statements. [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [PR] Improve collection during repr and repr_html [datafusion-python]
via GitHub
-
2025/04/04
[PR] Allow single quotes in EXTRACT() for Redshift. [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [I] Collecting parquet without any transformations throws an exception [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] Add disk usage limit configuration to datafusion-cli [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Add disk usage limit configuration to datafusion-cli [datafusion]
via GitHub
-
2025/04/04
Re: [I] Will Comet support closed-source forks of Apache Spark (e.g. CSP versions)? [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] Update changelog and version number [datafusion-python]
via GitHub
-
2025/04/04
Re: [I] Spark executor fail to start occasionally with SIGILL [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] Blog post on Parquet pruning in datafusion [datafusion-site]
via GitHub
-
2025/04/04
[PR] Chore: Call arrow's methods `row_count` and `skipped_row_count` [datafusion]
via GitHub
-
2025/04/04
[I] Enable `split_file_groups_by_statistics` by default [datafusion]
via GitHub
-
2025/04/04
Re: [I] Enable `split_file_groups_by_statistics` by default [datafusion]
via GitHub
-
2025/04/04
Re: [I] Cache Parquet Metadata [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Docs : Added Sql examples for window Functions : `nth_val` , etc [datafusion]
via GitHub
-
2025/04/04
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/04
[I] `count` fails for FFI Table Providers [datafusion]
via GitHub
-
2025/04/04
[PR] Add disk usage limit configuration to datafusion-cli [datafusion]
via GitHub
-
2025/04/04
Re: [PR] feat: add MAP type support for first level [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] chore: Create simple fuzz test as part of test suite [datafusion-comet]
via GitHub
-
2025/04/04
Re: [I] Running Spark Shell with Comet throws Exception [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] Add `statistics_by_partition API` to ExecutionPlan [datafusion]
via GitHub
-
2025/04/04
[PR] fix: corrected the logic of eliminating CometSparkToColumnarExec [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] STRING_AGG missing functionality [datafusion]
via GitHub
-
2025/04/04
Re: [I] [DISCUSS] Switch to `tree` explain by default [datafusion]
via GitHub
-
2025/04/04
Re: [PR] docs: change OSX/OS X to macOS [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] Migrate physical plan tests to `insta` (Part-1) [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Test: configuration fuzzer for (external) sort queries [datafusion]
via GitHub
-
2025/04/04
Re: [PR] chore: Create simple fuzz test as part of test suite [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [I] Will Comet support closed-source forks of Apache Spark (e.g. CSP versions)? [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] minor: Fix clippy warnings [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] chore: Fix some inconsistencies in memory pool configuration [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] docs: various improvements to tuning guide [datafusion-comet]
via GitHub
-
2025/04/04
Re: [I] Enable `split_file_groups_by_statistics` by default [datafusion]
via GitHub
-
2025/04/04
[I] Filter cache [datafusion]
via GitHub
-
2025/04/04
Re: [PR] feat: implement GroupsAccumulator for `count(DISTINCT)` aggr [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Add support for 'IN <SetExpression>' [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [PR] Add support for 'IN <SetExpression>' [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Improve performance of `last_value` by implementing special `GroupsAccumulator` [datafusion]
via GitHub
-
2025/04/04
Re: [PR] [BLOG] tpchgen-rs: World’s fastest open source TPCH data generator, written in Rust [datafusion-site]
via GitHub
-
2025/04/04
Re: [PR] chore: update clickbench [datafusion]
via GitHub
-
2025/04/04
Re: [I] Update ClickBench queries to avoid ::INT::DATE casting [datafusion]
via GitHub
-
2025/04/04
Re: [PR] [BLOG] tpchgen-rs: World’s fastest open source TPCH data generator, written in Rust [datafusion-site]
via GitHub
-
2025/04/04
Re: [PR] [BLOG] tpchgen-rs: World’s fastest open source TPCH data generator, written in Rust [datafusion-site]
via GitHub
-
2025/04/04
Re: [PR] [BLOG] tpchgen-rs: World’s fastest open source TPCH data generator, written in Rust [datafusion-site]
via GitHub
-
2025/04/04
Re: [PR] Run test [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Run test [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Run test [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Run test [datafusion]
via GitHub
-
2025/04/04
[PR] Run test [datafusion]
via GitHub
-
2025/04/04
Re: [PR] [BLOG] tpchgen-rs: World’s fastest open source TPCH data generator, written in Rust [datafusion-site]
via GitHub
-
2025/04/04
Re: [PR] fix: corrected the logic of eliminating CometSparkToColumnarExec [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] Fix: after repartitioning, the `PartitionedFile` and `FileGroup` statistics should be inexact [datafusion]
via GitHub
-
2025/04/04
Re: [PR] chore: Create simple fuzz test as part of test suite [datafusion-comet]
via GitHub
-
2025/04/04
[PR] chore: remove unused executor configuration option [datafusion-ballista]
via GitHub
-
2025/04/04
Re: [I] Make Clickbench Q29 5x faster for datafusion [datafusion]
via GitHub
-
2025/04/04
Re: [PR] ExecutionPlan: add APIs for filter pushdown & optimizer rule to apply them [datafusion]
via GitHub
-
2025/04/04
Re: [I] Nested correlated subquery error with a depth exceeding 1 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] [BLOG] tpchgen-rs: World’s fastest open source TPCH data generator, written in Rust [datafusion-site]
via GitHub
-
2025/04/04
[PR] feat: remove flight-sql from scheduler [datafusion-ballista]
via GitHub
-
2025/04/04
Re: [PR] perf: replace `merge` `uninitiated_partitions` `VecDeque<usize>` with custom fixed size queue [datafusion]
via GitHub
-
2025/04/04
[I] Cache Parquet Metadata [datafusion]
via GitHub
-
2025/04/04
Re: [PR] fix: add an "expr_planners" method to SessionState [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Blog post about user defined window functions [datafusion-site]
via GitHub
-
2025/04/04
Re: [PR] Remove CoalescePartitions insertion from HashJoinExec [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Minor: add Arc for statistics in FileGroup [datafusion]
via GitHub
-
2025/04/04
Re: [I] A complete solution for stable and safe sort with spill [datafusion]
via GitHub
-
2025/04/04
Re: [PR] perf: replace `merge` `uninitiated_partitions` `VecDeque<usize>` with custom fixed size queue [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Draft: Make Clickbench Q29 5x faster for datafusion [datafusion]
via GitHub
-
2025/04/04
[I] Remove `flight-sql` from ballista [datafusion-ballista]
via GitHub
-
2025/04/04
Re: [PR] Improve spill performance: Disable re-validation of spilled files [datafusion]
via GitHub
-
2025/04/04
Re: [I] Make it easier to run TPCH queries with datafusion-cli [datafusion]
via GitHub
-
2025/04/04
Re: [PR] ExecutionPlan: add APIs for filter pushdown & optimizer rule to apply them [datafusion]
via GitHub
-
2025/04/04
Re: [PR] ExecutionPlan: add APIs for filter pushdown & optimizer rule to apply them [datafusion]
via GitHub
-
2025/04/04
Re: [I] Support integration with Parquet modular encryption [datafusion]
via GitHub
-
2025/04/04
Re: [PR] bench: Document how to use cross platform Samply profiler [datafusion]
via GitHub
-
2025/04/04
Re: [I] Enable `split_file_groups_by_statistics` by default [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Allow single quotes in EXTRACT() for Redshift. [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [PR] MSSQL: Add support for functionality `MERGE` output clause [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
[PR] Add Table Functions to FFI Crate [datafusion]
via GitHub
-
2025/04/04
Re: [I] `batches_to_sort_string` differing from similar implementation in `assert_batches_sorted_eq` [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Migrate optimizer tests to insta [datafusion]
via GitHub
-
2025/04/04
Re: [PR] perf: replace `merge` `uninitiated_partitions` `VecDeque<usize>` with custom fixed size queue [datafusion]
via GitHub
-
2025/04/04
Re: [PR] fix: adjust CometNativeScan's doCanonicalize and hashCode for AQE, use DataSourceScanExec trait [datafusion-comet]
via GitHub
-
2025/04/04
[PR] Site/tpch data generator [datafusion-site]
via GitHub
-
2025/04/04
[PR] chore(deps): bump quote from 1.0.38 to 1.0.40 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] ExecutionPlan: add APIs for filter pushdown & optimizer rule to apply them [datafusion]
via GitHub
-
2025/04/04
[I] [complex types] Unsupported data type org.apache.spark.sql.Row [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Migrate datafusion/sql tests to insta, part6 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Fix duplicate unqualified Field name (schema error) on join queries [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Draft: Make Clickbench Q29 5x faster for datafusion [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Use `any` instead of `for_each` [datafusion]
via GitHub
-
2025/04/04
Re: [PR] fix!: incorrect coercion when comparing with string literals [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Format `Date32` to string given timestamp specifiers [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Improve performance of `first_value` by implementing special `GroupsAccumulator` [datafusion]
via GitHub
-
2025/04/04
[PR] fix: add map coercion for binary ops [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Add support for MSSQL IF/ELSE statements. [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [PR] feat: pushdown filter for native_iceberg_compat [datafusion-comet]
via GitHub
-
2025/04/04
Re: [I] AQE Unable to Rewrite Joins as Broadcast Hash Joins Due to Existing CometBroadcastHashJoin Operator [datafusion-comet]
via GitHub
-
2025/04/04
[I] Dynamic pruning filters from TopK state (optimize `ORDER BY LIMIT` queries) [datafusion]
via GitHub
-
2025/04/04
[PR] minor: Fix clippy warnings [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] feat: support merge for `Distribution` [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] chore: Attach Diagnostic to "incompatible type in unary expression" error [datafusion]
via GitHub
-
2025/04/04
Re: [PR] fix: check if handle has been initialized before closing [datafusion-comet]
via GitHub
-
2025/04/04
[PR] chore(deps): bump substrait from 0.54.0 to 0.55.0 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] feat: respect `batchSize/workerThreads/blockingThreads` configurations for native_iceberg_compat scan [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] Improve performance of `first_value` by implementing special `GroupsAccumulator` [datafusion]
via GitHub
-
2025/04/04
Re: [PR] feat(sql): add diagnostic for wrong number of function arguments [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Refactor file schema type coercions [datafusion]
via GitHub
-
2025/04/04
[PR] Fix: Snowflake ALTER SESSION cannot be followed by other statements. [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
[PR] Add support for MSSQL IF/ELSE statements. [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [D] More thorough contribution guideline [datafusion]
via GitHub
-
2025/04/04
Re: [PR] feat: make scheduler session context stateless [datafusion-ballista]
via GitHub
-
2025/04/04
Re: [PR] Only unnest source for `EmptyRelation` [datafusion]
via GitHub
-
2025/04/04
[PR] feat: make scheduler session context stateless [datafusion-ballista]
via GitHub
-
2025/04/04
Re: [PR] Change default `EXPLAIN` format in `datafusion-cli` to `tree` format [datafusion]
via GitHub
-
2025/04/04
Re: [PR] fix: Making shuffle files generated in native shuffle mode reclaimable [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] Migrate datafusion/sql tests to insta, part6 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Clean up hash_join's ExecutionPlan::execute [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Update ClickBench queries to avoid to_timestamp_seconds [datafusion]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Blog post on Parquet pruning in datafusion [datafusion-site]
via GitHub
-
2025/04/04
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/04
Re: [PR] perf: replace `merge` `uninitiated_partitions` `VecDeque<usize>` with custom fixed size queue [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Add SQL logic tests for compound field access in JOIN conditions [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Add dynamic pruning filters from TopK state [datafusion]
via GitHub
-
2025/04/04
Re: [PR] 1065/enhancement/add ctx to `__init__.py` [datafusion-python]
via GitHub
-
2025/04/04
[PR] Fix parquet pruning blog post hyperlink [datafusion-site]
via GitHub
-
2025/04/04
Re: [I] Dependency conflict with rquest due to async-compression and xz2 linking to lzma [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Improvement/improve wildcard error 15004 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] feat: support merge for `Distribution` [datafusion]
via GitHub
-
2025/04/04
Re: [PR] chore: Fix some inconsistencies in memory pool configuration [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] fix: nested window function [datafusion]
via GitHub
-
2025/04/04
Re: [I] Unable to query file on Kubernetes on AWS EKS, for remote-sql.rs example [datafusion-ballista]
via GitHub
-
2025/04/04
Re: [PR] perf: replace `merge` `uninitiated_partitions` `VecDeque<usize>` with custom fixed size queue [datafusion]
via GitHub
-
2025/04/04
Re: [I] Unable to query file on Kubernetes on AWS EKS, for remote-sql.rs example [datafusion-ballista]
via GitHub
-
2025/04/04
Re: [PR] Support Avg distinct for `float64` type [datafusion]
via GitHub
-
2025/04/04
[PR] Migrate datafusion/sql tests to insta, part4 [datafusion]
via GitHub
-
2025/04/04
Re: [I] Unsupported Arrow Vector for export: class org.apache.arrow.vector.complex.ListVector [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/04
Re: [PR] feat: add test to check for `ctx.read_json()` [datafusion-ballista]
via GitHub
-
2025/04/04
Re: [I] Use spill manager in row hasher [datafusion]
via GitHub
-
2025/04/04
Re: [I] Add most functions to the Expr class so that they're chainable. [datafusion-python]
via GitHub
-
2025/04/04
Re: [PR] fix: check if handle has been initialized before closing [datafusion-comet]
via GitHub
-
2025/04/04
[PR] chore(deps): bump blake3 from 1.7.0 to 1.8.0 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Introduce load-balanced `split_groups_by_statistics` method [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Introduce load-balanced `split_groups_by_statistics` method [datafusion]
via GitHub
-
2025/04/04
Re: [I] TPCH DataGen Not working [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] (WIP) Upgrading to arrow 55 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Introduce load-balanced `split_groups_by_statistics` method [datafusion]
via GitHub
-
2025/04/04
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/04
Re: [PR] [ignore] see which tests do not explicitly enable Comet [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Docs : Added Sql examples for window Functions : `nth_val` , etc [datafusion]
via GitHub
-
2025/04/04
Re: [PR] chore: Enable Comet explicitly in `CometTPCDSQueryTestSuite` [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] feat: Add config `max_temp_directory_size` to limit max disk usage for spilling queries [datafusion]
via GitHub
-
2025/04/04
Re: [I] Add support for S3 Object Store in default binaries [datafusion-ballista]
via GitHub
-
2025/04/04
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/04
Re: [PR] feat: introduce hadoop mini cluster to test native scan on hdfs [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] Improvement/improve wildcard error 15004 [datafusion]
via GitHub
-
2025/04/04
Re: [PR] docs: various improvements to tuning guide [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] Remove CoalescePartitions insertion from HashJoinExec [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Add doc for the `statistics_from_parquet_meta_calc method` [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Add support for PostgreSQL 'IN <SetExpression>' [datafusion-sqlparser-rs]
via GitHub
-
2025/04/04
Re: [PR] Add dynamic pruning filters from TopK state [datafusion]
via GitHub
-
2025/04/04
Re: [PR] parquet reader: move pruning predicate creation from ParquetSource to ParquetOpener [datafusion]
via GitHub
-
2025/04/04
[PR] Introduce selection vector repartitioning [datafusion]
via GitHub
-
2025/04/04
Re: [I] [EPIC] A collection of tickets for improving sorting larger than memory datasets / spilling sorts [datafusion]
via GitHub
-
2025/04/04
Re: [PR] chore: Override node name for CometSparkToColumnar [datafusion-comet]
via GitHub
-
2025/04/04
Re: [PR] Blog post on Parquet pruning in datafusion [datafusion-site]
via GitHub
-
2025/04/04
Re: [I] Building project takes a *long* time (esp compilation time for `datafusion` core crate) [datafusion]
via GitHub
-
2025/04/04
Re: [I] Missing crates.io 46.0.1 release for the `datafusion` crate [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Blog post on Parquet filter pushdown [datafusion-site]
via GitHub
-
2025/04/04
Re: [PR] Format `Date32` to string given timestamp specifiers [datafusion]
via GitHub
-
2025/04/04
Re: [PR] Change default `EXPLAIN` format in `datafusion-cli` to `tree` format [datafusion]
via GitHub
-
2025/04/04
Re: [I] datafusion-cli: document reading partitioned parquet [datafusion]
via GitHub