This is an automated email from the ASF dual-hosted git repository.
github-bot pushed a commit to branch asf-staging
in repository https://gitbox.apache.org/repos/asf/datafusion-site.git
The following commit(s) were added to refs/heads/asf-staging by this push:
new f1dee57 Commit build products
f1dee57 is described below
commit f1dee57c3b07b4893f94ca12e433dbc2c8c31860
Author: Build Pelican (action) <[email protected]>
AuthorDate: Sat Jan 24 12:05:49 2026 +0000
Commit build products
---
blog/2026/01/08/datafusion-52.0.0/index.html | 10 ----------
blog/feeds/all-en.atom.xml | 8 --------
blog/feeds/blog.atom.xml | 8 --------
blog/feeds/pmc.atom.xml | 8 --------
4 files changed, 34 deletions(-)
diff --git a/blog/2026/01/08/datafusion-52.0.0/index.html
b/blog/2026/01/08/datafusion-52.0.0/index.html
index cf9383e..8c9a858 100644
--- a/blog/2026/01/08/datafusion-52.0.0/index.html
+++ b/blog/2026/01/08/datafusion-52.0.0/index.html
@@ -49,7 +49,6 @@
<li><a href="#performance-improvements">Performance Improvements 🚀</a><ul>
<li><a href="#faster-case-expressions">Faster CASE Expressions</a></li>
<li><a href="#new-merge-join">New Merge Join</a></li>
-<li><a href="#rewritten-merge-join">Rewritten merge join</a></li>
<li><a href="#caching-improvements">Caching Improvements</a></li>
<li><a href="#improved-hash-join-filter-pushdown">Improved Hash Join Filter
Pushdown</a></li>
</ul>
@@ -116,14 +115,6 @@ case in <a
href="https://github.com/apache/datafusion/issues/18487">#18487</a>,
<a href="https://github.com/apache/datafusion/pull/18875">#18875</a> show
dramatic gains for TPC-H Q21 (minutes to milliseconds) while
leaving other queries unchanged or modestly faster. Thanks to <a
href="https://github.com/mbutrovich">mbutrovich</a> for
the implementation and reviews from <a
href="https://github.com/Dandandan">Dandandan</a>.</p>
-<h3 id="rewritten-merge-join">Rewritten merge join<a class="headerlink"
href="#rewritten-merge-join" title="Permanent link">¶</a></h3>
-<p>DataFusion 52 includes a rewrite of the sort-merge join (SMJ) output
buffering to
-avoid excessive <code>concat_batches</code> work and to use
<code>BatchCoalescer</code> internally and
-for final output. This change targets pathological slowdowns like the reported
-LeftAnti join case in <a
href="https://github.com/apache/datafusion/issues/18487">#18487</a>, which also
affected Comet workloads that rely on
-SMJ. Benchmarks in <a
href="https://github.com/apache/datafusion/pull/18875">#18875</a> show dramatic
gains for TPC-H Q21 (moving from
-minutes to milliseconds) while leaving most other queries unchanged or modestly
-faster, and the update is fully internal with no user-facing API changes.</p>
<h3 id="caching-improvements">Caching Improvements<a class="headerlink"
href="#caching-improvements" title="Permanent link">¶</a></h3>
<p>This release also includes several additional caching improvements.</p>
<p>A new statistics cache for Parquet Metadata avoids repeatedly
(re)calculating
@@ -303,7 +294,6 @@ can find out how to reach us on the <a
href="https://datafusion.apache.org/contr
<li><a href="#performance-improvements">Performance Improvements 🚀</a><ul>
<li><a href="#faster-case-expressions">Faster CASE Expressions</a></li>
<li><a href="#new-merge-join">New Merge Join</a></li>
-<li><a href="#rewritten-merge-join">Rewritten merge join</a></li>
<li><a href="#caching-improvements">Caching Improvements</a></li>
<li><a href="#improved-hash-join-filter-pushdown">Improved Hash Join Filter
Pushdown</a></li>
</ul>
diff --git a/blog/feeds/all-en.atom.xml b/blog/feeds/all-en.atom.xml
index 0780248..5b43a53 100644
--- a/blog/feeds/all-en.atom.xml
+++ b/blog/feeds/all-en.atom.xml
@@ -353,14 +353,6 @@ case in <a
href="https://github.com/apache/datafusion/issues/18487">#18487
<a
href="https://github.com/apache/datafusion/pull/18875">#18875</a> show
dramatic gains for TPC-H Q21 (minutes to milliseconds) while
leaving other queries unchanged or modestly faster. Thanks to <a
href="https://github.com/mbutrovich">mbutrovich</a> for
the implementation and reviews from <a
href="https://github.com/Dandandan">Dandandan</a>.</p>
-<h3 id="rewritten-merge-join">Rewritten merge join<a
class="headerlink" href="#rewritten-merge-join" title="Permanent
link">¶</a></h3>
-<p>DataFusion 52 includes a rewrite of the sort-merge join (SMJ) output
buffering to
-avoid excessive <code>concat_batches</code> work and to use
<code>BatchCoalescer</code> internally and
-for final output. This change targets pathological slowdowns like the reported
-LeftAnti join case in <a
href="https://github.com/apache/datafusion/issues/18487">#18487</a>,
which also affected Comet workloads that rely on
-SMJ. Benchmarks in <a
href="https://github.com/apache/datafusion/pull/18875">#18875</a> show
dramatic gains for TPC-H Q21 (moving from
-minutes to milliseconds) while leaving most other queries unchanged or modestly
-faster, and the update is fully internal with no user-facing API
changes.</p>
<h3 id="caching-improvements">Caching Improvements<a
class="headerlink" href="#caching-improvements" title="Permanent
link">¶</a></h3>
<p>This release also includes several additional caching
improvements.</p>
<p>A new statistics cache for Parquet Metadata avoids repeatedly
(re)calculating
diff --git a/blog/feeds/blog.atom.xml b/blog/feeds/blog.atom.xml
index 78d6f6c..7b94de3 100644
--- a/blog/feeds/blog.atom.xml
+++ b/blog/feeds/blog.atom.xml
@@ -353,14 +353,6 @@ case in <a
href="https://github.com/apache/datafusion/issues/18487">#18487
<a
href="https://github.com/apache/datafusion/pull/18875">#18875</a> show
dramatic gains for TPC-H Q21 (minutes to milliseconds) while
leaving other queries unchanged or modestly faster. Thanks to <a
href="https://github.com/mbutrovich">mbutrovich</a> for
the implementation and reviews from <a
href="https://github.com/Dandandan">Dandandan</a>.</p>
-<h3 id="rewritten-merge-join">Rewritten merge join<a
class="headerlink" href="#rewritten-merge-join" title="Permanent
link">¶</a></h3>
-<p>DataFusion 52 includes a rewrite of the sort-merge join (SMJ) output
buffering to
-avoid excessive <code>concat_batches</code> work and to use
<code>BatchCoalescer</code> internally and
-for final output. This change targets pathological slowdowns like the reported
-LeftAnti join case in <a
href="https://github.com/apache/datafusion/issues/18487">#18487</a>,
which also affected Comet workloads that rely on
-SMJ. Benchmarks in <a
href="https://github.com/apache/datafusion/pull/18875">#18875</a> show
dramatic gains for TPC-H Q21 (moving from
-minutes to milliseconds) while leaving most other queries unchanged or modestly
-faster, and the update is fully internal with no user-facing API
changes.</p>
<h3 id="caching-improvements">Caching Improvements<a
class="headerlink" href="#caching-improvements" title="Permanent
link">¶</a></h3>
<p>This release also includes several additional caching
improvements.</p>
<p>A new statistics cache for Parquet Metadata avoids repeatedly
(re)calculating
diff --git a/blog/feeds/pmc.atom.xml b/blog/feeds/pmc.atom.xml
index 7091d9f..1a0ec8d 100644
--- a/blog/feeds/pmc.atom.xml
+++ b/blog/feeds/pmc.atom.xml
@@ -69,14 +69,6 @@ case in <a
href="https://github.com/apache/datafusion/issues/18487">#18487
<a
href="https://github.com/apache/datafusion/pull/18875">#18875</a> show
dramatic gains for TPC-H Q21 (minutes to milliseconds) while
leaving other queries unchanged or modestly faster. Thanks to <a
href="https://github.com/mbutrovich">mbutrovich</a> for
the implementation and reviews from <a
href="https://github.com/Dandandan">Dandandan</a>.</p>
-<h3 id="rewritten-merge-join">Rewritten merge join<a
class="headerlink" href="#rewritten-merge-join" title="Permanent
link">¶</a></h3>
-<p>DataFusion 52 includes a rewrite of the sort-merge join (SMJ) output
buffering to
-avoid excessive <code>concat_batches</code> work and to use
<code>BatchCoalescer</code> internally and
-for final output. This change targets pathological slowdowns like the reported
-LeftAnti join case in <a
href="https://github.com/apache/datafusion/issues/18487">#18487</a>,
which also affected Comet workloads that rely on
-SMJ. Benchmarks in <a
href="https://github.com/apache/datafusion/pull/18875">#18875</a> show
dramatic gains for TPC-H Q21 (moving from
-minutes to milliseconds) while leaving most other queries unchanged or modestly
-faster, and the update is fully internal with no user-facing API
changes.</p>
<h3 id="caching-improvements">Caching Improvements<a
class="headerlink" href="#caching-improvements" title="Permanent
link">¶</a></h3>
<p>This release also includes several additional caching
improvements.</p>
<p>A new statistics cache for Parquet Metadata avoids repeatedly
(re)calculating
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]