This is an automated email from the ASF dual-hosted git repository.

github-bot pushed a commit to branch asf-staging
in repository https://gitbox.apache.org/repos/asf/datafusion-site.git


The following commit(s) were added to refs/heads/asf-staging by this push:
     new f1dee57  Commit build products
f1dee57 is described below

commit f1dee57c3b07b4893f94ca12e433dbc2c8c31860
Author: Build Pelican (action) <[email protected]>
AuthorDate: Sat Jan 24 12:05:49 2026 +0000

    Commit build products
---
 blog/2026/01/08/datafusion-52.0.0/index.html | 10 ----------
 blog/feeds/all-en.atom.xml                   |  8 --------
 blog/feeds/blog.atom.xml                     |  8 --------
 blog/feeds/pmc.atom.xml                      |  8 --------
 4 files changed, 34 deletions(-)

diff --git a/blog/2026/01/08/datafusion-52.0.0/index.html 
b/blog/2026/01/08/datafusion-52.0.0/index.html
index cf9383e..8c9a858 100644
--- a/blog/2026/01/08/datafusion-52.0.0/index.html
+++ b/blog/2026/01/08/datafusion-52.0.0/index.html
@@ -49,7 +49,6 @@
 <li><a href="#performance-improvements">Performance Improvements 🚀</a><ul>
 <li><a href="#faster-case-expressions">Faster CASE Expressions</a></li>
 <li><a href="#new-merge-join">New Merge Join</a></li>
-<li><a href="#rewritten-merge-join">Rewritten merge join</a></li>
 <li><a href="#caching-improvements">Caching Improvements</a></li>
 <li><a href="#improved-hash-join-filter-pushdown">Improved Hash Join Filter 
Pushdown</a></li>
 </ul>
@@ -116,14 +115,6 @@ case in <a 
href="https://github.com/apache/datafusion/issues/18487">#18487</a>,
 <a href="https://github.com/apache/datafusion/pull/18875">#18875</a> show 
dramatic gains for TPC-H Q21 (minutes to milliseconds) while
 leaving other queries unchanged or modestly faster. Thanks to <a 
href="https://github.com/mbutrovich">mbutrovich</a> for
 the implementation and reviews from <a 
href="https://github.com/Dandandan">Dandandan</a>.</p>
-<h3 id="rewritten-merge-join">Rewritten merge join<a class="headerlink" 
href="#rewritten-merge-join" title="Permanent link">¶</a></h3>
-<p>DataFusion 52 includes a rewrite of the sort-merge join (SMJ) output 
buffering to
-avoid excessive <code>concat_batches</code> work and to use 
<code>BatchCoalescer</code> internally and
-for final output. This change targets pathological slowdowns like the reported
-LeftAnti join case in <a 
href="https://github.com/apache/datafusion/issues/18487">#18487</a>, which also 
affected Comet workloads that rely on
-SMJ. Benchmarks in <a 
href="https://github.com/apache/datafusion/pull/18875">#18875</a> show dramatic 
gains for TPC-H Q21 (moving from
-minutes to milliseconds) while leaving most other queries unchanged or modestly
-faster, and the update is fully internal with no user-facing API changes.</p>
 <h3 id="caching-improvements">Caching Improvements<a class="headerlink" 
href="#caching-improvements" title="Permanent link">¶</a></h3>
 <p>This release also includes several additional caching improvements.</p>
 <p>A new statistics cache for Parquet Metadata avoids repeatedly 
(re)calculating
@@ -303,7 +294,6 @@ can find out how to reach us on the <a 
href="https://datafusion.apache.org/contr
 <li><a href="#performance-improvements">Performance Improvements 🚀</a><ul>
 <li><a href="#faster-case-expressions">Faster CASE Expressions</a></li>
 <li><a href="#new-merge-join">New Merge Join</a></li>
-<li><a href="#rewritten-merge-join">Rewritten merge join</a></li>
 <li><a href="#caching-improvements">Caching Improvements</a></li>
 <li><a href="#improved-hash-join-filter-pushdown">Improved Hash Join Filter 
Pushdown</a></li>
 </ul>
diff --git a/blog/feeds/all-en.atom.xml b/blog/feeds/all-en.atom.xml
index 0780248..5b43a53 100644
--- a/blog/feeds/all-en.atom.xml
+++ b/blog/feeds/all-en.atom.xml
@@ -353,14 +353,6 @@ case in &lt;a 
href="https://github.com/apache/datafusion/issues/18487"&gt;#18487
 &lt;a 
href="https://github.com/apache/datafusion/pull/18875"&gt;#18875&lt;/a&gt; show 
dramatic gains for TPC-H Q21 (minutes to milliseconds) while
 leaving other queries unchanged or modestly faster. Thanks to &lt;a 
href="https://github.com/mbutrovich"&gt;mbutrovich&lt;/a&gt; for
 the implementation and reviews from &lt;a 
href="https://github.com/Dandandan"&gt;Dandandan&lt;/a&gt;.&lt;/p&gt;
-&lt;h3 id="rewritten-merge-join"&gt;Rewritten merge join&lt;a 
class="headerlink" href="#rewritten-merge-join" title="Permanent 
link"&gt;¶&lt;/a&gt;&lt;/h3&gt;
-&lt;p&gt;DataFusion 52 includes a rewrite of the sort-merge join (SMJ) output 
buffering to
-avoid excessive &lt;code&gt;concat_batches&lt;/code&gt; work and to use 
&lt;code&gt;BatchCoalescer&lt;/code&gt; internally and
-for final output. This change targets pathological slowdowns like the reported
-LeftAnti join case in &lt;a 
href="https://github.com/apache/datafusion/issues/18487"&gt;#18487&lt;/a&gt;, 
which also affected Comet workloads that rely on
-SMJ. Benchmarks in &lt;a 
href="https://github.com/apache/datafusion/pull/18875"&gt;#18875&lt;/a&gt; show 
dramatic gains for TPC-H Q21 (moving from
-minutes to milliseconds) while leaving most other queries unchanged or modestly
-faster, and the update is fully internal with no user-facing API 
changes.&lt;/p&gt;
 &lt;h3 id="caching-improvements"&gt;Caching Improvements&lt;a 
class="headerlink" href="#caching-improvements" title="Permanent 
link"&gt;¶&lt;/a&gt;&lt;/h3&gt;
 &lt;p&gt;This release also includes several additional caching 
improvements.&lt;/p&gt;
 &lt;p&gt;A new statistics cache for Parquet Metadata avoids repeatedly 
(re)calculating
diff --git a/blog/feeds/blog.atom.xml b/blog/feeds/blog.atom.xml
index 78d6f6c..7b94de3 100644
--- a/blog/feeds/blog.atom.xml
+++ b/blog/feeds/blog.atom.xml
@@ -353,14 +353,6 @@ case in &lt;a 
href="https://github.com/apache/datafusion/issues/18487"&gt;#18487
 &lt;a 
href="https://github.com/apache/datafusion/pull/18875"&gt;#18875&lt;/a&gt; show 
dramatic gains for TPC-H Q21 (minutes to milliseconds) while
 leaving other queries unchanged or modestly faster. Thanks to &lt;a 
href="https://github.com/mbutrovich"&gt;mbutrovich&lt;/a&gt; for
 the implementation and reviews from &lt;a 
href="https://github.com/Dandandan"&gt;Dandandan&lt;/a&gt;.&lt;/p&gt;
-&lt;h3 id="rewritten-merge-join"&gt;Rewritten merge join&lt;a 
class="headerlink" href="#rewritten-merge-join" title="Permanent 
link"&gt;¶&lt;/a&gt;&lt;/h3&gt;
-&lt;p&gt;DataFusion 52 includes a rewrite of the sort-merge join (SMJ) output 
buffering to
-avoid excessive &lt;code&gt;concat_batches&lt;/code&gt; work and to use 
&lt;code&gt;BatchCoalescer&lt;/code&gt; internally and
-for final output. This change targets pathological slowdowns like the reported
-LeftAnti join case in &lt;a 
href="https://github.com/apache/datafusion/issues/18487"&gt;#18487&lt;/a&gt;, 
which also affected Comet workloads that rely on
-SMJ. Benchmarks in &lt;a 
href="https://github.com/apache/datafusion/pull/18875"&gt;#18875&lt;/a&gt; show 
dramatic gains for TPC-H Q21 (moving from
-minutes to milliseconds) while leaving most other queries unchanged or modestly
-faster, and the update is fully internal with no user-facing API 
changes.&lt;/p&gt;
 &lt;h3 id="caching-improvements"&gt;Caching Improvements&lt;a 
class="headerlink" href="#caching-improvements" title="Permanent 
link"&gt;¶&lt;/a&gt;&lt;/h3&gt;
 &lt;p&gt;This release also includes several additional caching 
improvements.&lt;/p&gt;
 &lt;p&gt;A new statistics cache for Parquet Metadata avoids repeatedly 
(re)calculating
diff --git a/blog/feeds/pmc.atom.xml b/blog/feeds/pmc.atom.xml
index 7091d9f..1a0ec8d 100644
--- a/blog/feeds/pmc.atom.xml
+++ b/blog/feeds/pmc.atom.xml
@@ -69,14 +69,6 @@ case in &lt;a 
href="https://github.com/apache/datafusion/issues/18487"&gt;#18487
 &lt;a 
href="https://github.com/apache/datafusion/pull/18875"&gt;#18875&lt;/a&gt; show 
dramatic gains for TPC-H Q21 (minutes to milliseconds) while
 leaving other queries unchanged or modestly faster. Thanks to &lt;a 
href="https://github.com/mbutrovich"&gt;mbutrovich&lt;/a&gt; for
 the implementation and reviews from &lt;a 
href="https://github.com/Dandandan"&gt;Dandandan&lt;/a&gt;.&lt;/p&gt;
-&lt;h3 id="rewritten-merge-join"&gt;Rewritten merge join&lt;a 
class="headerlink" href="#rewritten-merge-join" title="Permanent 
link"&gt;¶&lt;/a&gt;&lt;/h3&gt;
-&lt;p&gt;DataFusion 52 includes a rewrite of the sort-merge join (SMJ) output 
buffering to
-avoid excessive &lt;code&gt;concat_batches&lt;/code&gt; work and to use 
&lt;code&gt;BatchCoalescer&lt;/code&gt; internally and
-for final output. This change targets pathological slowdowns like the reported
-LeftAnti join case in &lt;a 
href="https://github.com/apache/datafusion/issues/18487"&gt;#18487&lt;/a&gt;, 
which also affected Comet workloads that rely on
-SMJ. Benchmarks in &lt;a 
href="https://github.com/apache/datafusion/pull/18875"&gt;#18875&lt;/a&gt; show 
dramatic gains for TPC-H Q21 (moving from
-minutes to milliseconds) while leaving most other queries unchanged or modestly
-faster, and the update is fully internal with no user-facing API 
changes.&lt;/p&gt;
 &lt;h3 id="caching-improvements"&gt;Caching Improvements&lt;a 
class="headerlink" href="#caching-improvements" title="Permanent 
link"&gt;¶&lt;/a&gt;&lt;/h3&gt;
 &lt;p&gt;This release also includes several additional caching 
improvements.&lt;/p&gt;
 &lt;p&gt;A new statistics cache for Parquet Metadata avoids repeatedly 
(re)calculating

