This is an automated email from the ASF dual-hosted git repository.
github-bot pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/datafusion-site.git
The following commit(s) were added to refs/heads/asf-site by this push:
new 8772194 Commit build products
8772194 is described below
commit 87721940d8fb7e918a12363eb4ec99837796713d
Author: Build Pelican (action) <[email protected]>
AuthorDate: Fri Jan 30 22:44:54 2026 +0000
Commit build products
---
.../01/30}/datafusion-comet-0.13.0/index.html | 6 +-
output/author/pmc.html | 28 +--
output/category/blog.html | 50 ++---
output/feed.xml | 32 +--
output/feeds/all-en.atom.xml | 228 ++++++++++-----------
output/feeds/blog.atom.xml | 228 ++++++++++-----------
output/feeds/pmc.atom.xml | 228 ++++++++++-----------
output/feeds/pmc.rss.xml | 22 +-
output/index.html | 54 ++---
9 files changed, 438 insertions(+), 438 deletions(-)
diff --git a/output/2025/12/04/datafusion-comet-0.13.0/index.html
b/output/2026/01/30/datafusion-comet-0.13.0/index.html
similarity index 98%
rename from output/2025/12/04/datafusion-comet-0.13.0/index.html
rename to output/2026/01/30/datafusion-comet-0.13.0/index.html
index 60b14e9..ec142ff 100644
--- a/output/2025/12/04/datafusion-comet-0.13.0/index.html
+++ b/output/2026/01/30/datafusion-comet-0.13.0/index.html
@@ -4,7 +4,7 @@
<meta charset="utf-8">
<meta http-equiv="x-ua-compatible" content="ie=edge">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
- <title>Apache DataFusion Comet 0.12.0 Release - Apache DataFusion
Blog</title>
+ <title>Apache DataFusion Comet 0.13.0 Release - Apache DataFusion
Blog</title>
<link href="/blog/css/bootstrap.min.css" rel="stylesheet">
<link href="/blog/css/fontawesome.all.min.css" rel="stylesheet">
<link href="/blog/css/headerlink.css" rel="stylesheet">
@@ -40,9 +40,9 @@
<div class="row justify-content-center">
<div class="col-12 col-md-8 main-content">
<h1>
- Apache DataFusion Comet 0.12.0 Release
+ Apache DataFusion Comet 0.13.0 Release
</h1>
- <p>Posted on: Thu 04 December 2025 by pmc</p>
+ <p>Posted on: Fri 30 January 2026 by pmc</p>
<aside class="toc-container d-md-none mb-2">
<div class="toc"><span class="toctitle">Contents</span><ul>
diff --git a/output/author/pmc.html b/output/author/pmc.html
index 7756d6f..eab42dc 100644
--- a/output/author/pmc.html
+++ b/output/author/pmc.html
@@ -21,9 +21,9 @@
<ol id="post-list">
<li><article class="hentry">
- <header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0"
rel="bookmark" title="Permalink to Apache DataFusion 52.0.0 Released">Apache
DataFusion 52.0.0 Released</a></h2> </header>
+ <header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2026/01/30/datafusion-comet-0.13.0"
rel="bookmark" title="Permalink to Apache DataFusion Comet 0.13.0
Release">Apache DataFusion Comet 0.13.0 Release</a></h2> </header>
<footer class="post-info">
- <time class="published"
datetime="2026-01-12T00:00:00+00:00"> Mon 12 January 2026 </time>
+ <time class="published"
datetime="2026-01-30T00:00:00+00:00"> Fri 30 January 2026 </time>
<address class="vcard author">By
<a class="url fn"
href="https://datafusion.apache.org/blog/author/pmc.html">pmc</a>
</address>
@@ -47,17 +47,15 @@ limitations under the License.
{% endcomment %}
-->
-<p>We are proud to announce the release of <a
href="https://crates.io/crates/datafusion/52.0.0">DataFusion 52.0.0</a>. This
post highlights
-some of the major improvements since <a
href="https://datafusion.apache.org/blog/2025/11/25/datafusion-51.0.0/">DataFusion
51.0.0</a>. The complete list of
-changes is available in the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md">changelog</a>.
Thanks to the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md#credits">121
contributors</a> for
-making this release possible.</p>
-<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
-<p>We continue to …</p> </div><!-- /.entry-content -->
+<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of the <a
href="https://datafusion.apache.org/comet/">Comet</a> subproject.</p>
+<p>Comet is an accelerator for Apache Spark that translates Spark physical
plans to DataFusion physical plans for
+improved performance and efficiency without requiring any code changes.</p>
+<p>This release covers approximately eight weeks of development …</p>
</div><!-- /.entry-content -->
</article></li>
<li><article class="hentry">
- <header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.13.0"
rel="bookmark" title="Permalink to Apache DataFusion Comet 0.12.0
Release">Apache DataFusion Comet 0.12.0 Release</a></h2> </header>
+ <header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0"
rel="bookmark" title="Permalink to Apache DataFusion 52.0.0 Released">Apache
DataFusion 52.0.0 Released</a></h2> </header>
<footer class="post-info">
- <time class="published"
datetime="2025-12-04T00:00:00+00:00"> Thu 04 December 2025 </time>
+ <time class="published"
datetime="2026-01-12T00:00:00+00:00"> Mon 12 January 2026 </time>
<address class="vcard author">By
<a class="url fn"
href="https://datafusion.apache.org/blog/author/pmc.html">pmc</a>
</address>
@@ -81,10 +79,12 @@ limitations under the License.
{% endcomment %}
-->
-<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of the <a
href="https://datafusion.apache.org/comet/">Comet</a> subproject.</p>
-<p>Comet is an accelerator for Apache Spark that translates Spark physical
plans to DataFusion physical plans for
-improved performance and efficiency without requiring any code changes.</p>
-<p>This release covers approximately eight weeks of development …</p>
</div><!-- /.entry-content -->
+<p>We are proud to announce the release of <a
href="https://crates.io/crates/datafusion/52.0.0">DataFusion 52.0.0</a>. This
post highlights
+some of the major improvements since <a
href="https://datafusion.apache.org/blog/2025/11/25/datafusion-51.0.0/">DataFusion
51.0.0</a>. The complete list of
+changes is available in the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md">changelog</a>.
Thanks to the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md#credits">121
contributors</a> for
+making this release possible.</p>
+<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
+<p>We continue to …</p> </div><!-- /.entry-content -->
</article></li>
<li><article class="hentry">
<header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.12.0"
rel="bookmark" title="Permalink to Apache DataFusion Comet 0.12.0
Release">Apache DataFusion Comet 0.12.0 Release</a></h2> </header>
diff --git a/output/category/blog.html b/output/category/blog.html
index 3a13a58..7bcbed7 100644
--- a/output/category/blog.html
+++ b/output/category/blog.html
@@ -22,9 +22,9 @@
<ol id="post-list">
<li><article class="hentry">
- <header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0"
rel="bookmark" title="Permalink to Apache DataFusion 52.0.0 Released">Apache
DataFusion 52.0.0 Released</a></h2> </header>
+ <header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2026/01/30/datafusion-comet-0.13.0"
rel="bookmark" title="Permalink to Apache DataFusion Comet 0.13.0
Release">Apache DataFusion Comet 0.13.0 Release</a></h2> </header>
<footer class="post-info">
- <time class="published"
datetime="2026-01-12T00:00:00+00:00"> Mon 12 January 2026 </time>
+ <time class="published"
datetime="2026-01-30T00:00:00+00:00"> Fri 30 January 2026 </time>
<address class="vcard author">By
<a class="url fn"
href="https://datafusion.apache.org/blog/author/pmc.html">pmc</a>
</address>
@@ -48,19 +48,17 @@ limitations under the License.
{% endcomment %}
-->
-<p>We are proud to announce the release of <a
href="https://crates.io/crates/datafusion/52.0.0">DataFusion 52.0.0</a>. This
post highlights
-some of the major improvements since <a
href="https://datafusion.apache.org/blog/2025/11/25/datafusion-51.0.0/">DataFusion
51.0.0</a>. The complete list of
-changes is available in the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md">changelog</a>.
Thanks to the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md#credits">121
contributors</a> for
-making this release possible.</p>
-<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
-<p>We continue to …</p> </div><!-- /.entry-content -->
+<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of the <a
href="https://datafusion.apache.org/comet/">Comet</a> subproject.</p>
+<p>Comet is an accelerator for Apache Spark that translates Spark physical
plans to DataFusion physical plans for
+improved performance and efficiency without requiring any code changes.</p>
+<p>This release covers approximately eight weeks of development …</p>
</div><!-- /.entry-content -->
</article></li>
<li><article class="hentry">
- <header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2026/01/12/extending-sql"
rel="bookmark" title="Permalink to Extending SQL in DataFusion: from ->> to
TABLESAMPLE">Extending SQL in DataFusion: from ->> to TABLESAMPLE</a></h2>
</header>
+ <header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0"
rel="bookmark" title="Permalink to Apache DataFusion 52.0.0 Released">Apache
DataFusion 52.0.0 Released</a></h2> </header>
<footer class="post-info">
<time class="published"
datetime="2026-01-12T00:00:00+00:00"> Mon 12 January 2026 </time>
<address class="vcard author">By
- <a class="url fn"
href="https://datafusion.apache.org/blog/author/geoffrey-claude-datadog.html">Geoffrey
Claude (Datadog)</a>
+ <a class="url fn"
href="https://datafusion.apache.org/blog/author/pmc.html">pmc</a>
</address>
</footer><!-- /.post-info -->
<div class="entry-content"> <!--
@@ -82,15 +80,19 @@ limitations under the License.
{% endcomment %}
-->
-<p>If you embed <a href="https://datafusion.apache.org/">DataFusion</a> in
your product, your users will eventually run SQL that DataFusion does not
recognize. Not because the query is unreasonable, but because SQL in practice
includes many dialects and system-specific statements.</p>
-<p>Suppose you store data as Parquet files on S3 and want users to attach an
…</p> </div><!-- /.entry-content -->
+<p>We are proud to announce the release of <a
href="https://crates.io/crates/datafusion/52.0.0">DataFusion 52.0.0</a>. This
post highlights
+some of the major improvements since <a
href="https://datafusion.apache.org/blog/2025/11/25/datafusion-51.0.0/">DataFusion
51.0.0</a>. The complete list of
+changes is available in the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md">changelog</a>.
Thanks to the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md#credits">121
contributors</a> for
+making this release possible.</p>
+<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
+<p>We continue to …</p> </div><!-- /.entry-content -->
</article></li>
<li><article class="hentry">
- <header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2025/12/15/avoid-consecutive-repartitions"
rel="bookmark" title="Permalink to Optimizing Repartitions in DataFusion: How
I Went From Database Noob to Core Contribution">Optimizing Repartitions in
DataFusion: How I Went From Database Noob to Core Contribution</a></h2>
</header>
+ <header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2026/01/12/extending-sql"
rel="bookmark" title="Permalink to Extending SQL in DataFusion: from ->> to
TABLESAMPLE">Extending SQL in DataFusion: from ->> to TABLESAMPLE</a></h2>
</header>
<footer class="post-info">
- <time class="published"
datetime="2025-12-15T00:00:00+00:00"> Mon 15 December 2025 </time>
+ <time class="published"
datetime="2026-01-12T00:00:00+00:00"> Mon 12 January 2026 </time>
<address class="vcard author">By
- <a class="url fn"
href="https://datafusion.apache.org/blog/author/gene-bordegaray.html">Gene
Bordegaray</a>
+ <a class="url fn"
href="https://datafusion.apache.org/blog/author/geoffrey-claude-datadog.html">Geoffrey
Claude (Datadog)</a>
</address>
</footer><!-- /.post-info -->
<div class="entry-content"> <!--
@@ -111,17 +113,16 @@ See the License for the specific language governing
permissions and
limitations under the License.
{% endcomment %}
-->
-<div style="display: flex; align-items: center; gap: 20px; margin-bottom:
20px;">
-<div style="flex: 1;">
-Databases are some of the most complex yet interesting pieces of software.
They are amazing pieces of abstraction: query engines optimize and execute
complex plans, storage engines provide sophisticated infrastructure as the
backbone of the system, while intricate file formats lay the groundwork for
particular workloads. All of this is …</div></div> </div><!-- /.entry-content
-->
+<p>If you embed <a href="https://datafusion.apache.org/">DataFusion</a> in
your product, your users will eventually run SQL that DataFusion does not
recognize. Not because the query is unreasonable, but because SQL in practice
includes many dialects and system-specific statements.</p>
+<p>Suppose you store data as Parquet files on S3 and want users to attach an
…</p> </div><!-- /.entry-content -->
</article></li>
<li><article class="hentry">
- <header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.13.0"
rel="bookmark" title="Permalink to Apache DataFusion Comet 0.12.0
Release">Apache DataFusion Comet 0.12.0 Release</a></h2> </header>
+ <header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2025/12/15/avoid-consecutive-repartitions"
rel="bookmark" title="Permalink to Optimizing Repartitions in DataFusion: How
I Went From Database Noob to Core Contribution">Optimizing Repartitions in
DataFusion: How I Went From Database Noob to Core Contribution</a></h2>
</header>
<footer class="post-info">
- <time class="published"
datetime="2025-12-04T00:00:00+00:00"> Thu 04 December 2025 </time>
+ <time class="published"
datetime="2025-12-15T00:00:00+00:00"> Mon 15 December 2025 </time>
<address class="vcard author">By
- <a class="url fn"
href="https://datafusion.apache.org/blog/author/pmc.html">pmc</a>
+ <a class="url fn"
href="https://datafusion.apache.org/blog/author/gene-bordegaray.html">Gene
Bordegaray</a>
</address>
</footer><!-- /.post-info -->
<div class="entry-content"> <!--
@@ -142,11 +143,10 @@ See the License for the specific language governing
permissions and
limitations under the License.
{% endcomment %}
-->
+<div style="display: flex; align-items: center; gap: 20px; margin-bottom:
20px;">
+<div style="flex: 1;">
-<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of the <a
href="https://datafusion.apache.org/comet/">Comet</a> subproject.</p>
-<p>Comet is an accelerator for Apache Spark that translates Spark physical
plans to DataFusion physical plans for
-improved performance and efficiency without requiring any code changes.</p>
-<p>This release covers approximately eight weeks of development …</p>
</div><!-- /.entry-content -->
+Databases are some of the most complex yet interesting pieces of software.
They are amazing pieces of abstraction: query engines optimize and execute
complex plans, storage engines provide sophisticated infrastructure as the
backbone of the system, while intricate file formats lay the groundwork for
particular workloads. All of this is …</div></div> </div><!-- /.entry-content
-->
</article></li>
<li><article class="hentry">
<header> <h2 class="entry-title"><a
href="https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.12.0"
rel="bookmark" title="Permalink to Apache DataFusion Comet 0.12.0
Release">Apache DataFusion Comet 0.12.0 Release</a></h2> </header>
diff --git a/output/feed.xml b/output/feed.xml
index b67aa23..2db1aff 100644
--- a/output/feed.xml
+++ b/output/feed.xml
@@ -1,5 +1,5 @@
<?xml version="1.0" encoding="utf-8"?>
-<rss version="2.0"><channel><title>Apache DataFusion
Blog</title><link>https://datafusion.apache.org/blog/</link><description></description><lastBuildDate>Mon,
12 Jan 2026 00:00:00 +0000</lastBuildDate><item><title>Apache DataFusion
52.0.0
Released</title><link>https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0</link><description><!--
+<rss version="2.0"><channel><title>Apache DataFusion
Blog</title><link>https://datafusion.apache.org/blog/</link><description></description><lastBuildDate>Fri,
30 Jan 2026 00:00:00 +0000</lastBuildDate><item><title>Apache DataFusion Comet
0.13.0
Release</title><link>https://datafusion.apache.org/blog/2026/01/30/datafusion-comet-0.13.0</link><description><!--
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
@@ -18,12 +18,10 @@ limitations under the License.
{% endcomment %}
-->
-<p>We are proud to announce the release of <a
href="https://crates.io/crates/datafusion/52.0.0">DataFusion
52.0.0</a>. This post highlights
-some of the major improvements since <a
href="https://datafusion.apache.org/blog/2025/11/25/datafusion-51.0.0/">DataFusion
51.0.0</a>. The complete list of
-changes is available in the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md">changelog</a>.
Thanks to the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md#credits">121
contributors</a> for
-making this release possible.</p>
-<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
-<p>We continue to …</p></description><dc:creator
xmlns:dc="http://purl.org/dc/elements/1.1/">pmc</dc:creator><pubDate>Mon, 12
Jan 2026 00:00:00 +0000</pubDate><guid
isPermaLink="false">tag:datafusion.apache.org,2026-01-12:/blog/2026/01/12/datafusion-52.0.0</guid><category>blog</category></item><item><title>Extending
SQL in DataFusion: from ->> to
TABLESAMPLE</title><link>https://datafusion.apache.org/blog/2026/01/12/extending-sql</link><description><!--
+<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
+<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
+improved performance and efficiency without requiring any code
changes.</p>
+<p>This release covers approximately eight weeks of development
…</p></description><dc:creator
xmlns:dc="http://purl.org/dc/elements/1.1/">pmc</dc:creator><pubDate>Fri, 30
Jan 2026 00:00:00 +0000</pubDate><guid
isPermaLink="false">tag:datafusion.apache.org,2026-01-30:/blog/2026/01/30/datafusion-comet-0.13.0</guid><category>blog</category></item><item><title>Apache
DataFusion 52.0.0
Released</title><link>https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0</link><de
[...]
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
@@ -42,8 +40,12 @@ limitations under the License.
{% endcomment %}
-->
-<p>If you embed <a
href="https://datafusion.apache.org/">DataFusion</a> in your product,
your users will eventually run SQL that DataFusion does not recognize. Not
because the query is unreasonable, but because SQL in practice includes many
dialects and system-specific statements.</p>
-<p>Suppose you store data as Parquet files on S3 and want users to
attach an …</p></description><dc:creator
xmlns:dc="http://purl.org/dc/elements/1.1/">Geoffrey Claude
(Datadog)</dc:creator><pubDate>Mon, 12 Jan 2026 00:00:00 +0000</pubDate><guid
isPermaLink="false">tag:datafusion.apache.org,2026-01-12:/blog/2026/01/12/extending-sql</guid><category>blog</category></item><item><title>Optimizing
Repartitions in DataFusion: How I Went From Database Noob to Core
Contribution</titl [...]
+<p>We are proud to announce the release of <a
href="https://crates.io/crates/datafusion/52.0.0">DataFusion
52.0.0</a>. This post highlights
+some of the major improvements since <a
href="https://datafusion.apache.org/blog/2025/11/25/datafusion-51.0.0/">DataFusion
51.0.0</a>. The complete list of
+changes is available in the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md">changelog</a>.
Thanks to the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md#credits">121
contributors</a> for
+making this release possible.</p>
+<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
+<p>We continue to …</p></description><dc:creator
xmlns:dc="http://purl.org/dc/elements/1.1/">pmc</dc:creator><pubDate>Mon, 12
Jan 2026 00:00:00 +0000</pubDate><guid
isPermaLink="false">tag:datafusion.apache.org,2026-01-12:/blog/2026/01/12/datafusion-52.0.0</guid><category>blog</category></item><item><title>Extending
SQL in DataFusion: from ->> to
TABLESAMPLE</title><link>https://datafusion.apache.org/blog/2026/01/12/extending-sql</link><description><!--
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
@@ -61,10 +63,9 @@ See the License for the specific language governing
permissions and
limitations under the License.
{% endcomment %}
-->
-<div style="display: flex; align-items: center; gap: 20px; margin-bottom:
20px;">
-<div style="flex: 1;">
-Databases are some of the most complex yet interesting pieces of software.
They are amazing pieces of abstraction: query engines optimize and execute
complex plans, storage engines provide sophisticated infrastructure as the
backbone of the system, while intricate file formats lay the groundwork for
particular workloads. All of this is
…</div></div></description><dc:creator
xmlns:dc="http://purl.org/dc/elements/1.1/">Gene
Bordegaray</dc:creator><pubDate>Mon, 15 Dec 2025 00:00 [...]
+<p>If you embed <a
href="https://datafusion.apache.org/">DataFusion</a> in your product,
your users will eventually run SQL that DataFusion does not recognize. Not
because the query is unreasonable, but because SQL in practice includes many
dialects and system-specific statements.</p>
+<p>Suppose you store data as Parquet files on S3 and want users to
attach an …</p></description><dc:creator
xmlns:dc="http://purl.org/dc/elements/1.1/">Geoffrey Claude
(Datadog)</dc:creator><pubDate>Mon, 12 Jan 2026 00:00:00 +0000</pubDate><guid
isPermaLink="false">tag:datafusion.apache.org,2026-01-12:/blog/2026/01/12/extending-sql</guid><category>blog</category></item><item><title>Optimizing
Repartitions in DataFusion: How I Went From Database Noob to Core
Contribution</titl [...]
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
@@ -82,11 +83,10 @@ See the License for the specific language governing
permissions and
limitations under the License.
{% endcomment %}
-->
+<div style="display: flex; align-items: center; gap: 20px; margin-bottom:
20px;">
+<div style="flex: 1;">
-<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
-<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
-improved performance and efficiency without requiring any code
changes.</p>
-<p>This release covers approximately eight weeks of development
…</p></description><dc:creator
xmlns:dc="http://purl.org/dc/elements/1.1/">pmc</dc:creator><pubDate>Thu, 04
Dec 2025 00:00:00 +0000</pubDate><guid
isPermaLink="false">tag:datafusion.apache.org,2025-12-04:/blog/2025/12/04/datafusion-comet-0.13.0</guid><category>blog</category></item><item><title>Apache
DataFusion Comet 0.12.0
Release</title><link>https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.12.
[...]
+Databases are some of the most complex yet interesting pieces of software.
They are amazing pieces of abstraction: query engines optimize and execute
complex plans, storage engines provide sophisticated infrastructure as the
backbone of the system, while intricate file formats lay the groundwork for
particular workloads. All of this is
…</div></div></description><dc:creator
xmlns:dc="http://purl.org/dc/elements/1.1/">Gene
Bordegaray</dc:creator><pubDate>Mon, 15 Dec 2025 00:00 [...]
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
diff --git a/output/feeds/all-en.atom.xml b/output/feeds/all-en.atom.xml
index 7c1829b..a6e2fc0 100644
--- a/output/feeds/all-en.atom.xml
+++ b/output/feeds/all-en.atom.xml
@@ -1,5 +1,117 @@
<?xml version="1.0" encoding="utf-8"?>
-<feed xmlns="http://www.w3.org/2005/Atom"><title>Apache DataFusion
Blog</title><link href="https://datafusion.apache.org/blog/"
rel="alternate"></link><link
href="https://datafusion.apache.org/blog/feeds/all-en.atom.xml"
rel="self"></link><id>https://datafusion.apache.org/blog/</id><updated>2026-01-12T00:00:00+00:00</updated><subtitle></subtitle><entry><title>Apache
DataFusion 52.0.0 Released</title><link
href="https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0"
rel="alterna [...]
+<feed xmlns="http://www.w3.org/2005/Atom"><title>Apache DataFusion
Blog</title><link href="https://datafusion.apache.org/blog/"
rel="alternate"></link><link
href="https://datafusion.apache.org/blog/feeds/all-en.atom.xml"
rel="self"></link><id>https://datafusion.apache.org/blog/</id><updated>2026-01-30T00:00:00+00:00</updated><subtitle></subtitle><entry><title>Apache
DataFusion Comet 0.13.0 Release</title><link
href="https://datafusion.apache.org/blog/2026/01/30/datafusion-comet-0.13.0" r
[...]
+{% comment %}
+Licensed to the Apache Software Foundation (ASF) under one or more
+contributor license agreements. See the NOTICE file distributed with
+this work for additional information regarding copyright ownership.
+The ASF licenses this file to you under the Apache License, Version 2.0
+(the "License"); you may not use this file except in compliance with
+the License. You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+{% endcomment %}
+-->
+
+<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
+<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
+improved performance and efficiency without requiring any code
changes.</p>
+<p>This release covers approximately eight weeks of development
…</p></summary><content type="html"><!--
+{% comment %}
+Licensed to the Apache Software Foundation (ASF) under one or more
+contributor license agreements. See the NOTICE file distributed with
+this work for additional information regarding copyright ownership.
+The ASF licenses this file to you under the Apache License, Version 2.0
+(the "License"); you may not use this file except in compliance with
+the License. You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+{% endcomment %}
+-->
+
+<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
+<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
+improved performance and efficiency without requiring any code
changes.</p>
+<p>This release covers approximately eight weeks of development work and
is the result of merging 160 PRs from 15
+contributors. See the <a
href="https://github.com/apache/datafusion-comet/blob/main/dev/changelog/0.13.0.md">change
log</a> for more information.</p>
+<h2 id="key-features">Key Features<a class="headerlink"
href="#key-features" title="Permanent link">¶</a></h2>
+<h3 id="native-parquet-write-support-experimental">Native Parquet Write
Support (Experimental)<a class="headerlink"
href="#native-parquet-write-support-experimental" title="Permanent
link">¶</a></h3>
+<p>This release introduces experimental native Parquet write
capabilities, allowing Comet to intercept and execute Parquet write operations
natively through DataFusion. Key capabilities include:</p>
+<ul>
+<li>File commit protocol support for reliable writes</li>
+<li>Remote HDFS writing via OpenDAL integration</li>
+<li>Complex type support (arrays, maps, structs)</li>
+<li>Proper handling of object store settings</li>
+</ul>
+<p>To enable native Parquet writes, set:</p>
+<pre><code>spark.comet.allowIncompatibleOp.DataWritingCommandExec=true
+spark.comet.parquet.write.enabled=true
+</code></pre>
+<p><strong>Note</strong>: This feature is highly
experimental and should not be used in production environments. It is currently
categorized as a testing feature and is disabled by default.</p>
+<h3 id="native-iceberg-improvements">Native Iceberg Improvements<a
class="headerlink" href="#native-iceberg-improvements" title="Permanent
link">¶</a></h3>
+<p>Comet's fully-native Iceberg integration received significant
enhancements in this release:</p>
+<p><strong>REST Catalog Support</strong>: Native Iceberg
scans now support REST catalogs, enabling integration with catalog services
like Apache Polaris and Tabular. Configure with:</p>
+<pre><code class="language-shell">--conf
spark.sql.catalog.rest_cat=org.apache.iceberg.spark.SparkCatalog
+--conf
spark.sql.catalog.rest_cat.catalog-impl=org.apache.iceberg.rest.RESTCatalog
+--conf spark.sql.catalog.rest_cat.uri=http://localhost:8181
+--conf spark.comet.scan.icebergNative.enabled=true
+</code></pre>
+<p><strong>Session Token Authentication</strong>: Added
support for session tokens in native Iceberg scans for secure S3
access.</p>
+<p><strong>Performance Optimizations</strong>:
+- Deduplicated serialized metadata reducing memory overhead
+- Switched from JSON to protobuf for partition value serialization
+- Removed IcebergFileStream in favor of iceberg-rust's built-in parallelization
+- Reduced metadata serialization points
+- Added SchemaAdapter caching</p>
+<p>To enable fully-native Iceberg scanning:</p>
+<pre><code>spark.comet.scan.icebergNative.enabled=true
+</code></pre>
+<p>The native reader supports Iceberg table spec v1 and v2, all
primitive and complex types, schema evolution, time travel, positional and
equality deletes, filter pushdown, and various storage backends (local, HDFS,
S3).</p>
+<h3 id="native-csv-reading-experimental">Native CSV Reading
(Experimental)<a class="headerlink" href="#native-csv-reading-experimental"
title="Permanent link">¶</a></h3>
+<p>Experimental support for native CSV file reading has been added,
expanding Comet's file format capabilities beyond Parquet.</p>
+<h3 id="new-expressions">New Expressions<a class="headerlink"
href="#new-expressions" title="Permanent link">¶</a></h3>
+<p>The release adds support for numerous expressions:
+- Array functions: <code>explode</code>,
<code>explode_outer</code>, <code>size</code>
+- Date/time functions: <code>unix_date</code>,
<code>date_format</code>, <code>datediff</code>,
<code>last_day</code>, <code>unix_timestamp</code>
+- String functions: <code>left</code>
+- JSON functions: <code>from_json</code> (partial
support)</p>
+<h3 id="ansi-mode-support">ANSI Mode Support<a class="headerlink"
href="#ansi-mode-support" title="Permanent link">¶</a></h3>
+<p>Sum and average aggregate expressions now support ANSI mode for both
integer and decimal inputs, enabling overflow checking in strict SQL
mode.</p>
+<h3 id="native-shuffle-improvements">Native Shuffle Improvements<a
class="headerlink" href="#native-shuffle-improvements" title="Permanent
link">¶</a></h3>
+<ul>
+<li>Round-robin partitioning is now supported in native
shuffle</li>
+<li>Spill metrics are now reported correctly</li>
+<li>Configurable shuffle writer buffer size via
<code>spark.comet.shuffle.write.bufferSize</code></li>
+</ul>
+<h2 id="performance-improvements">Performance Improvements<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
+<p>This release includes extensive performance optimizations:</p>
+<ul>
+<li><strong>String to integer casting</strong>: Significant
speedups through optimized parsing</li>
+<li><strong>String functions</strong>: Optimized
<code>lpad</code>/<code>rpad</code> to remove
unnecessary memory allocations</li>
+<li><strong>Date operations</strong>: Improved
<code>normalize_nan</code> and date truncate performance</li>
+<li><strong>Query planning</strong>: Cached query plans to
avoid per-partition serialization overhead</li>
+<li><strong>Memory efficiency</strong>: Reduced GC pressure
in protobuf serialization</li>
+<li><strong>Hash operations</strong>: Optimized complex-type
hash implementations including murmur3 support for nested types</li>
+<li><strong>Runtime efficiency</strong>: Eliminated
busy-polling of Tokio stream for plans without CometScan</li>
+<li><strong>Metrics overhead</strong>: Reduced timer and
syscall overhead in native shuffle writer</li>
+</ul>
+<h2 id="deprecations">Deprecations<a class="headerlink"
href="#deprecations" title="Permanent link">¶</a></h2>
+<p>The <code>native_comet</code> scan mode is now deprecated
in favor of <code>native_iceberg_compat</code> and will be removed
in a future release. The <code>auto</code> scan mode no longer
falls back to <code>native_comet</code>.</p>
+<h2 id="compatibility">Compatibility<a class="headerlink"
href="#compatibility" title="Permanent link">¶</a></h2>
+<p>This release upgrades to DataFusion 51, Arrow 57, and the latest
iceberg-rust. The minimum supported Rust version is now 1.88.</p>
+<p>Supported platforms include Spark 3.4.3, 3.5.4-3.5.7, and Spark 4.0.x
with various JDK and Scala combinations.</p>
+<p>The community encourages users to test Comet with existing Spark
workloads and welcomes contributions to ongoing
development.</p></content><category
term="blog"></category></entry><entry><title>Apache DataFusion 52.0.0
Released</title><link
href="https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0"
rel="alternate"></link><published>2026-01-12T00:00:00+00:00</published><updated>2026-01-12T00:00:00+00:00</updated><author><name>pmc</name></author><id>tag:datafusio
[...]
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
@@ -787,119 +899,7 @@ Hash repartitioning is useful when working with grouped
data. Imagine you have a
<p>I hope you gained something from my experience and have fun learning
about databases.</p>
<hr/>
<h2
id="acknowledgements"><strong>Acknowledgements</strong><a
class="headerlink" href="#acknowledgements" title="Permanent
link">&para;</a></h2>
-<p>Thank you to <a href="https://github.com/NGA-TRAN">Nga
Tran</a> for continuous mentorship and guidance, the DataFusion
community, specifically <a href="https://github.com/alamb">Andrew
Lamb</a>, for lending me support throughout my work, and Datadog for
providing the opportunity to work on such interesting
systems.</p></content><category
term="blog"></category></entry><entry><title>Apache DataFusion Comet 0.12.0
Release</title><link href="https://da [...]
-{% comment %}
-Licensed to the Apache Software Foundation (ASF) under one or more
-contributor license agreements. See the NOTICE file distributed with
-this work for additional information regarding copyright ownership.
-The ASF licenses this file to you under the Apache License, Version 2.0
-(the "License"); you may not use this file except in compliance with
-the License. You may obtain a copy of the License at
-
-http://www.apache.org/licenses/LICENSE-2.0
-
-Unless required by applicable law or agreed to in writing, software
-distributed under the License is distributed on an "AS IS" BASIS,
-WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-See the License for the specific language governing permissions and
-limitations under the License.
-{% endcomment %}
--->
-
-<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
-<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
-improved performance and efficiency without requiring any code
changes.</p>
-<p>This release covers approximately eight weeks of development
…</p></summary><content type="html"><!--
-{% comment %}
-Licensed to the Apache Software Foundation (ASF) under one or more
-contributor license agreements. See the NOTICE file distributed with
-this work for additional information regarding copyright ownership.
-The ASF licenses this file to you under the Apache License, Version 2.0
-(the "License"); you may not use this file except in compliance with
-the License. You may obtain a copy of the License at
-
-http://www.apache.org/licenses/LICENSE-2.0
-
-Unless required by applicable law or agreed to in writing, software
-distributed under the License is distributed on an "AS IS" BASIS,
-WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-See the License for the specific language governing permissions and
-limitations under the License.
-{% endcomment %}
--->
-
-<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
-<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
-improved performance and efficiency without requiring any code
changes.</p>
-<p>This release covers approximately eight weeks of development work and
is the result of merging 160 PRs from 15
-contributors. See the <a
href="https://github.com/apache/datafusion-comet/blob/main/dev/changelog/0.13.0.md">change
log</a> for more information.</p>
-<h2 id="key-features">Key Features<a class="headerlink"
href="#key-features" title="Permanent link">¶</a></h2>
-<h3 id="native-parquet-write-support-experimental">Native Parquet Write
Support (Experimental)<a class="headerlink"
href="#native-parquet-write-support-experimental" title="Permanent
link">¶</a></h3>
-<p>This release introduces experimental native Parquet write
capabilities, allowing Comet to intercept and execute Parquet write operations
natively through DataFusion. Key capabilities include:</p>
-<ul>
-<li>File commit protocol support for reliable writes</li>
-<li>Remote HDFS writing via OpenDAL integration</li>
-<li>Complex type support (arrays, maps, structs)</li>
-<li>Proper handling of object store settings</li>
-</ul>
-<p>To enable native Parquet writes, set:</p>
-<pre><code>spark.comet.allowIncompatibleOp.DataWritingCommandExec=true
-spark.comet.parquet.write.enabled=true
-</code></pre>
-<p><strong>Note</strong>: This feature is highly
experimental and should not be used in production environments. It is currently
categorized as a testing feature and is disabled by default.</p>
-<h3 id="native-iceberg-improvements">Native Iceberg Improvements<a
class="headerlink" href="#native-iceberg-improvements" title="Permanent
link">¶</a></h3>
-<p>Comet's fully-native Iceberg integration received significant
enhancements in this release:</p>
-<p><strong>REST Catalog Support</strong>: Native Iceberg
scans now support REST catalogs, enabling integration with catalog services
like Apache Polaris and Tabular. Configure with:</p>
-<pre><code class="language-shell">--conf
spark.sql.catalog.rest_cat=org.apache.iceberg.spark.SparkCatalog
---conf
spark.sql.catalog.rest_cat.catalog-impl=org.apache.iceberg.rest.RESTCatalog
---conf spark.sql.catalog.rest_cat.uri=http://localhost:8181
---conf spark.comet.scan.icebergNative.enabled=true
-</code></pre>
-<p><strong>Session Token Authentication</strong>: Added
support for session tokens in native Iceberg scans for secure S3
access.</p>
-<p><strong>Performance Optimizations</strong>:
-- Deduplicated serialized metadata reducing memory overhead
-- Switched from JSON to protobuf for partition value serialization
-- Removed IcebergFileStream in favor of iceberg-rust's built-in parallelization
-- Reduced metadata serialization points
-- Added SchemaAdapter caching</p>
-<p>To enable fully-native Iceberg scanning:</p>
-<pre><code>spark.comet.scan.icebergNative.enabled=true
-</code></pre>
-<p>The native reader supports Iceberg table spec v1 and v2, all
primitive and complex types, schema evolution, time travel, positional and
equality deletes, filter pushdown, and various storage backends (local, HDFS,
S3).</p>
-<h3 id="native-csv-reading-experimental">Native CSV Reading
(Experimental)<a class="headerlink" href="#native-csv-reading-experimental"
title="Permanent link">¶</a></h3>
-<p>Experimental support for native CSV file reading has been added,
expanding Comet's file format capabilities beyond Parquet.</p>
-<h3 id="new-expressions">New Expressions<a class="headerlink"
href="#new-expressions" title="Permanent link">¶</a></h3>
-<p>The release adds support for numerous expressions:
-- Array functions: <code>explode</code>,
<code>explode_outer</code>, <code>size</code>
-- Date/time functions: <code>unix_date</code>,
<code>date_format</code>, <code>datediff</code>,
<code>last_day</code>, <code>unix_timestamp</code>
-- String functions: <code>left</code>
-- JSON functions: <code>from_json</code> (partial
support)</p>
-<h3 id="ansi-mode-support">ANSI Mode Support<a class="headerlink"
href="#ansi-mode-support" title="Permanent link">¶</a></h3>
-<p>Sum and average aggregate expressions now support ANSI mode for both
integer and decimal inputs, enabling overflow checking in strict SQL
mode.</p>
-<h3 id="native-shuffle-improvements">Native Shuffle Improvements<a
class="headerlink" href="#native-shuffle-improvements" title="Permanent
link">¶</a></h3>
-<ul>
-<li>Round-robin partitioning is now supported in native
shuffle</li>
-<li>Spill metrics are now reported correctly</li>
-<li>Configurable shuffle writer buffer size via
<code>spark.comet.shuffle.write.bufferSize</code></li>
-</ul>
-<h2 id="performance-improvements">Performance Improvements<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
-<p>This release includes extensive performance optimizations:</p>
-<ul>
-<li><strong>String to integer casting</strong>: Significant
speedups through optimized parsing</li>
-<li><strong>String functions</strong>: Optimized
<code>lpad</code>/<code>rpad</code> to remove
unnecessary memory allocations</li>
-<li><strong>Date operations</strong>: Improved
<code>normalize_nan</code> and date truncate performance</li>
-<li><strong>Query planning</strong>: Cached query plans to
avoid per-partition serialization overhead</li>
-<li><strong>Memory efficiency</strong>: Reduced GC pressure
in protobuf serialization</li>
-<li><strong>Hash operations</strong>: Optimized complex-type
hash implementations including murmur3 support for nested types</li>
-<li><strong>Runtime efficiency</strong>: Eliminated
busy-polling of Tokio stream for plans without CometScan</li>
-<li><strong>Metrics overhead</strong>: Reduced timer and
syscall overhead in native shuffle writer</li>
-</ul>
-<h2 id="deprecations">Deprecations<a class="headerlink"
href="#deprecations" title="Permanent link">¶</a></h2>
-<p>The <code>native_comet</code> scan mode is now deprecated
in favor of <code>native_iceberg_compat</code> and will be removed
in a future release. The <code>auto</code> scan mode no longer
falls back to <code>native_comet</code>.</p>
-<h2 id="compatibility">Compatibility<a class="headerlink"
href="#compatibility" title="Permanent link">¶</a></h2>
-<p>This release upgrades to DataFusion 51, Arrow 57, and the latest
iceberg-rust. The minimum supported Rust version is now 1.88.</p>
-<p>Supported platforms include Spark 3.4.3, 3.5.4-3.5.7, and Spark 4.0.x
with various JDK and Scala combinations.</p>
-<p>The community encourages users to test Comet with existing Spark
workloads and welcomes contributions to ongoing
development.</p></content><category
term="blog"></category></entry><entry><title>Apache DataFusion Comet 0.12.0
Release</title><link
href="https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.12.0"
rel="alternate"></link><published>2025-12-04T00:00:00+00:00</published><updated>2025-12-04T00:00:00+00:00</updated><author><name>pmc</name></author><id>ta
[...]
+<p>Thank you to <a href="https://github.com/NGA-TRAN">Nga
Tran</a> for continuous mentorship and guidance, the DataFusion
community, specifically <a href="https://github.com/alamb">Andrew
Lamb</a>, for lending me support throughout my work, and Datadog for
providing the opportunity to work on such interesting
systems.</p></content><category
term="blog"></category></entry><entry><title>Apache DataFusion Comet 0.12.0
Release</title><link href="https://da [...]
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
diff --git a/output/feeds/blog.atom.xml b/output/feeds/blog.atom.xml
index 35d4607..250a860 100644
--- a/output/feeds/blog.atom.xml
+++ b/output/feeds/blog.atom.xml
@@ -1,5 +1,117 @@
<?xml version="1.0" encoding="utf-8"?>
-<feed xmlns="http://www.w3.org/2005/Atom"><title>Apache DataFusion Blog -
blog</title><link href="https://datafusion.apache.org/blog/"
rel="alternate"></link><link
href="https://datafusion.apache.org/blog/feeds/blog.atom.xml"
rel="self"></link><id>https://datafusion.apache.org/blog/</id><updated>2026-01-12T00:00:00+00:00</updated><subtitle></subtitle><entry><title>Apache
DataFusion 52.0.0 Released</title><link
href="https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0" rel="al
[...]
+<feed xmlns="http://www.w3.org/2005/Atom"><title>Apache DataFusion Blog -
blog</title><link href="https://datafusion.apache.org/blog/"
rel="alternate"></link><link
href="https://datafusion.apache.org/blog/feeds/blog.atom.xml"
rel="self"></link><id>https://datafusion.apache.org/blog/</id><updated>2026-01-30T00:00:00+00:00</updated><subtitle></subtitle><entry><title>Apache
DataFusion Comet 0.13.0 Release</title><link
href="https://datafusion.apache.org/blog/2026/01/30/datafusion-comet-0.13 [...]
+{% comment %}
+Licensed to the Apache Software Foundation (ASF) under one or more
+contributor license agreements. See the NOTICE file distributed with
+this work for additional information regarding copyright ownership.
+The ASF licenses this file to you under the Apache License, Version 2.0
+(the "License"); you may not use this file except in compliance with
+the License. You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+{% endcomment %}
+-->
+
+<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
+<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
+improved performance and efficiency without requiring any code
changes.</p>
+<p>This release covers approximately eight weeks of development
…</p></summary><content type="html"><!--
+{% comment %}
+Licensed to the Apache Software Foundation (ASF) under one or more
+contributor license agreements. See the NOTICE file distributed with
+this work for additional information regarding copyright ownership.
+The ASF licenses this file to you under the Apache License, Version 2.0
+(the "License"); you may not use this file except in compliance with
+the License. You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+{% endcomment %}
+-->
+
+<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
+<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
+improved performance and efficiency without requiring any code
changes.</p>
+<p>This release covers approximately eight weeks of development work and
is the result of merging 160 PRs from 15
+contributors. See the <a
href="https://github.com/apache/datafusion-comet/blob/main/dev/changelog/0.13.0.md">change
log</a> for more information.</p>
+<h2 id="key-features">Key Features<a class="headerlink"
href="#key-features" title="Permanent link">¶</a></h2>
+<h3 id="native-parquet-write-support-experimental">Native Parquet Write
Support (Experimental)<a class="headerlink"
href="#native-parquet-write-support-experimental" title="Permanent
link">¶</a></h3>
+<p>This release introduces experimental native Parquet write
capabilities, allowing Comet to intercept and execute Parquet write operations
natively through DataFusion. Key capabilities include:</p>
+<ul>
+<li>File commit protocol support for reliable writes</li>
+<li>Remote HDFS writing via OpenDAL integration</li>
+<li>Complex type support (arrays, maps, structs)</li>
+<li>Proper handling of object store settings</li>
+</ul>
+<p>To enable native Parquet writes, set:</p>
+<pre><code>spark.comet.allowIncompatibleOp.DataWritingCommandExec=true
+spark.comet.parquet.write.enabled=true
+</code></pre>
+<p><strong>Note</strong>: This feature is highly
experimental and should not be used in production environments. It is currently
categorized as a testing feature and is disabled by default.</p>
+<h3 id="native-iceberg-improvements">Native Iceberg Improvements<a
class="headerlink" href="#native-iceberg-improvements" title="Permanent
link">¶</a></h3>
+<p>Comet's fully-native Iceberg integration received significant
enhancements in this release:</p>
+<p><strong>REST Catalog Support</strong>: Native Iceberg
scans now support REST catalogs, enabling integration with catalog services
like Apache Polaris and Tabular. Configure with:</p>
+<pre><code class="language-shell">--conf
spark.sql.catalog.rest_cat=org.apache.iceberg.spark.SparkCatalog
+--conf
spark.sql.catalog.rest_cat.catalog-impl=org.apache.iceberg.rest.RESTCatalog
+--conf spark.sql.catalog.rest_cat.uri=http://localhost:8181
+--conf spark.comet.scan.icebergNative.enabled=true
+</code></pre>
+<p><strong>Session Token Authentication</strong>: Added
support for session tokens in native Iceberg scans for secure S3
access.</p>
+<p><strong>Performance Optimizations</strong>:
+- Deduplicated serialized metadata reducing memory overhead
+- Switched from JSON to protobuf for partition value serialization
+- Removed IcebergFileStream in favor of iceberg-rust's built-in parallelization
+- Reduced metadata serialization points
+- Added SchemaAdapter caching</p>
+<p>To enable fully-native Iceberg scanning:</p>
+<pre><code>spark.comet.scan.icebergNative.enabled=true
+</code></pre>
+<p>The native reader supports Iceberg table spec v1 and v2, all
primitive and complex types, schema evolution, time travel, positional and
equality deletes, filter pushdown, and various storage backends (local, HDFS,
S3).</p>
+<h3 id="native-csv-reading-experimental">Native CSV Reading
(Experimental)<a class="headerlink" href="#native-csv-reading-experimental"
title="Permanent link">¶</a></h3>
+<p>Experimental support for native CSV file reading has been added,
expanding Comet's file format capabilities beyond Parquet.</p>
+<h3 id="new-expressions">New Expressions<a class="headerlink"
href="#new-expressions" title="Permanent link">¶</a></h3>
+<p>The release adds support for numerous expressions:
+- Array functions: <code>explode</code>,
<code>explode_outer</code>, <code>size</code>
+- Date/time functions: <code>unix_date</code>,
<code>date_format</code>, <code>datediff</code>,
<code>last_day</code>, <code>unix_timestamp</code>
+- String functions: <code>left</code>
+- JSON functions: <code>from_json</code> (partial
support)</p>
+<h3 id="ansi-mode-support">ANSI Mode Support<a class="headerlink"
href="#ansi-mode-support" title="Permanent link">¶</a></h3>
+<p>Sum and average aggregate expressions now support ANSI mode for both
integer and decimal inputs, enabling overflow checking in strict SQL
mode.</p>
+<h3 id="native-shuffle-improvements">Native Shuffle Improvements<a
class="headerlink" href="#native-shuffle-improvements" title="Permanent
link">¶</a></h3>
+<ul>
+<li>Round-robin partitioning is now supported in native
shuffle</li>
+<li>Spill metrics are now reported correctly</li>
+<li>Configurable shuffle writer buffer size via
<code>spark.comet.shuffle.write.bufferSize</code></li>
+</ul>
+<h2 id="performance-improvements">Performance Improvements<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
+<p>This release includes extensive performance optimizations:</p>
+<ul>
+<li><strong>String to integer casting</strong>: Significant
speedups through optimized parsing</li>
+<li><strong>String functions</strong>: Optimized
<code>lpad</code>/<code>rpad</code> to remove
unnecessary memory allocations</li>
+<li><strong>Date operations</strong>: Improved
<code>normalize_nan</code> and date truncate performance</li>
+<li><strong>Query planning</strong>: Cached query plans to
avoid per-partition serialization overhead</li>
+<li><strong>Memory efficiency</strong>: Reduced GC pressure
in protobuf serialization</li>
+<li><strong>Hash operations</strong>: Optimized complex-type
hash implementations including murmur3 support for nested types</li>
+<li><strong>Runtime efficiency</strong>: Eliminated
busy-polling of Tokio stream for plans without CometScan</li>
+<li><strong>Metrics overhead</strong>: Reduced timer and
syscall overhead in native shuffle writer</li>
+</ul>
+<h2 id="deprecations">Deprecations<a class="headerlink"
href="#deprecations" title="Permanent link">¶</a></h2>
+<p>The <code>native_comet</code> scan mode is now deprecated
in favor of <code>native_iceberg_compat</code> and will be removed
in a future release. The <code>auto</code> scan mode no longer
falls back to <code>native_comet</code>.</p>
+<h2 id="compatibility">Compatibility<a class="headerlink"
href="#compatibility" title="Permanent link">¶</a></h2>
+<p>This release upgrades to DataFusion 51, Arrow 57, and the latest
iceberg-rust. The minimum supported Rust version is now 1.88.</p>
+<p>Supported platforms include Spark 3.4.3, 3.5.4-3.5.7, and Spark 4.0.x
with various JDK and Scala combinations.</p>
+<p>The community encourages users to test Comet with existing Spark
workloads and welcomes contributions to ongoing
development.</p></content><category
term="blog"></category></entry><entry><title>Apache DataFusion 52.0.0
Released</title><link
href="https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0"
rel="alternate"></link><published>2026-01-12T00:00:00+00:00</published><updated>2026-01-12T00:00:00+00:00</updated><author><name>pmc</name></author><id>tag:datafusio
[...]
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
@@ -787,119 +899,7 @@ Hash repartitioning is useful when working with grouped
data. Imagine you have a
<p>I hope you gained something from my experience and have fun learning
about databases.</p>
<hr/>
<h2
id="acknowledgements"><strong>Acknowledgements</strong><a
class="headerlink" href="#acknowledgements" title="Permanent
link">&para;</a></h2>
-<p>Thank you to <a href="https://github.com/NGA-TRAN">Nga
Tran</a> for continuous mentorship and guidance, the DataFusion
community, specifically <a href="https://github.com/alamb">Andrew
Lamb</a>, for lending me support throughout my work, and Datadog for
providing the opportunity to work on such interesting
systems.</p></content><category
term="blog"></category></entry><entry><title>Apache DataFusion Comet 0.12.0
Release</title><link href="https://da [...]
-{% comment %}
-Licensed to the Apache Software Foundation (ASF) under one or more
-contributor license agreements. See the NOTICE file distributed with
-this work for additional information regarding copyright ownership.
-The ASF licenses this file to you under the Apache License, Version 2.0
-(the "License"); you may not use this file except in compliance with
-the License. You may obtain a copy of the License at
-
-http://www.apache.org/licenses/LICENSE-2.0
-
-Unless required by applicable law or agreed to in writing, software
-distributed under the License is distributed on an "AS IS" BASIS,
-WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-See the License for the specific language governing permissions and
-limitations under the License.
-{% endcomment %}
--->
-
-<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
-<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
-improved performance and efficiency without requiring any code
changes.</p>
-<p>This release covers approximately eight weeks of development
…</p></summary><content type="html"><!--
-{% comment %}
-Licensed to the Apache Software Foundation (ASF) under one or more
-contributor license agreements. See the NOTICE file distributed with
-this work for additional information regarding copyright ownership.
-The ASF licenses this file to you under the Apache License, Version 2.0
-(the "License"); you may not use this file except in compliance with
-the License. You may obtain a copy of the License at
-
-http://www.apache.org/licenses/LICENSE-2.0
-
-Unless required by applicable law or agreed to in writing, software
-distributed under the License is distributed on an "AS IS" BASIS,
-WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-See the License for the specific language governing permissions and
-limitations under the License.
-{% endcomment %}
--->
-
-<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
-<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
-improved performance and efficiency without requiring any code
changes.</p>
-<p>This release covers approximately eight weeks of development work and
is the result of merging 160 PRs from 15
-contributors. See the <a
href="https://github.com/apache/datafusion-comet/blob/main/dev/changelog/0.13.0.md">change
log</a> for more information.</p>
-<h2 id="key-features">Key Features<a class="headerlink"
href="#key-features" title="Permanent link">¶</a></h2>
-<h3 id="native-parquet-write-support-experimental">Native Parquet Write
Support (Experimental)<a class="headerlink"
href="#native-parquet-write-support-experimental" title="Permanent
link">¶</a></h3>
-<p>This release introduces experimental native Parquet write
capabilities, allowing Comet to intercept and execute Parquet write operations
natively through DataFusion. Key capabilities include:</p>
-<ul>
-<li>File commit protocol support for reliable writes</li>
-<li>Remote HDFS writing via OpenDAL integration</li>
-<li>Complex type support (arrays, maps, structs)</li>
-<li>Proper handling of object store settings</li>
-</ul>
-<p>To enable native Parquet writes, set:</p>
-<pre><code>spark.comet.allowIncompatibleOp.DataWritingCommandExec=true
-spark.comet.parquet.write.enabled=true
-</code></pre>
-<p><strong>Note</strong>: This feature is highly
experimental and should not be used in production environments. It is currently
categorized as a testing feature and is disabled by default.</p>
-<h3 id="native-iceberg-improvements">Native Iceberg Improvements<a
class="headerlink" href="#native-iceberg-improvements" title="Permanent
link">¶</a></h3>
-<p>Comet's fully-native Iceberg integration received significant
enhancements in this release:</p>
-<p><strong>REST Catalog Support</strong>: Native Iceberg
scans now support REST catalogs, enabling integration with catalog services
like Apache Polaris and Tabular. Configure with:</p>
-<pre><code class="language-shell">--conf
spark.sql.catalog.rest_cat=org.apache.iceberg.spark.SparkCatalog
---conf
spark.sql.catalog.rest_cat.catalog-impl=org.apache.iceberg.rest.RESTCatalog
---conf spark.sql.catalog.rest_cat.uri=http://localhost:8181
---conf spark.comet.scan.icebergNative.enabled=true
-</code></pre>
-<p><strong>Session Token Authentication</strong>: Added
support for session tokens in native Iceberg scans for secure S3
access.</p>
-<p><strong>Performance Optimizations</strong>:
-- Deduplicated serialized metadata reducing memory overhead
-- Switched from JSON to protobuf for partition value serialization
-- Removed IcebergFileStream in favor of iceberg-rust's built-in parallelization
-- Reduced metadata serialization points
-- Added SchemaAdapter caching</p>
-<p>To enable fully-native Iceberg scanning:</p>
-<pre><code>spark.comet.scan.icebergNative.enabled=true
-</code></pre>
-<p>The native reader supports Iceberg table spec v1 and v2, all
primitive and complex types, schema evolution, time travel, positional and
equality deletes, filter pushdown, and various storage backends (local, HDFS,
S3).</p>
-<h3 id="native-csv-reading-experimental">Native CSV Reading
(Experimental)<a class="headerlink" href="#native-csv-reading-experimental"
title="Permanent link">¶</a></h3>
-<p>Experimental support for native CSV file reading has been added,
expanding Comet's file format capabilities beyond Parquet.</p>
-<h3 id="new-expressions">New Expressions<a class="headerlink"
href="#new-expressions" title="Permanent link">¶</a></h3>
-<p>The release adds support for numerous expressions:
-- Array functions: <code>explode</code>,
<code>explode_outer</code>, <code>size</code>
-- Date/time functions: <code>unix_date</code>,
<code>date_format</code>, <code>datediff</code>,
<code>last_day</code>, <code>unix_timestamp</code>
-- String functions: <code>left</code>
-- JSON functions: <code>from_json</code> (partial
support)</p>
-<h3 id="ansi-mode-support">ANSI Mode Support<a class="headerlink"
href="#ansi-mode-support" title="Permanent link">¶</a></h3>
-<p>Sum and average aggregate expressions now support ANSI mode for both
integer and decimal inputs, enabling overflow checking in strict SQL
mode.</p>
-<h3 id="native-shuffle-improvements">Native Shuffle Improvements<a
class="headerlink" href="#native-shuffle-improvements" title="Permanent
link">¶</a></h3>
-<ul>
-<li>Round-robin partitioning is now supported in native
shuffle</li>
-<li>Spill metrics are now reported correctly</li>
-<li>Configurable shuffle writer buffer size via
<code>spark.comet.shuffle.write.bufferSize</code></li>
-</ul>
-<h2 id="performance-improvements">Performance Improvements<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
-<p>This release includes extensive performance optimizations:</p>
-<ul>
-<li><strong>String to integer casting</strong>: Significant
speedups through optimized parsing</li>
-<li><strong>String functions</strong>: Optimized
<code>lpad</code>/<code>rpad</code> to remove
unnecessary memory allocations</li>
-<li><strong>Date operations</strong>: Improved
<code>normalize_nan</code> and date truncate performance</li>
-<li><strong>Query planning</strong>: Cached query plans to
avoid per-partition serialization overhead</li>
-<li><strong>Memory efficiency</strong>: Reduced GC pressure
in protobuf serialization</li>
-<li><strong>Hash operations</strong>: Optimized complex-type
hash implementations including murmur3 support for nested types</li>
-<li><strong>Runtime efficiency</strong>: Eliminated
busy-polling of Tokio stream for plans without CometScan</li>
-<li><strong>Metrics overhead</strong>: Reduced timer and
syscall overhead in native shuffle writer</li>
-</ul>
-<h2 id="deprecations">Deprecations<a class="headerlink"
href="#deprecations" title="Permanent link">¶</a></h2>
-<p>The <code>native_comet</code> scan mode is now deprecated
in favor of <code>native_iceberg_compat</code> and will be removed
in a future release. The <code>auto</code> scan mode no longer
falls back to <code>native_comet</code>.</p>
-<h2 id="compatibility">Compatibility<a class="headerlink"
href="#compatibility" title="Permanent link">¶</a></h2>
-<p>This release upgrades to DataFusion 51, Arrow 57, and the latest
iceberg-rust. The minimum supported Rust version is now 1.88.</p>
-<p>Supported platforms include Spark 3.4.3, 3.5.4-3.5.7, and Spark 4.0.x
with various JDK and Scala combinations.</p>
-<p>The community encourages users to test Comet with existing Spark
workloads and welcomes contributions to ongoing
development.</p></content><category
term="blog"></category></entry><entry><title>Apache DataFusion Comet 0.12.0
Release</title><link
href="https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.12.0"
rel="alternate"></link><published>2025-12-04T00:00:00+00:00</published><updated>2025-12-04T00:00:00+00:00</updated><author><name>pmc</name></author><id>ta
[...]
+<p>Thank you to <a href="https://github.com/NGA-TRAN">Nga
Tran</a> for continuous mentorship and guidance, the DataFusion
community, specifically <a href="https://github.com/alamb">Andrew
Lamb</a>, for lending me support throughout my work, and Datadog for
providing the opportunity to work on such interesting
systems.</p></content><category
term="blog"></category></entry><entry><title>Apache DataFusion Comet 0.12.0
Release</title><link href="https://da [...]
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
diff --git a/output/feeds/pmc.atom.xml b/output/feeds/pmc.atom.xml
index a99f11b..7af1ad9 100644
--- a/output/feeds/pmc.atom.xml
+++ b/output/feeds/pmc.atom.xml
@@ -1,5 +1,117 @@
<?xml version="1.0" encoding="utf-8"?>
-<feed xmlns="http://www.w3.org/2005/Atom"><title>Apache DataFusion Blog -
pmc</title><link href="https://datafusion.apache.org/blog/"
rel="alternate"></link><link
href="https://datafusion.apache.org/blog/feeds/pmc.atom.xml"
rel="self"></link><id>https://datafusion.apache.org/blog/</id><updated>2026-01-12T00:00:00+00:00</updated><subtitle></subtitle><entry><title>Apache
DataFusion 52.0.0 Released</title><link
href="https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0"
rel="alte [...]
+<feed xmlns="http://www.w3.org/2005/Atom"><title>Apache DataFusion Blog -
pmc</title><link href="https://datafusion.apache.org/blog/"
rel="alternate"></link><link
href="https://datafusion.apache.org/blog/feeds/pmc.atom.xml"
rel="self"></link><id>https://datafusion.apache.org/blog/</id><updated>2026-01-30T00:00:00+00:00</updated><subtitle></subtitle><entry><title>Apache
DataFusion Comet 0.13.0 Release</title><link
href="https://datafusion.apache.org/blog/2026/01/30/datafusion-comet-0.13.0
[...]
+{% comment %}
+Licensed to the Apache Software Foundation (ASF) under one or more
+contributor license agreements. See the NOTICE file distributed with
+this work for additional information regarding copyright ownership.
+The ASF licenses this file to you under the Apache License, Version 2.0
+(the "License"); you may not use this file except in compliance with
+the License. You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+{% endcomment %}
+-->
+
+<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
+<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
+improved performance and efficiency without requiring any code
changes.</p>
+<p>This release covers approximately eight weeks of development
…</p></summary><content type="html"><!--
+{% comment %}
+Licensed to the Apache Software Foundation (ASF) under one or more
+contributor license agreements. See the NOTICE file distributed with
+this work for additional information regarding copyright ownership.
+The ASF licenses this file to you under the Apache License, Version 2.0
+(the "License"); you may not use this file except in compliance with
+the License. You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+{% endcomment %}
+-->
+
+<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
+<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
+improved performance and efficiency without requiring any code
changes.</p>
+<p>This release covers approximately eight weeks of development work and
is the result of merging 160 PRs from 15
+contributors. See the <a
href="https://github.com/apache/datafusion-comet/blob/main/dev/changelog/0.13.0.md">change
log</a> for more information.</p>
+<h2 id="key-features">Key Features<a class="headerlink"
href="#key-features" title="Permanent link">¶</a></h2>
+<h3 id="native-parquet-write-support-experimental">Native Parquet Write
Support (Experimental)<a class="headerlink"
href="#native-parquet-write-support-experimental" title="Permanent
link">¶</a></h3>
+<p>This release introduces experimental native Parquet write
capabilities, allowing Comet to intercept and execute Parquet write operations
natively through DataFusion. Key capabilities include:</p>
+<ul>
+<li>File commit protocol support for reliable writes</li>
+<li>Remote HDFS writing via OpenDAL integration</li>
+<li>Complex type support (arrays, maps, structs)</li>
+<li>Proper handling of object store settings</li>
+</ul>
+<p>To enable native Parquet writes, set:</p>
+<pre><code>spark.comet.allowIncompatibleOp.DataWritingCommandExec=true
+spark.comet.parquet.write.enabled=true
+</code></pre>
+<p><strong>Note</strong>: This feature is highly
experimental and should not be used in production environments. It is currently
categorized as a testing feature and is disabled by default.</p>
+<h3 id="native-iceberg-improvements">Native Iceberg Improvements<a
class="headerlink" href="#native-iceberg-improvements" title="Permanent
link">¶</a></h3>
+<p>Comet's fully-native Iceberg integration received significant
enhancements in this release:</p>
+<p><strong>REST Catalog Support</strong>: Native Iceberg
scans now support REST catalogs, enabling integration with catalog services
like Apache Polaris and Tabular. Configure with:</p>
+<pre><code class="language-shell">--conf
spark.sql.catalog.rest_cat=org.apache.iceberg.spark.SparkCatalog
+--conf
spark.sql.catalog.rest_cat.catalog-impl=org.apache.iceberg.rest.RESTCatalog
+--conf spark.sql.catalog.rest_cat.uri=http://localhost:8181
+--conf spark.comet.scan.icebergNative.enabled=true
+</code></pre>
+<p><strong>Session Token Authentication</strong>: Added
support for session tokens in native Iceberg scans for secure S3
access.</p>
+<p><strong>Performance Optimizations</strong>:
+- Deduplicated serialized metadata reducing memory overhead
+- Switched from JSON to protobuf for partition value serialization
+- Removed IcebergFileStream in favor of iceberg-rust's built-in parallelization
+- Reduced metadata serialization points
+- Added SchemaAdapter caching</p>
+<p>To enable fully-native Iceberg scanning:</p>
+<pre><code>spark.comet.scan.icebergNative.enabled=true
+</code></pre>
+<p>The native reader supports Iceberg table spec v1 and v2, all
primitive and complex types, schema evolution, time travel, positional and
equality deletes, filter pushdown, and various storage backends (local, HDFS,
S3).</p>
+<h3 id="native-csv-reading-experimental">Native CSV Reading
(Experimental)<a class="headerlink" href="#native-csv-reading-experimental"
title="Permanent link">¶</a></h3>
+<p>Experimental support for native CSV file reading has been added,
expanding Comet's file format capabilities beyond Parquet.</p>
+<h3 id="new-expressions">New Expressions<a class="headerlink"
href="#new-expressions" title="Permanent link">¶</a></h3>
+<p>The release adds support for numerous expressions:
+- Array functions: <code>explode</code>,
<code>explode_outer</code>, <code>size</code>
+- Date/time functions: <code>unix_date</code>,
<code>date_format</code>, <code>datediff</code>,
<code>last_day</code>, <code>unix_timestamp</code>
+- String functions: <code>left</code>
+- JSON functions: <code>from_json</code> (partial
support)</p>
+<h3 id="ansi-mode-support">ANSI Mode Support<a class="headerlink"
href="#ansi-mode-support" title="Permanent link">¶</a></h3>
+<p>Sum and average aggregate expressions now support ANSI mode for both
integer and decimal inputs, enabling overflow checking in strict SQL
mode.</p>
+<h3 id="native-shuffle-improvements">Native Shuffle Improvements<a
class="headerlink" href="#native-shuffle-improvements" title="Permanent
link">¶</a></h3>
+<ul>
+<li>Round-robin partitioning is now supported in native
shuffle</li>
+<li>Spill metrics are now reported correctly</li>
+<li>Configurable shuffle writer buffer size via
<code>spark.comet.shuffle.write.bufferSize</code></li>
+</ul>
+<h2 id="performance-improvements">Performance Improvements<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
+<p>This release includes extensive performance optimizations:</p>
+<ul>
+<li><strong>String to integer casting</strong>: Significant
speedups through optimized parsing</li>
+<li><strong>String functions</strong>: Optimized
<code>lpad</code>/<code>rpad</code> to remove
unnecessary memory allocations</li>
+<li><strong>Date operations</strong>: Improved
<code>normalize_nan</code> and date truncate performance</li>
+<li><strong>Query planning</strong>: Cached query plans to
avoid per-partition serialization overhead</li>
+<li><strong>Memory efficiency</strong>: Reduced GC pressure
in protobuf serialization</li>
+<li><strong>Hash operations</strong>: Optimized complex-type
hash implementations including murmur3 support for nested types</li>
+<li><strong>Runtime efficiency</strong>: Eliminated
busy-polling of Tokio stream for plans without CometScan</li>
+<li><strong>Metrics overhead</strong>: Reduced timer and
syscall overhead in native shuffle writer</li>
+</ul>
+<h2 id="deprecations">Deprecations<a class="headerlink"
href="#deprecations" title="Permanent link">¶</a></h2>
+<p>The <code>native_comet</code> scan mode is now deprecated
in favor of <code>native_iceberg_compat</code> and will be removed
in a future release. The <code>auto</code> scan mode no longer
falls back to <code>native_comet</code>.</p>
+<h2 id="compatibility">Compatibility<a class="headerlink"
href="#compatibility" title="Permanent link">¶</a></h2>
+<p>This release upgrades to DataFusion 51, Arrow 57, and the latest
iceberg-rust. The minimum supported Rust version is now 1.88.</p>
+<p>Supported platforms include Spark 3.4.3, 3.5.4-3.5.7, and Spark 4.0.x
with various JDK and Scala combinations.</p>
+<p>The community encourages users to test Comet with existing Spark
workloads and welcomes contributions to ongoing
development.</p></content><category
term="blog"></category></entry><entry><title>Apache DataFusion 52.0.0
Released</title><link
href="https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0"
rel="alternate"></link><published>2026-01-12T00:00:00+00:00</published><updated>2026-01-12T00:00:00+00:00</updated><author><name>pmc</name></author><id>tag:datafusio
[...]
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
@@ -237,119 +349,7 @@ build a shared technology that none of us could have
built alone.</p>
DataFusion on some of your own data and projects and let us know how it goes,
contribute suggestions, documentation, bug reports, or a PR with documentation,
tests, or code. A list of open issues suitable for beginners is <a
href="https://github.com/apache/arrow-datafusion/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22">here</a>,
and you
-can find out how to reach us on the <a
href="https://datafusion.apache.org/contributor-guide/communication.html">communication
doc</a>.</p></content><category
term="blog"></category></entry><entry><title>Apache DataFusion Comet 0.12.0
Release</title><link
href="https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.13.0"
rel="alternate"></link><published>2025-12-04T00:00:00+00:00</published><updated>2025-12-04T00:00:00+00:00</updated><author><name>pmc</name></
[...]
-{% comment %}
-Licensed to the Apache Software Foundation (ASF) under one or more
-contributor license agreements. See the NOTICE file distributed with
-this work for additional information regarding copyright ownership.
-The ASF licenses this file to you under the Apache License, Version 2.0
-(the "License"); you may not use this file except in compliance with
-the License. You may obtain a copy of the License at
-
-http://www.apache.org/licenses/LICENSE-2.0
-
-Unless required by applicable law or agreed to in writing, software
-distributed under the License is distributed on an "AS IS" BASIS,
-WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-See the License for the specific language governing permissions and
-limitations under the License.
-{% endcomment %}
--->
-
-<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
-<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
-improved performance and efficiency without requiring any code
changes.</p>
-<p>This release covers approximately eight weeks of development
…</p></summary><content type="html"><!--
-{% comment %}
-Licensed to the Apache Software Foundation (ASF) under one or more
-contributor license agreements. See the NOTICE file distributed with
-this work for additional information regarding copyright ownership.
-The ASF licenses this file to you under the Apache License, Version 2.0
-(the "License"); you may not use this file except in compliance with
-the License. You may obtain a copy of the License at
-
-http://www.apache.org/licenses/LICENSE-2.0
-
-Unless required by applicable law or agreed to in writing, software
-distributed under the License is distributed on an "AS IS" BASIS,
-WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-See the License for the specific language governing permissions and
-limitations under the License.
-{% endcomment %}
--->
-
-<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
-<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
-improved performance and efficiency without requiring any code
changes.</p>
-<p>This release covers approximately eight weeks of development work and
is the result of merging 160 PRs from 15
-contributors. See the <a
href="https://github.com/apache/datafusion-comet/blob/main/dev/changelog/0.13.0.md">change
log</a> for more information.</p>
-<h2 id="key-features">Key Features<a class="headerlink"
href="#key-features" title="Permanent link">¶</a></h2>
-<h3 id="native-parquet-write-support-experimental">Native Parquet Write
Support (Experimental)<a class="headerlink"
href="#native-parquet-write-support-experimental" title="Permanent
link">¶</a></h3>
-<p>This release introduces experimental native Parquet write
capabilities, allowing Comet to intercept and execute Parquet write operations
natively through DataFusion. Key capabilities include:</p>
-<ul>
-<li>File commit protocol support for reliable writes</li>
-<li>Remote HDFS writing via OpenDAL integration</li>
-<li>Complex type support (arrays, maps, structs)</li>
-<li>Proper handling of object store settings</li>
-</ul>
-<p>To enable native Parquet writes, set:</p>
-<pre><code>spark.comet.allowIncompatibleOp.DataWritingCommandExec=true
-spark.comet.parquet.write.enabled=true
-</code></pre>
-<p><strong>Note</strong>: This feature is highly
experimental and should not be used in production environments. It is currently
categorized as a testing feature and is disabled by default.</p>
-<h3 id="native-iceberg-improvements">Native Iceberg Improvements<a
class="headerlink" href="#native-iceberg-improvements" title="Permanent
link">¶</a></h3>
-<p>Comet's fully-native Iceberg integration received significant
enhancements in this release:</p>
-<p><strong>REST Catalog Support</strong>: Native Iceberg
scans now support REST catalogs, enabling integration with catalog services
like Apache Polaris and Tabular. Configure with:</p>
-<pre><code class="language-shell">--conf
spark.sql.catalog.rest_cat=org.apache.iceberg.spark.SparkCatalog
---conf
spark.sql.catalog.rest_cat.catalog-impl=org.apache.iceberg.rest.RESTCatalog
---conf spark.sql.catalog.rest_cat.uri=http://localhost:8181
---conf spark.comet.scan.icebergNative.enabled=true
-</code></pre>
-<p><strong>Session Token Authentication</strong>: Added
support for session tokens in native Iceberg scans for secure S3
access.</p>
-<p><strong>Performance Optimizations</strong>:
-- Deduplicated serialized metadata reducing memory overhead
-- Switched from JSON to protobuf for partition value serialization
-- Removed IcebergFileStream in favor of iceberg-rust's built-in parallelization
-- Reduced metadata serialization points
-- Added SchemaAdapter caching</p>
-<p>To enable fully-native Iceberg scanning:</p>
-<pre><code>spark.comet.scan.icebergNative.enabled=true
-</code></pre>
-<p>The native reader supports Iceberg table spec v1 and v2, all
primitive and complex types, schema evolution, time travel, positional and
equality deletes, filter pushdown, and various storage backends (local, HDFS,
S3).</p>
-<h3 id="native-csv-reading-experimental">Native CSV Reading
(Experimental)<a class="headerlink" href="#native-csv-reading-experimental"
title="Permanent link">¶</a></h3>
-<p>Experimental support for native CSV file reading has been added,
expanding Comet's file format capabilities beyond Parquet.</p>
-<h3 id="new-expressions">New Expressions<a class="headerlink"
href="#new-expressions" title="Permanent link">¶</a></h3>
-<p>The release adds support for numerous expressions:
-- Array functions: <code>explode</code>,
<code>explode_outer</code>, <code>size</code>
-- Date/time functions: <code>unix_date</code>,
<code>date_format</code>, <code>datediff</code>,
<code>last_day</code>, <code>unix_timestamp</code>
-- String functions: <code>left</code>
-- JSON functions: <code>from_json</code> (partial
support)</p>
-<h3 id="ansi-mode-support">ANSI Mode Support<a class="headerlink"
href="#ansi-mode-support" title="Permanent link">¶</a></h3>
-<p>Sum and average aggregate expressions now support ANSI mode for both
integer and decimal inputs, enabling overflow checking in strict SQL
mode.</p>
-<h3 id="native-shuffle-improvements">Native Shuffle Improvements<a
class="headerlink" href="#native-shuffle-improvements" title="Permanent
link">¶</a></h3>
-<ul>
-<li>Round-robin partitioning is now supported in native
shuffle</li>
-<li>Spill metrics are now reported correctly</li>
-<li>Configurable shuffle writer buffer size via
<code>spark.comet.shuffle.write.bufferSize</code></li>
-</ul>
-<h2 id="performance-improvements">Performance Improvements<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
-<p>This release includes extensive performance optimizations:</p>
-<ul>
-<li><strong>String to integer casting</strong>: Significant
speedups through optimized parsing</li>
-<li><strong>String functions</strong>: Optimized
<code>lpad</code>/<code>rpad</code> to remove
unnecessary memory allocations</li>
-<li><strong>Date operations</strong>: Improved
<code>normalize_nan</code> and date truncate performance</li>
-<li><strong>Query planning</strong>: Cached query plans to
avoid per-partition serialization overhead</li>
-<li><strong>Memory efficiency</strong>: Reduced GC pressure
in protobuf serialization</li>
-<li><strong>Hash operations</strong>: Optimized complex-type
hash implementations including murmur3 support for nested types</li>
-<li><strong>Runtime efficiency</strong>: Eliminated
busy-polling of Tokio stream for plans without CometScan</li>
-<li><strong>Metrics overhead</strong>: Reduced timer and
syscall overhead in native shuffle writer</li>
-</ul>
-<h2 id="deprecations">Deprecations<a class="headerlink"
href="#deprecations" title="Permanent link">¶</a></h2>
-<p>The <code>native_comet</code> scan mode is now deprecated
in favor of <code>native_iceberg_compat</code> and will be removed
in a future release. The <code>auto</code> scan mode no longer
falls back to <code>native_comet</code>.</p>
-<h2 id="compatibility">Compatibility<a class="headerlink"
href="#compatibility" title="Permanent link">¶</a></h2>
-<p>This release upgrades to DataFusion 51, Arrow 57, and the latest
iceberg-rust. The minimum supported Rust version is now 1.88.</p>
-<p>Supported platforms include Spark 3.4.3, 3.5.4-3.5.7, and Spark 4.0.x
with various JDK and Scala combinations.</p>
-<p>The community encourages users to test Comet with existing Spark
workloads and welcomes contributions to ongoing
development.</p></content><category
term="blog"></category></entry><entry><title>Apache DataFusion Comet 0.12.0
Release</title><link
href="https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.12.0"
rel="alternate"></link><published>2025-12-04T00:00:00+00:00</published><updated>2025-12-04T00:00:00+00:00</updated><author><name>pmc</name></author><id>ta
[...]
+can find out how to reach us on the <a
href="https://datafusion.apache.org/contributor-guide/communication.html">communication
doc</a>.</p></content><category
term="blog"></category></entry><entry><title>Apache DataFusion Comet 0.12.0
Release</title><link
href="https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.12.0"
rel="alternate"></link><published>2025-12-04T00:00:00+00:00</published><updated>2025-12-04T00:00:00+00:00</updated><author><name>pmc</name></
[...]
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
diff --git a/output/feeds/pmc.rss.xml b/output/feeds/pmc.rss.xml
index 4bd8b6f..20bbffa 100644
--- a/output/feeds/pmc.rss.xml
+++ b/output/feeds/pmc.rss.xml
@@ -1,5 +1,5 @@
<?xml version="1.0" encoding="utf-8"?>
-<rss version="2.0"><channel><title>Apache DataFusion Blog -
pmc</title><link>https://datafusion.apache.org/blog/</link><description></description><lastBuildDate>Mon,
12 Jan 2026 00:00:00 +0000</lastBuildDate><item><title>Apache DataFusion
52.0.0
Released</title><link>https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0</link><description><!--
+<rss version="2.0"><channel><title>Apache DataFusion Blog -
pmc</title><link>https://datafusion.apache.org/blog/</link><description></description><lastBuildDate>Fri,
30 Jan 2026 00:00:00 +0000</lastBuildDate><item><title>Apache DataFusion Comet
0.13.0
Release</title><link>https://datafusion.apache.org/blog/2026/01/30/datafusion-comet-0.13.0</link><description><!--
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
@@ -18,12 +18,10 @@ limitations under the License.
{% endcomment %}
-->
-<p>We are proud to announce the release of <a
href="https://crates.io/crates/datafusion/52.0.0">DataFusion
52.0.0</a>. This post highlights
-some of the major improvements since <a
href="https://datafusion.apache.org/blog/2025/11/25/datafusion-51.0.0/">DataFusion
51.0.0</a>. The complete list of
-changes is available in the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md">changelog</a>.
Thanks to the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md#credits">121
contributors</a> for
-making this release possible.</p>
-<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
-<p>We continue to …</p></description><dc:creator
xmlns:dc="http://purl.org/dc/elements/1.1/">pmc</dc:creator><pubDate>Mon, 12
Jan 2026 00:00:00 +0000</pubDate><guid
isPermaLink="false">tag:datafusion.apache.org,2026-01-12:/blog/2026/01/12/datafusion-52.0.0</guid><category>blog</category></item><item><title>Apache
DataFusion Comet 0.12.0
Release</title><link>https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.13.0</link><description><!--
+<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
+<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
+improved performance and efficiency without requiring any code
changes.</p>
+<p>This release covers approximately eight weeks of development
…</p></description><dc:creator
xmlns:dc="http://purl.org/dc/elements/1.1/">pmc</dc:creator><pubDate>Fri, 30
Jan 2026 00:00:00 +0000</pubDate><guid
isPermaLink="false">tag:datafusion.apache.org,2026-01-30:/blog/2026/01/30/datafusion-comet-0.13.0</guid><category>blog</category></item><item><title>Apache
DataFusion 52.0.0
Released</title><link>https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0</link><de
[...]
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
@@ -42,10 +40,12 @@ limitations under the License.
{% endcomment %}
-->
-<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of
the <a href="https://datafusion.apache.org/comet/">Comet</a>
subproject.</p>
-<p>Comet is an accelerator for Apache Spark that translates Spark
physical plans to DataFusion physical plans for
-improved performance and efficiency without requiring any code
changes.</p>
-<p>This release covers approximately eight weeks of development
…</p></description><dc:creator
xmlns:dc="http://purl.org/dc/elements/1.1/">pmc</dc:creator><pubDate>Thu, 04
Dec 2025 00:00:00 +0000</pubDate><guid
isPermaLink="false">tag:datafusion.apache.org,2025-12-04:/blog/2025/12/04/datafusion-comet-0.13.0</guid><category>blog</category></item><item><title>Apache
DataFusion Comet 0.12.0
Release</title><link>https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.12.
[...]
+<p>We are proud to announce the release of <a
href="https://crates.io/crates/datafusion/52.0.0">DataFusion
52.0.0</a>. This post highlights
+some of the major improvements since <a
href="https://datafusion.apache.org/blog/2025/11/25/datafusion-51.0.0/">DataFusion
51.0.0</a>. The complete list of
+changes is available in the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md">changelog</a>.
Thanks to the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md#credits">121
contributors</a> for
+making this release possible.</p>
+<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
+<p>We continue to …</p></description><dc:creator
xmlns:dc="http://purl.org/dc/elements/1.1/">pmc</dc:creator><pubDate>Mon, 12
Jan 2026 00:00:00 +0000</pubDate><guid
isPermaLink="false">tag:datafusion.apache.org,2026-01-12:/blog/2026/01/12/datafusion-52.0.0</guid><category>blog</category></item><item><title>Apache
DataFusion Comet 0.12.0
Release</title><link>https://datafusion.apache.org/blog/2025/12/04/datafusion-comet-0.12.0</link><description><!--
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
diff --git a/output/index.html b/output/index.html
index c3398de..4d84d7f 100644
--- a/output/index.html
+++ b/output/index.html
@@ -51,8 +51,8 @@
<article class="post">
<header>
<div class="title">
- <h1><a
href="/blog/2026/01/12/datafusion-52.0.0">Apache DataFusion 52.0.0
Released</a></h1>
- <p>Posted on: Mon 12 January 2026 by pmc</p>
+ <h1><a
href="/blog/2026/01/30/datafusion-comet-0.13.0">Apache DataFusion Comet 0.13.0
Release</a></h1>
+ <p>Posted on: Fri 30 January 2026 by pmc</p>
<p><!--
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
@@ -72,15 +72,13 @@ limitations under the License.
{% endcomment %}
-->
-<p>We are proud to announce the release of <a
href="https://crates.io/crates/datafusion/52.0.0">DataFusion 52.0.0</a>. This
post highlights
-some of the major improvements since <a
href="https://datafusion.apache.org/blog/2025/11/25/datafusion-51.0.0/">DataFusion
51.0.0</a>. The complete list of
-changes is available in the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md">changelog</a>.
Thanks to the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md#credits">121
contributors</a> for
-making this release possible.</p>
-<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
-<p>We continue to …</p></p>
+<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of the <a
href="https://datafusion.apache.org/comet/">Comet</a> subproject.</p>
+<p>Comet is an accelerator for Apache Spark that translates Spark physical
plans to DataFusion physical plans for
+improved performance and efficiency without requiring any code changes.</p>
+<p>This release covers approximately eight weeks of development …</p></p>
<footer>
<ul class="actions">
- <div style="text-align: right"><a
href="/blog/2026/01/12/datafusion-52.0.0" class="button medium">Continue
Reading</a></div>
+ <div style="text-align: right"><a
href="/blog/2026/01/30/datafusion-comet-0.13.0" class="button medium">Continue
Reading</a></div>
</ul>
<ul class="stats">
</ul>
@@ -94,8 +92,8 @@ making this release possible.</p>
<article class="post">
<header>
<div class="title">
- <h1><a href="/blog/2026/01/12/extending-sql">Extending
SQL in DataFusion: from ->> to TABLESAMPLE</a></h1>
- <p>Posted on: Mon 12 January 2026 by Geoffrey Claude
(Datadog)</p>
+ <h1><a
href="/blog/2026/01/12/datafusion-52.0.0">Apache DataFusion 52.0.0
Released</a></h1>
+ <p>Posted on: Mon 12 January 2026 by pmc</p>
<p><!--
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
@@ -115,11 +113,15 @@ limitations under the License.
{% endcomment %}
-->
-<p>If you embed <a href="https://datafusion.apache.org/">DataFusion</a> in
your product, your users will eventually run SQL that DataFusion does not
recognize. Not because the query is unreasonable, but because SQL in practice
includes many dialects and system-specific statements.</p>
-<p>Suppose you store data as Parquet files on S3 and want users to attach an
…</p></p>
+<p>We are proud to announce the release of <a
href="https://crates.io/crates/datafusion/52.0.0">DataFusion 52.0.0</a>. This
post highlights
+some of the major improvements since <a
href="https://datafusion.apache.org/blog/2025/11/25/datafusion-51.0.0/">DataFusion
51.0.0</a>. The complete list of
+changes is available in the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md">changelog</a>.
Thanks to the <a
href="https://github.com/apache/datafusion/blob/branch-52/dev/changelog/52.0.0.md#credits">121
contributors</a> for
+making this release possible.</p>
+<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
+<p>We continue to …</p></p>
<footer>
<ul class="actions">
- <div style="text-align: right"><a
href="/blog/2026/01/12/extending-sql" class="button medium">Continue
Reading</a></div>
+ <div style="text-align: right"><a
href="/blog/2026/01/12/datafusion-52.0.0" class="button medium">Continue
Reading</a></div>
</ul>
<ul class="stats">
</ul>
@@ -133,8 +135,8 @@ limitations under the License.
<article class="post">
<header>
<div class="title">
- <h1><a
href="/blog/2025/12/15/avoid-consecutive-repartitions">Optimizing Repartitions
in DataFusion: How I Went From Database Noob to Core Contribution</a></h1>
- <p>Posted on: Mon 15 December 2025 by Gene
Bordegaray</p>
+ <h1><a href="/blog/2026/01/12/extending-sql">Extending
SQL in DataFusion: from ->> to TABLESAMPLE</a></h1>
+ <p>Posted on: Mon 12 January 2026 by Geoffrey Claude
(Datadog)</p>
<p><!--
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
@@ -153,13 +155,12 @@ See the License for the specific language governing
permissions and
limitations under the License.
{% endcomment %}
-->
-<div style="display: flex; align-items: center; gap: 20px; margin-bottom:
20px;">
-<div style="flex: 1;">
-Databases are some of the most complex yet interesting pieces of software.
They are amazing pieces of abstraction: query engines optimize and execute
complex plans, storage engines provide sophisticated infrastructure as the
backbone of the system, while intricate file formats lay the groundwork for
particular workloads. All of this is …</div></div></p>
+<p>If you embed <a href="https://datafusion.apache.org/">DataFusion</a> in
your product, your users will eventually run SQL that DataFusion does not
recognize. Not because the query is unreasonable, but because SQL in practice
includes many dialects and system-specific statements.</p>
+<p>Suppose you store data as Parquet files on S3 and want users to attach an
…</p></p>
<footer>
<ul class="actions">
- <div style="text-align: right"><a
href="/blog/2025/12/15/avoid-consecutive-repartitions" class="button
medium">Continue Reading</a></div>
+ <div style="text-align: right"><a
href="/blog/2026/01/12/extending-sql" class="button medium">Continue
Reading</a></div>
</ul>
<ul class="stats">
</ul>
@@ -173,8 +174,8 @@ Databases are some of the most complex yet interesting
pieces of software. They
<article class="post">
<header>
<div class="title">
- <h1><a
href="/blog/2025/12/04/datafusion-comet-0.13.0">Apache DataFusion Comet 0.12.0
Release</a></h1>
- <p>Posted on: Thu 04 December 2025 by pmc</p>
+ <h1><a
href="/blog/2025/12/15/avoid-consecutive-repartitions">Optimizing Repartitions
in DataFusion: How I Went From Database Noob to Core Contribution</a></h1>
+ <p>Posted on: Mon 15 December 2025 by Gene
Bordegaray</p>
<p><!--
{% comment %}
Licensed to the Apache Software Foundation (ASF) under one or more
@@ -193,14 +194,13 @@ See the License for the specific language governing
permissions and
limitations under the License.
{% endcomment %}
-->
+<div style="display: flex; align-items: center; gap: 20px; margin-bottom:
20px;">
+<div style="flex: 1;">
-<p>The Apache DataFusion PMC is pleased to announce version 0.13.0 of the <a
href="https://datafusion.apache.org/comet/">Comet</a> subproject.</p>
-<p>Comet is an accelerator for Apache Spark that translates Spark physical
plans to DataFusion physical plans for
-improved performance and efficiency without requiring any code changes.</p>
-<p>This release covers approximately eight weeks of development …</p></p>
+Databases are some of the most complex yet interesting pieces of software.
They are amazing pieces of abstraction: query engines optimize and execute
complex plans, storage engines provide sophisticated infrastructure as the
backbone of the system, while intricate file formats lay the groundwork for
particular workloads. All of this is …</div></div></p>
<footer>
<ul class="actions">
- <div style="text-align: right"><a
href="/blog/2025/12/04/datafusion-comet-0.13.0" class="button medium">Continue
Reading</a></div>
+ <div style="text-align: right"><a
href="/blog/2025/12/15/avoid-consecutive-repartitions" class="button
medium">Continue Reading</a></div>
</ul>
<ul class="stats">
</ul>
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]