Meeting Minutes from 2022-11-30 Iceberg Sync

Eduard Tudenhoefner Thu, 01 Dec 2022 08:45:57 -0800

Hi Iceberg Community,

Here are the minutes and recording from our Iceberg Sync that took
place on *November 30th*.


Always remember, anyone can join the discussion, so feel free to share the
Iceberg-Sync <https://groups.google.com/g/iceberg-sync> Google group with
anyone seeking an invite.
The notes and the agenda are posted in the Iceberg Sync doc
<https://docs.google.com/document/d/1YuGhUdukLP5gGiqCbk0A5_Wifqe2CZWgOd3TbhY3UQg/edit?usp=drive_web>
that's also attached to the meeting invitation, and it's an excellent place
to add items as you see fit so we can discuss them in the following
community sync.

Meeting Recording
<https://drive.google.com/file/d/1_M9OyGP4EW2TBUq3RL4CgD82eLQsjfwU/view?usp=sharing>
⭕

Meeting Transcript
<https://docs.google.com/document/d/16KGb1wCchpQPuwOdZLwGdnlgjG6r9DYUs4Y9fosr2iU/edit?usp=sharing>

   - Highlights
      - 1.1.0 is out! (Thanks, Gabor and Fokko!)
      - Python scan planning is working (Thanks, Fokko!)
         - And experimental PyArrow and DuckDB support
      - Python Glue support (Thanks!)
      - Encryption stream spec is in (Thanks, Gidon!)
      - View interfaces are ready (Thanks, John!)
   - Releases
      - 1.2.0 timeline
         - Mid to end of January 2023
         - Targeting features
      - Python 0.2.0
         - Scan planning! Experimental PyArrow! Glue support!
         - Jun is RM
         - Targeting a candidate this week
   - Discussion
      - Partition stats proposal
         - Decide on the format
      - Puffin and Spark for NDVs
         - Trino performance is much better for some queries – NDV related
         - Will update DSv2 to pass NDV
      - Changelog scans
         - AS OF syntax is not supported
         - Filtering didn’t work easily
      - Alternative view representations
      - Rate limit on Spark Streaming (PR:
        https://github.com/apache/iceberg/pull/4479)

Any analysis or improvement action based on this?
https://brooklyndata.co/blog/benchmarking-open-table-formats

Thanks everyone
