[jira] [Updated] (HUDI-9687) Configure default parallelism as 10 if input parallelism

2025-08-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9687: - Labels: pull-request-available (was: ) > Configure default parallelism as 10 if input parallelism

[jira] [Updated] (HUDI-9684) Add Upgrade/Downgrade Test Fixtures Framework

2025-08-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9684: - Labels: pull-request-available (was: ) > Add Upgrade/Downgrade Test Fixtures Framework

[jira] [Updated] (HUDI-9685) Enable row group-level file stitching in Hudi clustering using schema grouping and Parquet APIs

2025-08-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9685: - Labels: pull-request-available (was: ) > Enable row group-level file stitching in Hudi clustering

[jira] [Updated] (HUDI-9683) Fix spark rename columns for HoodieInternalRowUtils.genUnsafeRowWriter

2025-08-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9683: - Labels: pull-request-available (was: ) > Fix spark rename columns for HoodieInternalRowUtils.genU

[jira] [Updated] (HUDI-9682) Add support to FileGroupRecordBuffer to assist w/ cow merge handle migration

2025-08-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9682: - Labels: pull-request-available (was: ) > Add support to FileGroupRecordBuffer to assist w/ cow me

[jira] [Updated] (HUDI-9679) Refactor filesystem operations to use storage

2025-08-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9679: - Labels: 1.1.0 pull-request-available (was: 1.1.0) > Refactor filesystem operations to use storage

[jira] [Updated] (HUDI-9678) Spark overwrite partitioned mor table failed

2025-08-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9678: - Labels: pull-request-available (was: ) > Spark overwrite partitioned mor table failed

[jira] [Updated] (HUDI-8385) Apply clean commits to col stats partition on restore in data table

2025-07-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8385: - Labels: pull-request-available (was: ) > Apply clean commits to col stats partition on restore in

[jira] [Updated] (HUDI-9676) Avro HoodieAvroUtils.rewriteRecordWithNewSchema logic with renamed columns is wrong

2025-07-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9676: - Labels: pull-request-available (was: ) > Avro HoodieAvroUtils.rewriteRecordWithNewSchema logic wi

[jira] [Updated] (HUDI-9673) Remove the eventime metadata function from DefaultHoodieRecordPayload

2025-07-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9673: - Labels: pull-request-available (was: ) > Remove the eventime metadata function from DefaultHoodie

[jira] [Updated] (HUDI-9675) Savepoint safety checks are missing for cleaner policy KEEP_LATEST_FILE_VERSIONS

2025-07-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9675: - Labels: pull-request-available (was: ) > Savepoint safety checks are missing for cleaner policy

[jira] [Updated] (HUDI-9672) Fix data loss for spark incremental query with skip clustering enabled

2025-07-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9672: - Labels: pull-request-available (was: ) > Fix data loss for spark incremental query with skip clus

[jira] [Updated] (HUDI-9671) Spark reads the changelog data whose _hoodie_operation is -U

2025-07-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9671: - Labels: pull-request-available (was: ) > Spark reads the changelog data whose _hoodie_operation i

[jira] [Updated] (HUDI-9606) Fix the support of user-provided key generator

2025-07-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9606: - Labels: pull-request-available (was: ) > Fix the support of user-provided key generator

[jira] [Updated] (HUDI-9669) Add schema on write support for hive reader

2025-07-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9669: - Labels: pull-request-available (was: ) > Add schema on write support for hive reader

[jira] [Updated] (HUDI-9667) Incorporate completion time into restore workflow

2025-07-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9667: - Labels: pull-request-available (was: ) > Incorporate completion time into restore workflow

[jira] [Updated] (HUDI-9668) Fix float to double conversion for nested + arrays + maps

2025-07-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9668: - Labels: pull-request-available (was: ) > Fix float to double conversion for nested + arrays + map

[jira] [Updated] (HUDI-9666) Fix the record key encoding with a single record key field for complex key generator

2025-07-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9666: - Labels: pull-request-available (was: ) > Fix the record key encoding with a single record key fie

[jira] [Updated] (HUDI-9665) Repartition the write status RDD for MDT DAG to avoid long processing durations

2025-07-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9665: - Labels: pull-request-available (was: ) > Repartition the write status RDD for MDT DAG to avoid lo

[jira] [Updated] (HUDI-9663) Provide new TransactionManager functionality and move instant time management to Transactionmanager

2025-07-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9663: - Labels: pull-request-available (was: ) > Provide new TransactionManager functionality and move in

[jira] [Updated] (HUDI-9664) Refactor HoodieReaderContext and move record APIs into RecordContext

2025-07-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9664: - Labels: pull-request-available (was: ) > Refactor HoodieReaderContext and move record APIs into R

[jira] [Updated] (HUDI-9661) Fix `Hoodie#getOrderingValue()` to avoid extracting order fields at record level

2025-07-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9661: - Labels: pull-request-available (was: ) > Fix `Hoodie#getOrderingValue()` to avoid extracting orde

[jira] [Updated] (HUDI-9658) Improve Upgrade/Downgrade issues in event of crash

2025-07-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9658: - Labels: pull-request-available (was: ) > Improve Upgrade/Downgrade issues in event of crash

[jira] [Updated] (HUDI-9657) Test schema on read in the filegroup reader

2025-07-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9657: - Labels: pull-request-available (was: ) > Test schema on read in the filegroup reader

[jira] [Updated] (HUDI-9656) revise interface of key encode

2025-07-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9656: - Labels: pull-request-available (was: ) > revise interface of key encode

[jira] [Updated] (HUDI-9628) Add bloom filter pruning when looking up keys in metadata files

2025-07-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9628: - Labels: pull-request-available (was: ) > Add bloom filter pruning when looking up keys in metadat

[jira] [Updated] (HUDI-9654) [Schema Evolution] Dont evolve schema if reconciled schema exactly match original schema

2025-07-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9654: - Labels: pull-request-available schema-evolution (was: schema-evolution) > [Schema Evolution] Dont

[jira] [Updated] (HUDI-9653) Add Spark Procedures for showing requested , completed cleans

2025-07-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9653: - Labels: pull-request-available (was: ) > Add Spark Procedures for showing requested , completed c

[jira] [Updated] (HUDI-9650) Add test for buffered record merger

2025-07-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9650: - Labels: pull-request-available (was: ) > Add test for buffered record merger

[jira] [Updated] (HUDI-9632) for all spark rdd usage of index lookup, must uncache by caller

2025-07-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9632: - Labels: pull-request-available (was: ) > for all spark rdd usage of index lookup, must uncache by

[jira] [Updated] (HUDI-9645) Master broken July 25, 2025

2025-07-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9645: - Labels: pull-request-available (was: ) > Master broken July 25, 2025

[jira] [Updated] (HUDI-9642) Local maven build fails July 24, 2025

2025-07-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9642: - Labels: pull-request-available (was: ) > Local maven build fails July 24, 2025

[jira] [Updated] (HUDI-9641) Create HoodieAvroBinaryRecord

2025-07-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9641: - Labels: pull-request-available (was: ) > Create HoodieAvroBinaryRecord

[jira] [Updated] (HUDI-9640) Replace MERGE_PROPERTIES with multiple configs with prefix

2025-07-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9640: - Labels: pull-request-available (was: ) > Replace MERGE_PROPERTIES with multiple configs with pref

[jira] [Updated] (HUDI-9638) Fix table creation logic for payload deprecation

2025-07-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9638: - Labels: pull-request-available (was: ) > Fix table creation logic for payload deprecation

[jira] [Updated] (HUDI-9572) Fix minor performance improvements

2025-07-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9572: - Labels: pull-request-available (was: ) > Fix minor performance improvements

[jira] [Updated] (HUDI-9637) Address comments to an existing PR

2025-07-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9637: - Labels: pull-request-available (was: ) > Address comments to an existing PR

[jira] [Updated] (HUDI-9527) Use FileGroupReader for HoodieTableMetadataUtil

2025-07-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9527: - Labels: pull-request-available (was: ) > Use FileGroupReader for HoodieTableMetadataUtil

[jira] [Updated] (HUDI-9636) Revert changes to DESCRIBE command in Spark

2025-07-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9636: - Labels: pull-request-available (was: ) > Revert changes to DESCRIBE command in Spark

[jira] [Updated] (HUDI-9617) Clean redundant adapters for supporting multiple Flink versions

2025-07-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9617: - Labels: pull-request-available (was: ) > Clean redundant adapters for supporting multiple Flink v

[jira] [Updated] (HUDI-9634) Archival considers retaining the `the earliest retain instant` in the clean plan

2025-07-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9634: - Labels: pull-request-available (was: ) > Archival considers retaining the `the earliest retain in

[jira] [Updated] (HUDI-9633) Add ability to remove custom configs from being passed to kafka consumer

2025-07-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9633: - Labels: pull-request-available (was: ) > Add ability to remove custom configs from being passed t

[jira] [Updated] (HUDI-9626) Implement pre caching of metadata Hfiles below a certain size threshold

2025-07-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9626: - Labels: pull-request-available (was: ) > Implement pre caching of metadata Hfiles below a certain

[jira] [Updated] (HUDI-9630) Support non-globally unique record keys in the record index

2025-07-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9630: - Labels: pull-request-available (was: ) > Support non-globally unique record keys in the record in

[jira] [Updated] (HUDI-9566) Secondary index convert everything to string

2025-07-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9566: - Labels: pull-request-available (was: ) > Secondary index convert everything to string

[jira] [Updated] (HUDI-9579) only allow SI creation for certain column types

2025-07-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9579: - Labels: pull-request-available (was: ) > only allow SI creation for certain column types

[jira] [Updated] (HUDI-9621) Fix divergence between file index and incremental file index

2025-07-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9621: - Labels: pull-request-available (was: ) > Fix divergence between file index and incremental file i

[jira] [Updated] (HUDI-9602) Fix or migrate all writers to use mode mode + mergers

2025-07-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9602: - Labels: pull-request-available (was: ) > Fix or migrate all writers to use mode mode + mergers

[jira] [Updated] (HUDI-6920) Move the special handing for instant time in HoodieAppendHandle out

2025-07-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6920: - Labels: pull-request-available (was: ) > Move the special handing for instant time in HoodieAppen

[jira] [Updated] (HUDI-8863) Fine-grained Rate Limiting When Stream Read

2025-07-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8863: - Labels: pull-request-available (was: ) > Fine-grained Rate Limiting When Stream Read

[jira] [Updated] (HUDI-9590) OptimizedLogBlockScan support w/ FG reader

2025-07-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9590: - Labels: pull-request-available (was: ) > OptimizedLogBlockScan support w/ FG reader

[jira] [Updated] (HUDI-9618) Remove explicit type casting to `HoodieWriteMergeHandle` in `BaseFlinkCommitActionExecutor`

2025-07-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9618: - Labels: pull-request-available (was: ) > Remove explicit type casting to `HoodieWriteMergeHandle`

[jira] [Updated] (HUDI-9611) Metadata initialization repeatedly triggering

2025-07-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9611: - Labels: pull-request-available (was: ) > Metadata initialization repeatedly triggering

[jira] [Updated] (HUDI-9615) Unify Schema On Read in the filegroup reader

2025-07-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9615: - Labels: pull-request-available (was: ) > Unify Schema On Read in the filegroup reader

[jira] [Updated] (HUDI-9612) Improve schema resolver performance by skipping delete-only files

2025-07-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9612: - Labels: pull-request-available (was: ) > Improve schema resolver performance by skipping delete-o

[jira] [Updated] (HUDI-9610) Support to get table params from catalog for copy_to_temp_view call

2025-07-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9610: - Labels: pull-request-available (was: ) > Support to get table params from catalog for copy_to_tem

[jira] [Updated] (HUDI-9609) Add support for V1 incremental queries in spark using fg reader

2025-07-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9609: - Labels: pull-request-available (was: ) > Add support for V1 incremental queries in spark using fg

[jira] [Updated] (HUDI-9604) Fix batch creation during send acknowledgement flow in GcsEventsSource

2025-07-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9604: - Labels: pull-request-available (was: ) > Fix batch creation during send acknowledgement flow in G

[jira] [Updated] (HUDI-9565) Unify Schema Evolution in the FilegroupReader

2025-07-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9565: - Labels: pull-request-available (was: ) > Unify Schema Evolution in the FilegroupReader

[jira] [Updated] (HUDI-9335) Refactor RowDataKeyGens and make it common point for keygen instantiation in Flink

2025-07-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9335: - Labels: pull-request-available (was: ) > Refactor RowDataKeyGens and make it common point for key

[jira] [Updated] (HUDI-9578) Support reader caching inside FG reader

2025-07-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9578: - Labels: pull-request-available (was: ) > Support reader caching inside FG reader

[jira] [Updated] (HUDI-9594) Allow Hudi to delegate catalog operations to Apache Polaris

2025-07-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9594: - Labels: pull-request-available (was: ) > Allow Hudi to delegate catalog operations to Apache Pola

[jira] [Updated] (HUDI-9592) Use FileGroupReader for metadata table reads in spark datasource

2025-07-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9592: - Labels: pull-request-available (was: ) > Use FileGroupReader for metadata table reads in spark da

[jira] [Updated] (HUDI-9593) Support Custom partitioner in append mode

2025-07-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9593: - Labels: pull-request-available (was: ) > Support Custom partitioner in append mode

[jira] [Updated] (HUDI-9591) FG reader supports an iterator as log records

2025-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9591: - Labels: pull-request-available (was: ) > FG reader supports an iterator as log records

[jira] [Updated] (HUDI-9581) Allow point lookup in MDT read

2025-07-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9581: - Labels: pull-request-available (was: ) > Allow point lookup in MDT read

[jira] [Updated] (HUDI-9576) Add Filegroup Reader Testing for Schema Evolution

2025-07-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9576: - Labels: pull-request-available (was: ) > Add Filegroup Reader Testing for Schema Evolution

[jira] [Updated] (HUDI-9570) Flink AsyncInstant may lost data when trigger recommit.

2025-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9570: - Labels: pull-request-available (was: ) > Flink AsyncInstant may lost data when trigger recommit.

[jira] [Updated] (HUDI-9575) Skip compaction for file slice with no log files

2025-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9575: - Labels: pull-request-available (was: ) > Skip compaction for file slice with no log files

[jira] [Updated] (HUDI-9574) Avoid creating InternalSchemaManager for each file in FileGroup reader based Flink Compaction

2025-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9574: - Labels: pull-request-available (was: ) > Avoid creating InternalSchemaManager for each file in Fi

[jira] [Updated] (HUDI-9548) Secondary index look up should escape -> hash -> sort for read write

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9548: - Labels: pull-request-available (was: ) > Secondary index look up should escape -> hash -> sort fo

[jira] [Updated] (HUDI-9551) Secondary index lookup in table version 9 is not prefix lookup

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9551: - Labels: pull-request-available (was: ) > Secondary index lookup in table version 9 is not prefix

[jira] [Updated] (HUDI-9569) Add support for multiple ordering fields

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9569: - Labels: pull-request-available (was: ) > Add support for multiple ordering fields

[jira] [Updated] (HUDI-9571) Avoid scanning timeline to fetch InternalSchema for schema evolution

2025-07-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9571: - Labels: pull-request-available (was: ) > Avoid scanning timeline to fetch InternalSchema for sche

[jira] [Updated] (HUDI-9567) BaseHoodieLogRecordReader should skip checking inflight instant for table version EIGHT

2025-07-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9567: - Labels: pull-request-available (was: ) > BaseHoodieLogRecordReader should skip checking inflight

[jira] [Updated] (HUDI-9526) Use FileGroupReader for CDC Flows

2025-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9526: - Labels: pull-request-available (was: ) > Use FileGroupReader for CDC Flows

[jira] [Updated] (HUDI-9119) Hudi 1.0.1 cannot write MOR tables

2025-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9119: - Labels: pull-request-available (was: ) > Hudi 1.0.1 cannot write MOR tables

[jira] [Updated] (HUDI-9564) Inroduce BufferRecordMerger API in FileGroupRecordBuffer

2025-07-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9564: - Labels: pull-request-available (was: ) > Inroduce BufferRecordMerger API in FileGroupRecordBuffer

[jira] [Updated] (HUDI-9174) Cleanup CLI usages of log scanner

2025-07-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9174: - Labels: pull-request-available (was: ) > Cleanup CLI usages of log scanner

[jira] [Updated] (HUDI-9563) Add resource tags for AWS Glue database and tables

2025-07-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9563: - Labels: pull-request-available (was: ) > Add resource tags for AWS Glue database and tables

[jira] [Updated] (HUDI-9556) Trino-hudi module migration to Hudi Repo

2025-06-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9556: - Labels: pull-request-available (was: ) > Trino-hudi module migration to Hudi Repo

[jira] [Updated] (HUDI-9559) Fail rollback execution if deletion of files fails

2025-06-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9559: - Labels: pull-request-available (was: ) > Fail rollback execution if deletion of files fails

[jira] [Updated] (HUDI-9439) Add helpers for reading select fields from schema in FileGroupReader

2025-06-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9439: - Labels: pull-request-available (was: ) > Add helpers for reading select fields from schema in Fil

[jira] [Updated] (HUDI-9560) [Umbrella] RFC-94: Deprecate payload classes from Hudi

2025-06-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9560: - Labels: pull-request-available (was: ) > [Umbrella] RFC-94: Deprecate payload classes from Hudi

[jira] [Updated] (HUDI-9562) Support to use spark.task.cpus to calculate the merge/compaction max memory

2025-06-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9562: - Labels: pull-request-available (was: ) > Support to use spark.task.cpus to calculate the merge/co

[jira] [Updated] (HUDI-8290) Use Filegroup Reader in Spark Structured Streaming Read

2025-06-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8290: - Labels: needs-attention pull-request-available (was: needs-attention) > Use Filegroup Reader in S

[jira] [Updated] (HUDI-9558) Make merge handles configurable

2025-06-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9558: - Labels: pull-request-available (was: ) > Make merge handles configurable > --

[jira] [Updated] (HUDI-9529) Bloom index inspects base file from incomplete compaction

2025-06-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9529: - Labels: pull-request-available (was: ) > Bloom index inspects base file from incomplete compactio

[jira] [Updated] (HUDI-9553) Add equals and hashcode override for Predicate static classes

2025-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9553: - Labels: pull-request-available (was: ) > Add equals and hashcode override for Predicate static cl

[jira] [Updated] (HUDI-9226) Add support for Flink 2.0

2025-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9226: - Labels: pull-request-available (was: ) > Add support for Flink 2.0 > - >

[jira] [Updated] (HUDI-9528) Support custom database and table name for all hudi meta-syncs

2025-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9528: - Labels: pull-request-available (was: ) > Support custom database and table name for all hudi meta

[jira] [Updated] (HUDI-9543) Secondary index readers and writers do not handle null char properly

2025-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9543: - Labels: pull-request-available (was: ) > Secondary index readers and writers do not handle null c

[jira] [Updated] (HUDI-9476) Cleanup code paths for instant generation

2025-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9476: - Labels: pull-request-available (was: ) > Cleanup code paths for instant generation >

[jira] [Updated] (HUDI-9340) Add SI support w/ new dag

2025-06-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9340: - Labels: pull-request-available (was: ) > Add SI support w/ new dag > - >

[jira] [Updated] (HUDI-9523) Fix UT Only one SparkContext should be running in this JVM

2025-06-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9523: - Labels: pull-request-available (was: ) > Fix UT Only one SparkContext should be running in this J

[jira] [Updated] (HUDI-9520) Support new hudi sink based on Flink V2 sink

2025-06-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9520: - Labels: pull-request-available (was: ) > Support new hudi sink based on Flink V2 sink > -

[jira] [Updated] (HUDI-9518) Upgrade Flink 1.15/1.16 parquet version from 1.12.2 to 1.12.3

2025-06-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9518: - Labels: pull-request-available (was: ) > Upgrade Flink 1.15/1.16 parquet version from 1.12.2 to 1

[jira] [Updated] (HUDI-9517) Improve Flink hudi Sink record write error metric and logging

2025-06-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9517: - Labels: pull-request-available (was: ) > Improve Flink hudi Sink record write error metric and lo

[jira] [Updated] (HUDI-8286) Use filegroup reader for compaction

2025-06-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8286: - Labels: pull-request-available (was: ) > Use filegroup reader for compaction > --

[jira] [Updated] (HUDI-9514) Scanner resources not properly closed in HoodieCompact

2025-06-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9514: - Labels: pull-request-available (was: ) > Scanner resources not properly closed in HoodieCompact >
