Re: [PR] [HUDI-9239]Fix the bug of Spark Cache not releasing cleanly [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13050: URL: https://github.com/apache/hudi/pull/13050#issuecomment-2760360200 ## CI report: * cd164f612b0740446385572e55cce394a76e4cb4 Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4515)

[jira] [Updated] (HUDI-9239) Fix the bug of Spark Cache not releasing cleanly

2025-03-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9239: - Labels: pull-request-available (was: ) > Fix the bug of Spark Cache not releasing cleanly > -

Re: [PR] [HUDI-8409] Add test to ensure merge mode during upgrade and downgrade [hudi]

2025-03-27 Thread via GitHub
nsivabalan commented on code in PR #13046: URL: https://github.com/apache/hudi/pull/13046#discussion_r2017973607 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/upgrade/DowngradeHandler.java: ## @@ -38,7 +40,7 @@ public interface DowngradeHandler { * @p

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
danny0405 commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017980935 ## hudi-common/src/main/java/org/apache/hudi/metadata/HoodieTableMetadataUtil.java: ## @@ -1089,35 +1089,6 @@ public static Set getRecordKeys(List logFilePaths, Hoodi

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
danny0405 commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017980604 ## hudi-client/hudi-spark-client/src/test/java/org/apache/hudi/table/functional/TestMarkerBasedRollbackStrategy.java: ## @@ -165,10 +238,11 @@ public void testCopyOnWr

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
danny0405 commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017975557 ## hudi-spark-datasource/hudi-spark/src/test/java/org/apache/hudi/table/TestHoodieMergeOnReadTable.java: ## @@ -605,6 +605,10 @@ public void testMetadataStatsOnCommit(

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
danny0405 commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017974085 ## hudi-io/src/main/java/org/apache/hudi/storage/StorageSchemes.java: ## @@ -86,6 +88,11 @@ public enum StorageSchemes { // Hopsworks File System HOPSFS("hopsfs"

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
danny0405 commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017970464 ## hudi-common/src/main/java/org/apache/hudi/metadata/HoodieTableMetadataUtil.java: ## @@ -1143,7 +1114,6 @@ private static List convertMetadataToRollbackRecords(Hood

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
danny0405 commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017967353 ## hudi-client/hudi-spark-client/src/test/java/org/apache/hudi/table/functional/TestMarkerBasedRollbackStrategy.java: ## @@ -149,6 +163,65 @@ public void testMarkerBa

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
danny0405 commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017965070 ## hudi-client/hudi-spark-client/src/test/java/org/apache/hudi/table/action/rollback/TestCopyOnWriteRollbackActionExecutor.java: ## @@ -431,7 +433,7 @@ public void tes

[PR] [HUDI-9230]Fix the bug of Spark Cache not releasing cleanly [hudi]

2025-03-27 Thread via GitHub
zhangyue19921010 opened a new pull request, #13050: URL: https://github.com/apache/hudi/pull/13050 ### Change Logs For now using basePath String as Cache Key for the extra `\` ( DFS consider the paths to be the same), the cache is released strictly according to the string compa

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
danny0405 commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017964552 ## hudi-client/hudi-spark-client/src/test/java/org/apache/hudi/table/action/rollback/TestCopyOnWriteRollbackActionExecutor.java: ## @@ -106,7 +107,7 @@ public void te

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13007: URL: https://github.com/apache/hudi/pull/13007#issuecomment-2760245045 ## CI report: * 97d62dae22862373e799718850bcdcb4e6660a8f Azure: [CANCELED](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4513)

Re: [PR] [HUDI-9230]Fix the bug of Spark Cache not releasing cleanly [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13050: URL: https://github.com/apache/hudi/pull/13050#issuecomment-2760210100 ## CI report: * cd164f612b0740446385572e55cce394a76e4cb4 Azure: [PENDING](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4515)

(hudi) 01/01: Fix the bug of Spark Cache not releasing cleanly

2025-03-27 Thread zhangyue19921010
This is an automated email from the ASF dual-hosted git repository. zhangyue19921010 pushed a commit to branch HUDI-9239 in repository https://gitbox.apache.org/repos/asf/hudi.git commit cd164f612b0740446385572e55cce394a76e4cb4 Author: YueZhang AuthorDate: Fri Mar 28 13:00:29 2025 +0800 Fix

Re: [PR] [HUDI-9230]Fix the bug of Spark Cache not releasing cleanly [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13050: URL: https://github.com/apache/hudi/pull/13050#issuecomment-2760208144 ## CI report: * cd164f612b0740446385572e55cce394a76e4cb4 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

[jira] [Updated] (HUDI-9230) Partition Level Bucket Index adopt Spark and Flink reader bucket id pruning

2025-03-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9230: - Labels: pull-request-available (was: ) > Partition Level Bucket Index adopt Spark and Flink reade

(hudi) branch HUDI-9239 created (now cd164f612b0)

2025-03-27 Thread zhangyue19921010
This is an automated email from the ASF dual-hosted git repository. zhangyue19921010 pushed a change to branch HUDI-9239 in repository https://gitbox.apache.org/repos/asf/hudi.git at cd164f612b0 Fix the bug of Spark Cache not releasing cleanly This branch includes the following new commit

[jira] [Created] (HUDI-9239) Fix the bug of Spark Cache not releasing cleanly

2025-03-27 Thread Yue Zhang (Jira)
Yue Zhang created HUDI-9239: --- Summary: Fix the bug of Spark Cache not releasing cleanly Key: HUDI-9239 URL: https://issues.apache.org/jira/browse/HUDI-9239 Project: Apache Hudi Issue Type: Bug

(hudi) branch master updated (906747a5bfb -> 12fa7d27084)

2025-03-27 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from 906747a5bfb [HUDI-9042] Support `TimestampType.DATE_STRING` by `RowDataKeyGen` (#13037) add 12fa7d27084 [HUDI-9238]

hudi.git: Error while running github feature from master:.asf.yaml

2025-03-27 Thread Apache Infrastructure
An error occurred while processing the github feature in .asf.yaml: GitHub discussions can only be enabled if a mailing list target exists for it. --- With regards, ASF Infra.

Re: [PR] [HUDI-9238] Fix bug with HudiIncrSource when source is table version 6 [hudi]

2025-03-27 Thread via GitHub
yihua merged PR #13048: URL: https://github.com/apache/hudi/pull/13048 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.o

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13007: URL: https://github.com/apache/hudi/pull/13007#issuecomment-2760182577 ## CI report: * 97d62dae22862373e799718850bcdcb4e6660a8f Azure: [CANCELED](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4513)

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
nsivabalan commented on PR #13007: URL: https://github.com/apache/hudi/pull/13007#issuecomment-2760173050 hey @lokeshj1703 : I have addressed most of source code refactoring changes. Can you take it from here. most likely very few comments in source code and all comments on test code need t

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13007: URL: https://github.com/apache/hudi/pull/13007#issuecomment-2760173323 ## CI report: * b71440eb08fec4554445cc3a9ddfefc5cdb7c86a Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4505)

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
nsivabalan commented on PR #13007: URL: https://github.com/apache/hudi/pull/13007#issuecomment-2760166889 Major refactoring changes updated: - Introduced CommitMetadataResolver which BaseHoodieWriteClient calls to prepare the commit metadata. All logic to build commit metadata and then c

[jira] [Comment Edited] (HUDI-8872) Fix javadoc comments

2025-03-27 Thread Geser Dugarov (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17913680#comment-17913680 ] Geser Dugarov edited comment on HUDI-8872 at 3/28/25 4:47 AM: --

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
nsivabalan commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017900194 ## hudi-spark-datasource/hudi-spark/src/test/java/org/apache/hudi/client/functional/TestHoodieBackedMetadata.java: ## @@ -256,6 +259,7 @@ public void testMetadataTabl

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13007: URL: https://github.com/apache/hudi/pull/13007#issuecomment-2760159309 ## CI report: * b71440eb08fec4554445cc3a9ddfefc5cdb7c86a Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4505)

Re: [PR] [HUDI-9238] Fix bug with HudiIncrSource when source is table version 6 [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13048: URL: https://github.com/apache/hudi/pull/13048#issuecomment-2760145481 ## CI report: * 59c0060026f24456312850ff3d96e6e66f29b31b Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4511)

Re: [PR] [HUDI-6895] Change default timeline timezone from local to UTC [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #9794: URL: https://github.com/apache/hudi/pull/9794#issuecomment-2760114371 ## CI report: * a52be9a2bc8a4270c1bef1a46b5279ad3bfa9497 Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4512)

Re: [PR] [HUDI-9238] Fix bug with HudiIncrSource when source is table version 6 [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13048: URL: https://github.com/apache/hudi/pull/13048#issuecomment-2760089618 ## CI report: * 59c0060026f24456312850ff3d96e6e66f29b31b Azure: [PENDING](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4511)

Re: [PR] [HUDI-9144] Flink writer for MOR table supports writing RowData … [hudi]

2025-03-27 Thread via GitHub
danny0405 commented on code in PR #12967: URL: https://github.com/apache/hudi/pull/12967#discussion_r2002809956 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/sink/utils/DummyRecordComparator.java: ## @@ -0,0 +1,32 @@ +/* + * Licensed to the Apache Software Fo

Re: [PR] [HUDI-9238] Fix bug with HudiIncrSource when source is table version 6 [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13048: URL: https://github.com/apache/hudi/pull/13048#issuecomment-2760069468 ## CI report: * 59c0060026f24456312850ff3d96e6e66f29b31b UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

[I] [SUPPORT] flink quick start very slow in kubernetes session deploy mode [hudi]

2025-03-27 Thread via GitHub
804e opened a new issue, #13049: URL: https://github.com/apache/hudi/issues/13049 **Describe the problem you faced** After the data entry test is performed according to the document [flink quick start](https://hudi.apache.org/cn/docs/flink-quick-start-guide), data insertion is sl

Re: [PR] [HUDI-8855] Add bucket properties for spark bucket index query pruning [hudi]

2025-03-27 Thread via GitHub
empcl commented on code in PR #12614: URL: https://github.com/apache/hudi/pull/12614#discussion_r2017819313 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieFileIndex.scala: ## @@ -534,6 +535,15 @@ object HoodieFileIndex extends Logging { pr

Re: [PR] [HUDI-8635] Support numWrites metric for compaction [hudi]

2025-03-27 Thread via GitHub
linliu-code commented on code in PR #13047: URL: https://github.com/apache/hudi/pull/13047#discussion_r2017746196 ## hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/HoodieSparkFileGroupReaderBasedMergeHandle.java: ## @@ -283,6 +283,10 @@ public List close() {

Re: [PR] [HUDI-9238] Fix bug with HudiIncrSource when source is table version 6 [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13048: URL: https://github.com/apache/hudi/pull/13048#issuecomment-2760036984 ## CI report: * 59c0060026f24456312850ff3d96e6e66f29b31b Azure: [FAILURE](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4511)

Re: [PR] [HUDI-6895] Change default timeline timezone from local to UTC [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #9794: URL: https://github.com/apache/hudi/pull/9794#issuecomment-2760012361 ## CI report: * 09965716ab9e348207a2d9d6c460e31fb7ee95f1 Azure: [FAILURE](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4499) *

Re: [PR] [HUDI-8855] Add bucket properties for spark bucket index query pruning [hudi]

2025-03-27 Thread via GitHub
empcl commented on code in PR #12614: URL: https://github.com/apache/hudi/pull/12614#discussion_r2017819313 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieFileIndex.scala: ## @@ -534,6 +535,15 @@ object HoodieFileIndex extends Logging { pr

Re: [PR] [HUDI-8635] Support numWrites metric for compaction [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13047: URL: https://github.com/apache/hudi/pull/13047#issuecomment-2759972361 ## CI report: * 7b446c67365ea8073d4ee9a268bc446e6b5a0335 UNKNOWN * 81e9e5c456416b45e254407b9ac2660114124fc0 UNKNOWN * a1243dc67b0348ef481b64b635441185bf40f936 Azure: [SUCC

(hudi) branch master updated: [HUDI-9167] Remove redundant classes in hudi-utilities-slim-bundle (#12962)

2025-03-27 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new badee10d290 [HUDI-9167] Remove redundant classes in

[jira] [Created] (HUDI-9203) Fix usability w/ upgrading from 0.x to 1.x and wish to stay in table version 6

2025-03-27 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-9203: - Summary: Fix usability w/ upgrading from 0.x to 1.x and wish to stay in table version 6 Key: HUDI-9203 URL: https://issues.apache.org/jira/browse/HUDI-9203

[jira] [Updated] (HUDI-9218) Implement HoodieRecordMerger for Flink HoodieRecord

2025-03-27 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-9218: - Status: Open (was: In Progress) > Implement HoodieRecordMerger for Flink HoodieRecord > -

Re: [PR] [HUDI-9238] Fix bug with HudiIncrSource when source is table version 6 [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13048: URL: https://github.com/apache/hudi/pull/13048#issuecomment-2759937044 ## CI report: * 59c0060026f24456312850ff3d96e6e66f29b31b Azure: [PENDING](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4511)

Re: [PR] [HUDI-6895] Change default timeline timezone from local to UTC [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #9794: URL: https://github.com/apache/hudi/pull/9794#issuecomment-2759936765 ## CI report: * 09965716ab9e348207a2d9d6c460e31fb7ee95f1 Azure: [FAILURE](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4499) *

Re: [PR] [HUDI-9206] Support reading inflight instants with HoodieLogRecordReader [hudi]

2025-03-27 Thread via GitHub
danny0405 commented on code in PR #13010: URL: https://github.com/apache/hudi/pull/13010#discussion_r2017786155 ## hudi-common/src/main/java/org/apache/hudi/common/table/read/HoodieFileGroupReader.java: ## @@ -82,20 +82,17 @@ public final class HoodieFileGroupReader implements

hudi.git: Error while running github feature from master:.asf.yaml

2025-03-27 Thread Apache Infrastructure
An error occurred while processing the github feature in .asf.yaml: GitHub discussions can only be enabled if a mailing list target exists for it. --- With regards, ASF Infra.

Re: [PR] [HUDI-9213] Handle record index field position while create RLI payload [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13020: URL: https://github.com/apache/hudi/pull/13020#issuecomment-2748702673 ## CI report: * 1bf87d407d33d26510a8f51dd5fb0271d04e8057 Azure: [FAILURE](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4359)

(hudi) 06/06: finish Spark related conding and test

2025-03-27 Thread zhangyue19921010
This is an automated email from the ASF dual-hosted git repository. zhangyue19921010 pushed a commit to branch HUDI-8990 in repository https://gitbox.apache.org/repos/asf/hudi.git commit aa9ed6ce12c49d9c79034453a9bba3bfaf16a9d2 Author: YueZhang AuthorDate: Fri Mar 21 16:03:50 2025 +0800 fin

[jira] [Updated] (HUDI-9150) Flink regular reader supports row data based record merge

2025-03-27 Thread Shuo Cheng (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shuo Cheng updated HUDI-9150: - Summary: Flink regular reader supports row data based record merge (was: Flink regular reader and compact

[jira] [Updated] (HUDI-9228) [RFC-87] Support rowdata writing for MOR table with consistent bucket index

2025-03-27 Thread Shuo Cheng (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shuo Cheng updated HUDI-9228: - Status: In Progress (was: Open) > [RFC-87] Support rowdata writing for MOR table with consistent bucket i

[jira] [Closed] (HUDI-9042) `RowDataKeyGen` doesn't support all available key generators

2025-03-27 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen closed HUDI-9042. Reviewers: Danny Chen Resolution: Fixed Fixed via master branch: 906747a5bfbec6babfcf54a175ec729f24572e

[jira] [Updated] (HUDI-9042) `RowDataKeyGen` doesn't support all available key generators

2025-03-27 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-9042: - Status: Open (was: In Progress) > `RowDataKeyGen` doesn't support all available key generators >

[jira] [Closed] (HUDI-9218) Implement HoodieRecordMerger for Flink HoodieRecord

2025-03-27 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen closed HUDI-9218. Reviewers: Danny Chen Resolution: Fixed Fixed via master branch: b0ef89aaa020711574e98c511e87b0c3ab895d

(hudi) branch master updated (b0ef89aaa02 -> 906747a5bfb)

2025-03-27 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from b0ef89aaa02 [HUDI-9218] Implement HoodieRecordMerger for Flink HoodieRecord (#13040) add 906747a5bfb [HUDI-9042

Re: [PR] [HUDI-9042] Support `TimestampType.DATE_STRING` by `RowDataKeyGen` [hudi]

2025-03-27 Thread via GitHub
danny0405 merged PR #13037: URL: https://github.com/apache/hudi/pull/13037 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apac

[jira] [Updated] (HUDI-9218) Implement HoodieRecordMerger for Flink HoodieRecord

2025-03-27 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-9218: - Fix Version/s: 1.1.0 > Implement HoodieRecordMerger for Flink HoodieRecord > -

Re: [PR] [HUDI-9218] Implement HoodieRecordMerger for Flink HoodieRecord [hudi]

2025-03-27 Thread via GitHub
danny0405 merged PR #13040: URL: https://github.com/apache/hudi/pull/13040 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apac

Re: [PR] [HUDI-8855] Add bucket properties for spark bucket index query pruning [hudi]

2025-03-27 Thread via GitHub
danny0405 commented on code in PR #12614: URL: https://github.com/apache/hudi/pull/12614#discussion_r2017761242 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieFileIndex.scala: ## @@ -534,6 +535,15 @@ object HoodieFileIndex extends Logging {

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
danny0405 commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017759460 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/utils/CommitMetadataUtils.java: ## @@ -0,0 +1,258 @@ +/* + * Licensed to the Apache Software Fou

Re: [PR] [HUDI-8635] Support numWrites metric for compaction [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13047: URL: https://github.com/apache/hudi/pull/13047#issuecomment-2759886011 ## CI report: * 7b446c67365ea8073d4ee9a268bc446e6b5a0335 UNKNOWN * 81e9e5c456416b45e254407b9ac2660114124fc0 UNKNOWN * a1243dc67b0348ef481b64b635441185bf40f936 Azure: [PEND

Re: [PR] [HUDI-9238] Fix bug with HudiIncrSource when source is table version 6 [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13048: URL: https://github.com/apache/hudi/pull/13048#issuecomment-2759877877 ## CI report: * 59c0060026f24456312850ff3d96e6e66f29b31b UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

[jira] [Updated] (HUDI-9238) HudiIncrSource Fails if source table is v6 and ingestion writer is v8

2025-03-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9238: - Labels: pull-request-available (was: ) > HudiIncrSource Fails if source table is v6 and ingestion

[PR] [HUDI-9238] Fix bug with HudiIncrSource when source is table version 6 [hudi]

2025-03-27 Thread via GitHub
the-other-tim-brown opened a new pull request, #13048: URL: https://github.com/apache/hudi/pull/13048 ### Change Logs - Factor in source table version when determining which view of the table to use in HudiIncrSource - Add test coverage for source tables in v6 ### Impact

[jira] [Created] (HUDI-9238) HudiIncrSource Fails if source table is v6 and ingestion writer is v8

2025-03-27 Thread Timothy Brown (Jira)
Timothy Brown created HUDI-9238: --- Summary: HudiIncrSource Fails if source table is v6 and ingestion writer is v8 Key: HUDI-9238 URL: https://issues.apache.org/jira/browse/HUDI-9238 Project: Apache Hudi

[jira] [Updated] (HUDI-9237) RFC - Pluggable Table Format Support

2025-03-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9237: - Labels: pull-request-available (was: ) > RFC - Pluggable Table Format Support > -

Re: [PR] [HUDI-8635] Support numWrites metric for compaction [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13047: URL: https://github.com/apache/hudi/pull/13047#issuecomment-2759868032 ## CI report: * 7b446c67365ea8073d4ee9a268bc446e6b5a0335 UNKNOWN * 81e9e5c456416b45e254407b9ac2660114124fc0 UNKNOWN * a1243dc67b0348ef481b64b635441185bf40f936 UNKNOWN

Re: [PR] [HUDI-9237][RFC-92] Pluggable Table Format Support [hudi]

2025-03-27 Thread via GitHub
balaji-varadarajan-ai commented on code in PR #12998: URL: https://github.com/apache/hudi/pull/12998#discussion_r2017744800 ## rfc/README.md: ## @@ -127,3 +127,4 @@ The list of all RFCs can be found here. | 89 | [Dynamic Partition Level Bucket Index](./rfc-89/rfc-89.md)

[jira] [Closed] (HUDI-9167) Remove redundant classes in hudi-utilities-slim-bundle

2025-03-27 Thread Y Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Y Ethan Guo closed HUDI-9167. - Resolution: Fixed > Remove redundant classes in hudi-utilities-slim-bundle > -

Re: [PR] [HUDI-9237][RFC-92] Pluggable Table Format Support [hudi]

2025-03-27 Thread via GitHub
bvaradar commented on code in PR #12998: URL: https://github.com/apache/hudi/pull/12998#discussion_r2017743974 ## rfc/rfc-92/rfc-92.md: ## @@ -0,0 +1,200 @@ + +# RFC-92: Pluggable Table Formats in Hudi + +## Proposers + +* Balaji Varadarajan + +## Approvers + +* Vinoth Chand

[jira] [Assigned] (HUDI-9237) RFC - Pluggable Table Format Support

2025-03-27 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-9237: Assignee: Balaji Varadarajan > RFC - Pluggable Table Format Support > -

Re: [PR] [RFC-92] Pluggable Table Format Support [hudi]

2025-03-27 Thread via GitHub
bvaradar commented on code in PR #12998: URL: https://github.com/apache/hudi/pull/12998#discussion_r2017742533 ## rfc/rfc-92/rfc-92.md: ## @@ -0,0 +1,200 @@ + +# RFC-92: Pluggable Table Formats in Hudi + +## Proposers + +* Balaji Varadarajan + +## Approvers + +* Vinoth Chand

Re: [PR] [RFC-92] Pluggable Table Format Support [hudi]

2025-03-27 Thread via GitHub
bvaradar commented on code in PR #12998: URL: https://github.com/apache/hudi/pull/12998#discussion_r2017742217 ## rfc/rfc-92/rfc-92.md: ## @@ -0,0 +1,200 @@ + +# RFC-92: Pluggable Table Formats in Hudi + +## Proposers + +* Balaji Varadarajan + +## Approvers + +* Vinoth Chand

Re: [PR] [HUDI-9167] Remove redundant classes in hudi-utilities-slim-bundle [hudi]

2025-03-27 Thread via GitHub
yihua merged PR #12962: URL: https://github.com/apache/hudi/pull/12962 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.o

Re: [PR] [HUDI-8635] Support numWrites metric for compaction [hudi]

2025-03-27 Thread via GitHub
yihua commented on code in PR #13047: URL: https://github.com/apache/hudi/pull/13047#discussion_r2017740704 ## hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/HoodieSparkFileGroupReaderBasedMergeHandle.java: ## @@ -283,6 +283,10 @@ public List close() { wri

Re: [PR] [HUDI-9140] Fix log block io type and other rollback strategy fixes for table version 6 [hudi]

2025-03-27 Thread via GitHub
lokeshj1703 commented on code in PR #12992: URL: https://github.com/apache/hudi/pull/12992#discussion_r2003090840 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/rollback/ListingBasedRollbackStrategy.java: ## @@ -162,9 +163,13 @@ public List getRoll

Re: [PR] [HUDI-9088] Fix unnecessary scanning of target table in MERGE INTO on Spark [hudi]

2025-03-27 Thread via GitHub
yihua commented on code in PR #12933: URL: https://github.com/apache/hudi/pull/12933#discussion_r2013030497 ## hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/spark/sql/hudi/command/MergeIntoHoodieTableCommand.scala: ## @@ -659,7 +659,7 @@ case class MergeIntoHoodieTa

hudi.git: Error while running github feature from master:.asf.yaml

2025-03-27 Thread Apache Infrastructure
An error occurred while processing the github feature in .asf.yaml: GitHub discussions can only be enabled if a mailing list target exists for it. --- With regards, ASF Infra.

Re: [PR] [HUDI-9144] Flink writer for MOR table supports writing RowData … [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #12967: URL: https://github.com/apache/hudi/pull/12967#issuecomment-2742851183 ## CI report: * 27bf56019b6b5d30022131498290dd8cf1ba7b5f Azure: [FAILURE](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4301)

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
yihua commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017727266 ## hudi-spark-datasource/hudi-spark/src/test/java/org/apache/hudi/table/functional/TestHoodieSparkRollback.java: ## @@ -0,0 +1,307 @@ +/* + * Licensed to the Apache Softwa

[jira] [Closed] (HUDI-8532) Decide if CompletionTimeQuery for table version 6 needs to return begin time or state transition time

2025-03-27 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan closed HUDI-8532. - Resolution: Not A Problem > Decide if CompletionTimeQuery for table version 6 needs to ret

Re: [PR] [HUDI-8635] Support numWrites metric for compaction [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13047: URL: https://github.com/apache/hudi/pull/13047#issuecomment-2759788276 ## CI report: * 7b446c67365ea8073d4ee9a268bc446e6b5a0335 UNKNOWN * 81e9e5c456416b45e254407b9ac2660114124fc0 UNKNOWN Bot commands @hudi-bot supports the followi

[jira] [Comment Edited] (HUDI-8532) Decide if CompletionTimeQuery for table version 6 needs to return begin time or state transition time

2025-03-27 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17939093#comment-17939093 ] sivabalan narayanan edited comment on HUDI-8532 at 3/27/25 11:42 PM: ---

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
yihua commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017722269 ## hudi-spark-datasource/hudi-spark/src/test/java/org/apache/hudi/table/functional/TestHoodieSparkMergeOnReadTableRollback.java: ## @@ -242,8 +261,31 @@ void testRollbackW

[jira] [Commented] (HUDI-8532) Decide if CompletionTimeQuery for table version 6 needs to return begin time or state transition time

2025-03-27 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17939093#comment-17939093 ] sivabalan narayanan commented on HUDI-8532: --- Analyzed the usage of  ` Option g

Re: [PR] [HUDI-8635] Support numWrites metric for compaction [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13047: URL: https://github.com/apache/hudi/pull/13047#issuecomment-2759785156 ## CI report: * 7b446c67365ea8073d4ee9a268bc446e6b5a0335 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

[jira] [Updated] (HUDI-9235) Create a POC for HFile based FG reader

2025-03-27 Thread Lin Liu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lin Liu updated HUDI-9235: -- Description: We should have a poc for this task first to ensure the direction is correct, and expose potential

Re: [PR] [HUDI-9167] Remove redundant classes in hudi-utilities-slim-bundle [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #12962: URL: https://github.com/apache/hudi/pull/12962#issuecomment-2759784975 ## CI report: * 283d40d0b810bfa5a192b55ecfbfb9520f4df7f7 UNKNOWN * f9fdf2a7e79a31cac604297792de226d37c075c0 Azure: [CANCELED](https://dev.azure.com/apachehudi/a1a51da7-8592-4

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
yihua commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017718274 ## hudi-spark-datasource/hudi-spark/src/test/java/org/apache/hudi/table/functional/TestHoodieSparkMergeOnReadTableRollback.java: ## @@ -218,6 +235,8 @@ void testRollbackWi

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
yihua commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017709403 ## hudi-spark-datasource/hudi-spark/src/test/java/org/apache/hudi/client/functional/TestHoodieBackedMetadata.java: ## @@ -256,6 +259,7 @@ public void testMetadataTableBoot

[PR] [HUDI-8635] Support numWrites metric for compaction [hudi]

2025-03-27 Thread via GitHub
linliu-code opened a new pull request, #13047: URL: https://github.com/apache/hudi/pull/13047 ### Change Logs This number was not properly added since 1. it is not copied from FG reader read stats. 2. for position-based buffer, it is not collected at all. ### Impact

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
yihua commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017712196 ## hudi-spark-datasource/hudi-spark/src/test/java/org/apache/hudi/table/TestHoodieMergeOnReadTable.java: ## @@ -541,7 +541,7 @@ public void testLogBlocksCountsAfterLogComp

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
yihua commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017700535 ## hudi-client/hudi-spark-client/src/test/java/org/apache/hudi/table/functional/TestMarkerBasedRollbackStrategy.java: ## @@ -149,6 +163,65 @@ public void testMarkerBasedR

[jira] [Created] (HUDI-9237) RFC - Pluggable Table Format Support

2025-03-27 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-9237: Summary: RFC - Pluggable Table Format Support Key: HUDI-9237 URL: https://issues.apache.org/jira/browse/HUDI-9237 Project: Apache Hudi Issue Type: Ne

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
yihua commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017703585 ## hudi-common/src/main/java/org/apache/hudi/common/table/marker/MarkerOperation.java: ## @@ -34,6 +34,7 @@ public class MarkerOperation implements Serializable { // GE

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
yihua commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017706830 ## hudi-common/src/test/java/org/apache/hudi/common/testutils/FileCreateUtils.java: ## @@ -98,6 +98,22 @@ private static void createMetaFile(HoodieTableMetaClient metaCl

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
hudi-bot commented on PR #13007: URL: https://github.com/apache/hudi/pull/13007#issuecomment-2759753640 ## CI report: * b71440eb08fec4554445cc3a9ddfefc5cdb7c86a Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=4505)

Re: [PR] [HUDI-9167] Remove redundant classes in hudi-utilities-slim-bundle [hudi]

2025-03-27 Thread via GitHub
yihua commented on code in PR #12962: URL: https://github.com/apache/hudi/pull/12962#discussion_r2017691805 ## packaging/hudi-spark-bundle/pom.xml: ## @@ -90,7 +90,6 @@ org.eclipse.jetty.websocket:* org.jetbrains.kotlin:*

Re: [PR] [HUDI-9236] create marker file for every log file [hudi]

2025-03-27 Thread via GitHub
yihua commented on code in PR #13007: URL: https://github.com/apache/hudi/pull/13007#discussion_r2017696228 ## hudi-client/hudi-spark-client/src/test/java/org/apache/hudi/table/action/rollback/TestCopyOnWriteRollbackActionExecutor.java: ## @@ -106,7 +107,7 @@ public void testCo

  1   2   3   4   >