[GitHub] [hudi] hudi-bot removed a comment on pull request #4236: [HUDI-2936] Add data count checks in async clustering tests

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4236: URL: https://github.com/apache/hudi/pull/4236#issuecomment-990705653 ## CI report: * e4908379cb7faee6bdc554b0937b9a4557797eea Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4236: [HUDI-2936] Add data count checks in async clustering tests

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4236: URL: https://github.com/apache/hudi/pull/4236#issuecomment-990733015 ## CI report: * 4871c93376740dfc1d53ed7942d4eb96d8c1f0b7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot commented on pull request #4236: [HUDI-2936] Add data count checks in async clustering tests

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4236: URL: https://github.com/apache/hudi/pull/4236#issuecomment-990739337 ## CI report: * 4871c93376740dfc1d53ed7942d4eb96d8c1f0b7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4236: [HUDI-2936] Add data count checks in async clustering tests

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4236: URL: https://github.com/apache/hudi/pull/4236#issuecomment-990733015 ## CI report: * 4871c93376740dfc1d53ed7942d4eb96d8c1f0b7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4236: [HUDI-2936] Add data count checks in async clustering tests

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4236: URL: https://github.com/apache/hudi/pull/4236#issuecomment-990742533 ## CI report: * 4871c93376740dfc1d53ed7942d4eb96d8c1f0b7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4236: [HUDI-2936] Add data count checks in async clustering tests

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4236: URL: https://github.com/apache/hudi/pull/4236#issuecomment-990739337 ## CI report: * 4871c93376740dfc1d53ed7942d4eb96d8c1f0b7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] xushiyan merged pull request #4269: [HUDI-2878] enhance hudi-quick-start guide for spark-sql

2021-12-10 Thread GitBox
xushiyan merged pull request #4269: URL: https://github.com/apache/hudi/pull/4269 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr

[hudi] branch asf-site updated: [HUDI-2878] enhance hudi-quick-start guide for spark-sql (#4269)

2021-12-10 Thread xushiyan
This is an automated email from the ASF dual-hosted git repository. xushiyan pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 4b59d41 [HUDI-2878] enhance hudi-quick-star

[jira] [Closed] (HUDI-2878) Enhance hudi-quick start guide for spark-sql

2021-12-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-2878. Reviewers: Raymond Xu Resolution: Done > Enhance hudi-quick start guide for spark-sql > ---

[jira] [Updated] (HUDI-2878) Enhance hudi-quick start guide for spark-sql

2021-12-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2878: - Fix Version/s: 0.10.0 (was: 0.11.0) > Enhance hudi-quick start guide for spark-sql

[jira] [Comment Edited] (HUDI-2971) Timestamp values being corrupted when using BULK INSERT with row writing enabled

2021-12-10 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17457017#comment-17457017 ] Sagar Sumit edited comment on HUDI-2971 at 12/10/21, 10:01 AM: -

[jira] [Commented] (HUDI-2971) Timestamp values being corrupted when using BULK INSERT with row writing enabled

2021-12-10 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17457017#comment-17457017 ] Sagar Sumit commented on HUDI-2971: --- [~ryanpife] There was a [commit|https://github.com

[GitHub] [hudi] danny0405 commented on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
danny0405 commented on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-990883225 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific c

[GitHub] [hudi] hudi-bot commented on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-990885160 ## CI report: * 41efa313a197fce137e851bc129dd7a941021a8e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-990333903 ## CI report: * 41efa313a197fce137e851bc129dd7a941021a8e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[jira] [Created] (HUDI-2975) Update website docs for 0.10.0 release

2021-12-10 Thread Danny Chen (Jira)
Danny Chen created HUDI-2975: Summary: Update website docs for 0.10.0 release Key: HUDI-2975 URL: https://issues.apache.org/jira/browse/HUDI-2975 Project: Apache Hudi Issue Type: Task C

[GitHub] [hudi] hudi-bot commented on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-990917462 ## CI report: * 41efa313a197fce137e851bc129dd7a941021a8e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-990885160 ## CI report: * 41efa313a197fce137e851bc129dd7a941021a8e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] danny0405 commented on a change in pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
danny0405 commented on a change in pull request #4252: URL: https://github.com/apache/hudi/pull/4252#discussion_r766648582 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/AbstractHoodieWriteClient.java ## @@ -423,7 +423,11 @@ protected void pr

[GitHub] [hudi] danny0405 commented on a change in pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
danny0405 commented on a change in pull request #4252: URL: https://github.com/apache/hudi/pull/4252#discussion_r766648989 ## File path: hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/client/HoodieFlinkWriteClient.java ## @@ -281,7 +281,11 @@ public void initMetad

[GitHub] [hudi] danny0405 commented on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
danny0405 commented on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-990943700 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific c

[GitHub] [hudi] hudi-bot removed a comment on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-990917462 ## CI report: * 41efa313a197fce137e851bc129dd7a941021a8e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-990944726 ## CI report: * 41efa313a197fce137e851bc129dd7a941021a8e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] zhangyue19921010 commented on pull request #3887: [HUDI-2648] Retry FileSystem action instead of failed directly.

2021-12-10 Thread GitBox
zhangyue19921010 commented on pull request #3887: URL: https://github.com/apache/hudi/pull/3887#issuecomment-990963440 > sure, makes sense if there are other cloud stores that needs this retry. Can you please address the feedback given already. Sure, I will address the comments ASAP. Tha

[GitHub] [hudi] zhangyue19921010 commented on pull request #4172: [HUDI-2892][BUG]Pending Clustering may stain the ActiveTimeLine and lead to incomplete query results

2021-12-10 Thread GitBox
zhangyue19921010 commented on pull request #4172: URL: https://github.com/apache/hudi/pull/4172#issuecomment-990964789 Hi @yihua @xushiyan and @nsivabalan Thanks a lot for your attention and review. All the comments are addressed. Also azure is passed. PTAL :) -- This is an automated m

[GitHub] [hudi] zhangyue19921010 edited a comment on pull request #3887: [HUDI-2648] Retry FileSystem action instead of failed directly.

2021-12-10 Thread GitBox
zhangyue19921010 edited a comment on pull request #3887: URL: https://github.com/apache/hudi/pull/3887#issuecomment-990963440 > sure, makes sense if there are other cloud stores that needs this retry. Can you please address the feedback given already. Sure, I will address the comment

[GitHub] [hudi] hudi-bot removed a comment on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-990944726 ## CI report: * 41efa313a197fce137e851bc129dd7a941021a8e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4270: [HUDI-2811] Support Spark 3.2 and Parquet 1.12.x

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4270: URL: https://github.com/apache/hudi/pull/4270#issuecomment-990974729 ## CI report: * bcc62e5eeea6a2929e4144c00f2d0b29bcc786cd Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot commented on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-990974653 ## CI report: * 41efa313a197fce137e851bc129dd7a941021a8e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4270: [HUDI-2811] Support Spark 3.2 and Parquet 1.12.x

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4270: URL: https://github.com/apache/hudi/pull/4270#issuecomment-990681037 ## CI report: * bcc62e5eeea6a2929e4144c00f2d0b29bcc786cd Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot removed a comment on pull request #4270: [HUDI-2811] Support Spark 3.2 and Parquet 1.12.x

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4270: URL: https://github.com/apache/hudi/pull/4270#issuecomment-990974729 ## CI report: * bcc62e5eeea6a2929e4144c00f2d0b29bcc786cd Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4270: [HUDI-2811] Support Spark 3.2 and Parquet 1.12.x

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4270: URL: https://github.com/apache/hudi/pull/4270#issuecomment-990976695 ## CI report: * bcc62e5eeea6a2929e4144c00f2d0b29bcc786cd Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] danny0405 opened a new pull request #4276: [HUDI-2975] Update website docs for 0.10.0 release

2021-12-10 Thread GitBox
danny0405 opened a new pull request #4276: URL: https://github.com/apache/hudi/pull/4276 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the purpo

[jira] [Updated] (HUDI-2975) Update website docs for 0.10.0 release

2021-12-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-2975: - Labels: pull-request-available (was: ) > Update website docs for 0.10.0 release > ---

[GitHub] [hudi] danny0405 commented on pull request #4257: [HUDI-2921] - Docs organization for Flink quickstart

2021-12-10 Thread GitBox
danny0405 commented on pull request #4257: URL: https://github.com/apache/hudi/pull/4257#issuecomment-990997176 Thanks for the fix but can we fix the broken notes of the pages ? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [hudi] hudi-bot commented on pull request #4270: [HUDI-2811] Support Spark 3.2 and Parquet 1.12.x

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4270: URL: https://github.com/apache/hudi/pull/4270#issuecomment-991008132 ## CI report: * 12c1b3c30684dde5c870fe4c26d2992dc9a9b495 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4270: [HUDI-2811] Support Spark 3.2 and Parquet 1.12.x

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4270: URL: https://github.com/apache/hudi/pull/4270#issuecomment-990976695 ## CI report: * bcc62e5eeea6a2929e4144c00f2d0b29bcc786cd Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] nsivabalan merged pull request #4236: [HUDI-2936] Add data count checks in async clustering tests

2021-12-10 Thread GitBox
nsivabalan merged pull request #4236: URL: https://github.com/apache/hudi/pull/4236 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubs

[hudi] branch master updated (456d74c -> c7473a7)

2021-12-10 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 456d74c [HUDI-2901] Fixed the bug clustering jobs cannot running in parallel (#4178) add c7473a7 [HUDI-2936]

[GitHub] [hudi] nsivabalan commented on pull request #4243: [HUDI-2952] Fixing metadata table for non-partitioned dataset

2021-12-10 Thread GitBox
nsivabalan commented on pull request #4243: URL: https://github.com/apache/hudi/pull/4243#issuecomment-991046551 @hudi-bot azure run -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] nsivabalan commented on pull request #4166: [MINOR] Adding verbose output for metadata validate files command

2021-12-10 Thread GitBox
nsivabalan commented on pull request #4166: URL: https://github.com/apache/hudi/pull/4166#issuecomment-991047647 @hudi-bot azure run -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] hudi-bot removed a comment on pull request #4243: [HUDI-2952] Fixing metadata table for non-partitioned dataset

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4243: URL: https://github.com/apache/hudi/pull/4243#issuecomment-990378500 ## CI report: * d048e61abb71819411453c90ef69acbc78ea8b3f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4243: [HUDI-2952] Fixing metadata table for non-partitioned dataset

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4243: URL: https://github.com/apache/hudi/pull/4243#issuecomment-991048108 ## CI report: * d048e61abb71819411453c90ef69acbc78ea8b3f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] nsivabalan commented on issue #4146: [SUPPORT] Deltastreamer commits with a custom checkpoint configuration are skipped if the generated checkpoint matches the previous commit checkpoi

2021-12-10 Thread GitBox
nsivabalan commented on issue #4146: URL: https://github.com/apache/hudi/issues/4146#issuecomment-991049417 I plan to close this out as epoch times work. Please re-open if need be. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to Gi

[GitHub] [hudi] nsivabalan closed issue #4146: [SUPPORT] Deltastreamer commits with a custom checkpoint configuration are skipped if the generated checkpoint matches the previous commit checkpoint

2021-12-10 Thread GitBox
nsivabalan closed issue #4146: URL: https://github.com/apache/hudi/issues/4146 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...

[GitHub] [hudi] hudi-bot commented on pull request #4166: [MINOR] Adding verbose output for metadata validate files command

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4166: URL: https://github.com/apache/hudi/pull/4166#issuecomment-991050227 ## CI report: * a14277e707c099478099e1f63d0d46e0d7295c68 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4166: [MINOR] Adding verbose output for metadata validate files command

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4166: URL: https://github.com/apache/hudi/pull/4166#issuecomment-990406408 ## CI report: * a14277e707c099478099e1f63d0d46e0d7295c68 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot removed a comment on pull request #4243: [HUDI-2952] Fixing metadata table for non-partitioned dataset

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4243: URL: https://github.com/apache/hudi/pull/4243#issuecomment-991048108 ## CI report: * d048e61abb71819411453c90ef69acbc78ea8b3f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4243: [HUDI-2952] Fixing metadata table for non-partitioned dataset

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4243: URL: https://github.com/apache/hudi/pull/4243#issuecomment-991050375 ## CI report: * d048e61abb71819411453c90ef69acbc78ea8b3f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] nsivabalan commented on issue #4242: [SUPPORT] Split Data into Multiple Parquet files under Partitions

2021-12-10 Thread GitBox
nsivabalan commented on issue #4242: URL: https://github.com/apache/hudi/issues/4242#issuecomment-991051340 @codope : can you chime in here please. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [hudi] hudi-bot removed a comment on pull request #4166: [MINOR] Adding verbose output for metadata validate files command

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4166: URL: https://github.com/apache/hudi/pull/4166#issuecomment-991050227 ## CI report: * a14277e707c099478099e1f63d0d46e0d7295c68 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4166: [MINOR] Adding verbose output for metadata validate files command

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4166: URL: https://github.com/apache/hudi/pull/4166#issuecomment-991052624 ## CI report: * a14277e707c099478099e1f63d0d46e0d7295c68 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] leesf merged pull request #4222: [HUDI-2849] improve SparkUI job description for write path

2021-12-10 Thread GitBox
leesf merged pull request #4222: URL: https://github.com/apache/hudi/pull/4222 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...

[hudi] branch master updated (c7473a7 -> f194566)

2021-12-10 Thread leesf
This is an automated email from the ASF dual-hosted git repository. leesf pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from c7473a7 [HUDI-2936] Add data count checks in async clustering tests (#4236) add f194566 [HUDI-2849] Improve Spar

[GitHub] [hudi] hudi-bot removed a comment on pull request #4243: [HUDI-2952] Fixing metadata table for non-partitioned dataset

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4243: URL: https://github.com/apache/hudi/pull/4243#issuecomment-991050375 ## CI report: * d048e61abb71819411453c90ef69acbc78ea8b3f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4243: [HUDI-2952] Fixing metadata table for non-partitioned dataset

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4243: URL: https://github.com/apache/hudi/pull/4243#issuecomment-991083494 ## CI report: * 4fdb574a68202e20b42f6efc4d10007c7ce78fa8 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] codope commented on issue #4242: [SUPPORT] Split Data into Multiple Parquet files under Partitions

2021-12-10 Thread GitBox
codope commented on issue #4242: URL: https://github.com/apache/hudi/issues/4242#issuecomment-991089546 @Rap70r These two blogs should help in understanding clustering in Hudi: * [Clustering intro](https://hudi.apache.org/blog/2021/01/27/hudi-clustering-intro/) * [Async clustering](h

[GitHub] [hudi] danny0405 opened a new pull request #4277: [HUDI-2976] Add Hudi 0.10.0 release page with highlights

2021-12-10 Thread GitBox
danny0405 opened a new pull request #4277: URL: https://github.com/apache/hudi/pull/4277 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the purpo

[jira] [Updated] (HUDI-2976) Add Hudi 0.10.0 release page with highlights

2021-12-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-2976: - Labels: pull-request-available (was: ) > Add Hudi 0.10.0 release page with highlights > -

[jira] [Updated] (HUDI-2971) Timestamp values being corrupted when using BULK INSERT with row writing enabled

2021-12-10 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-2971: -- Priority: Blocker (was: Major) > Timestamp values being corrupted when using BULK INSERT with row writi

[jira] [Assigned] (HUDI-2971) Timestamp values being corrupted when using BULK INSERT with row writing enabled

2021-12-10 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit reassigned HUDI-2971: - Assignee: Sagar Sumit > Timestamp values being corrupted when using BULK INSERT with row writing

[GitHub] [hudi] hudi-bot removed a comment on pull request #4166: [MINOR] Adding verbose output for metadata validate files command

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4166: URL: https://github.com/apache/hudi/pull/4166#issuecomment-991052624 ## CI report: * a14277e707c099478099e1f63d0d46e0d7295c68 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4166: [MINOR] Adding verbose output for metadata validate files command

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4166: URL: https://github.com/apache/hudi/pull/4166#issuecomment-991097352 ## CI report: * 8dfd3ac96717870a8be8310fea1272a33fc7d7a4 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] danny0405 commented on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
danny0405 commented on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-991099989 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific c

[GitHub] [hudi] hudi-bot removed a comment on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-990974653 ## CI report: * 41efa313a197fce137e851bc129dd7a941021a8e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4252: [HUDI-2959] Fix the thread leak of cleaning service

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4252: URL: https://github.com/apache/hudi/pull/4252#issuecomment-991100102 ## CI report: * 41efa313a197fce137e851bc129dd7a941021a8e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] nsivabalan merged pull request #4243: [HUDI-2952] Fixing metadata table for non-partitioned dataset

2021-12-10 Thread GitBox
nsivabalan merged pull request #4243: URL: https://github.com/apache/hudi/pull/4243 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubs

[jira] [Resolved] (HUDI-2952) Metadata table compaction fails for non partitioned dataset

2021-12-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2952. --- > Metadata table compaction fails for non partitioned dataset > --

[hudi] branch master updated (f194566 -> be36826)

2021-12-10 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from f194566 [HUDI-2849] Improve SparkUI job description for write path (#4222) add be36826 [HUDI-2952] Fixing me

[jira] [Updated] (HUDI-1834) Please delete old releases from mirroring system

2021-12-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1834: -- Priority: Critical (was: Blocker) > Please delete old releases from mirroring system >

[jira] [Commented] (HUDI-1834) Please delete old releases from mirroring system

2021-12-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17457242#comment-17457242 ] sivabalan narayanan commented on HUDI-1834: --- Sure. will follow on that.  > Plea

[jira] [Updated] (HUDI-2973) Rewrite/re-publish RFC for Data skipping index

2021-12-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2973: -- Fix Version/s: 0.11.0 > Rewrite/re-publish RFC for Data skipping index > ---

[jira] [Updated] (HUDI-2973) Rewrite/re-publish RFC for Data skipping index

2021-12-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2973: -- Priority: Blocker (was: Major) > Rewrite/re-publish RFC for Data skipping index > -

[GitHub] [hudi] nikita-sheremet-clearscale commented on issue #4062: [SUPPORT] How debug hive sync?

2021-12-10 Thread GitBox
nikita-sheremet-clearscale commented on issue #4062: URL: https://github.com/apache/hudi/issues/4062#issuecomment-991112752 @xushiyan Many thanks for reply! It turned out that logs appeared in intecase log not in step logs in EMR. E.g. to see logstatement you have to open EC2 isntace

[GitHub] [hudi] nikita-sheremet-clearscale closed issue #4062: [SUPPORT] How debug hive sync?

2021-12-10 Thread GitBox
nikita-sheremet-clearscale closed issue #4062: URL: https://github.com/apache/hudi/issues/4062 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: com

[jira] [Updated] (HUDI-2821) Docs for Hudi Metadata

2021-12-10 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2821: -- Status: Resolved (was: Patch Available) > Docs for Hudi Metadata > -- > >

[jira] [Updated] (HUDI-2823) Docs for checkpointing

2021-12-10 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2823: -- Status: Resolved (was: Patch Available) > Docs for checkpointing > -- > >

[jira] [Updated] (HUDI-2922) Improve Use Cases

2021-12-10 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2922: -- Status: Resolved (was: Patch Available) > Improve Use Cases > - > > Key

[jira] [Updated] (HUDI-2827) Docs for schema provider

2021-12-10 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2827: -- Status: Resolved (was: Patch Available) > Docs for schema provider > > >

[jira] [Updated] (HUDI-2829) Document PublicAPIs

2021-12-10 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2829: -- Status: Resolved (was: Patch Available) > Document PublicAPIs > --- > >

[jira] [Updated] (HUDI-2828) Add bring your own key generator docs

2021-12-10 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2828: -- Status: Resolved (was: Patch Available) > Add bring your own key generator docs > -

[jira] [Updated] (HUDI-2921) Shorten Flink QuickStart

2021-12-10 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2921: -- Status: Resolved (was: Patch Available) > Shorten Flink QuickStart > > >

[jira] [Updated] (HUDI-2956) Improve Write docs

2021-12-10 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2956: -- Status: Resolved (was: Patch Available) > Improve Write docs > -- > > K

[jira] [Updated] (HUDI-2825) Docs for markers

2021-12-10 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2825: -- Status: Resolved (was: Patch Available) > Docs for markers > > > Key:

[jira] [Commented] (HUDI-1258) Small file handling Merges can be handled without actual merging

2021-12-10 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17457244#comment-17457244 ] Raymond Xu commented on HUDI-1258: -- [~vinoth] After going through the code, had some idea

[jira] [Updated] (HUDI-2936) Add data correctness tests for async clustering

2021-12-10 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-2936: -- Fix Version/s: 0.11.0 > Add data correctness tests for async clustering > --

[jira] [Updated] (HUDI-2936) Add data correctness tests for async clustering

2021-12-10 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-2936: -- Priority: Blocker (was: Major) > Add data correctness tests for async clustering >

[GitHub] [hudi] yihua opened a new pull request #4278: [HUDI-2906] Add a repair util to clean up dangling data and log files

2021-12-10 Thread GitBox
yihua opened a new pull request #4278: URL: https://github.com/apache/hudi/pull/4278 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the purpose o

[jira] [Updated] (HUDI-2906) Add a repair command to clean up duplicate/uncommitted data files in a table

2021-12-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-2906: - Labels: pull-request-available (was: ) > Add a repair command to clean up duplicate/uncommitted d

[jira] [Updated] (HUDI-2906) Add a repair command to clean up duplicate/uncommitted data files in a table

2021-12-10 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-2906: Fix Version/s: 0.11.0 > Add a repair command to clean up duplicate/uncommitted data files in a table > -

[jira] [Updated] (HUDI-2906) Add a repair command to clean up duplicate/uncommitted data files in a table

2021-12-10 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-2906: Priority: Blocker (was: Major) > Add a repair command to clean up duplicate/uncommitted data files in a tab

[jira] [Commented] (HUDI-1856) Upstream changes made in PrestoDB to eliminate file listing to Trino

2021-12-10 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17457247#comment-17457247 ] Sagar Sumit commented on HUDI-1856: --- Closing it in favour of HUDI-2687 The changes to s

[jira] [Updated] (HUDI-1856) Upstream changes made in PrestoDB to eliminate file listing to Trino

2021-12-10 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-1856: -- Status: Resolved (was: Patch Available) > Upstream changes made in PrestoDB to eliminate file listing t

[GitHub] [hudi] hudi-bot commented on pull request #4278: [HUDI-2906] Add a repair util to clean up dangling data and log files

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4278: URL: https://github.com/apache/hudi/pull/4278#issuecomment-991120863 ## CI report: * bfa6aa245ac894cf9187ca73e741daed595cc27c UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` r

[jira] [Resolved] (HUDI-2936) Add data correctness tests for async clustering

2021-12-10 Thread Rajesh Mahindra (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Mahindra resolved HUDI-2936. --- > Add data correctness tests for async clustering > --

[GitHub] [hudi] hudi-bot removed a comment on pull request #4278: [HUDI-2906] Add a repair util to clean up dangling data and log files

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4278: URL: https://github.com/apache/hudi/pull/4278#issuecomment-991120863 ## CI report: * bfa6aa245ac894cf9187ca73e741daed595cc27c UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run

[GitHub] [hudi] hudi-bot commented on pull request #4278: [HUDI-2906] Add a repair util to clean up dangling data and log files

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4278: URL: https://github.com/apache/hudi/pull/4278#issuecomment-991123083 ## CI report: * bfa6aa245ac894cf9187ca73e741daed595cc27c Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4278: [HUDI-2906] Add a repair util to clean up dangling data and log files

2021-12-10 Thread GitBox
hudi-bot removed a comment on pull request #4278: URL: https://github.com/apache/hudi/pull/4278#issuecomment-991123083 ## CI report: * bfa6aa245ac894cf9187ca73e741daed595cc27c Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4278: [HUDI-2906] Add a repair util to clean up dangling data and log files

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4278: URL: https://github.com/apache/hudi/pull/4278#issuecomment-991127663 ## CI report: * bfa6aa245ac894cf9187ca73e741daed595cc27c Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot commented on pull request #4278: [HUDI-2906] Add a repair util to clean up dangling data and log files

2021-12-10 Thread GitBox
hudi-bot commented on pull request #4278: URL: https://github.com/apache/hudi/pull/4278#issuecomment-991129712 ## CI report: * bfa6aa245ac894cf9187ca73e741daed595cc27c Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

  1   2   3   4   >