[GitHub] [hudi] hudi-bot removed a comment on pull request #4065: [HUDI-2817] Sync the configuration inference for HoodieFlinkStreamer

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4065: URL: https://github.com/apache/hudi/pull/4065#issuecomment-977589109 ## CI report: * 1059b75d6c57334f46dfcb4a0c4fc3ee8c815b02 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4065: [HUDI-2817] Sync the configuration inference for HoodieFlinkStreamer

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4065: URL: https://github.com/apache/hudi/pull/4065#issuecomment-977617562 ## CI report: * 7b4e1a1a27e962b920e6801aa205c5a45d89ef91 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] boneanxs commented on pull request #4014: [HUDI-2779] Cache BaseDir if HudiTableNotFound Exception thrown

2021-11-23 Thread GitBox
boneanxs commented on pull request #4014: URL: https://github.com/apache/hudi/pull/4014#issuecomment-977610252 > @boneanxs Thanks for this contribution. I have a question. What happens for hidden files and directories? Will it not return true for them if we only check the basePath?

[GitHub] [hudi] boneanxs commented on a change in pull request #4014: [HUDI-2779] Cache BaseDir if HudiTableNotFound Exception thrown

2021-11-23 Thread GitBox
boneanxs commented on a change in pull request #4014: URL: https://github.com/apache/hudi/pull/4014#discussion_r755766928 ## File path: hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/HoodieROTablePathFilter.java ## @@ -173,6 +173,13 @@ public boolean accept(Path path) {

[GitHub] [hudi] hudi-bot removed a comment on pull request #4100: When I run hudi-cli.sh using hadoop 3.2.1 , this is a error about guava class conflict

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4100: URL: https://github.com/apache/hudi/pull/4100#issuecomment-977600511 ## CI report: * 7e5a547b09e20b518c10fbeffe44ffc0653adb13 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run

[GitHub] [hudi] hudi-bot commented on pull request #4100: When I run hudi-cli.sh using hadoop 3.2.1 , this is a error about guava class conflict

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4100: URL: https://github.com/apache/hudi/pull/4100#issuecomment-977602198 ## CI report: * 7e5a547b09e20b518c10fbeffe44ffc0653adb13 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot commented on pull request #4096: [HUDI-2847] Flink metadata table supports virtual keys

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4096: URL: https://github.com/apache/hudi/pull/4096#issuecomment-977602180 ## CI report: * 621d58bd83a6cc417c1f7606123c768d9a73e988 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4096: [HUDI-2847] Flink metadata table supports virtual keys

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4096: URL: https://github.com/apache/hudi/pull/4096#issuecomment-977582358 ## CI report: * 621d58bd83a6cc417c1f7606123c768d9a73e988 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[jira] [Created] (HUDI-2848) When I run hudi-cli.sh using hadoop 3.2.1, there is a error about class conflict

2021-11-23 Thread KaiXinXIaoLei (Jira)
KaiXinXIaoLei created HUDI-2848: --- Summary: When I run hudi-cli.sh using hadoop 3.2.1, there is a error about class conflict Key: HUDI-2848 URL: https://issues.apache.org/jira/browse/HUDI-2848 Project: A

[GitHub] [hudi] huleilei closed issue #4099: [SUPPORT]When I run hudi-cli.sh using hadoop 3.2.1, there is a error about class conflict

2021-11-23 Thread GitBox
huleilei closed issue #4099: URL: https://github.com/apache/hudi/issues/4099 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@h

[GitHub] [hudi] danny0405 commented on pull request #4096: [HUDI-2847] Flink metadata table supports virtual keys

2021-11-23 Thread GitBox
danny0405 commented on pull request #4096: URL: https://github.com/apache/hudi/pull/4096#issuecomment-977600845 @hudi-bot run azure

[GitHub] [hudi] hudi-bot commented on pull request #4100: When I run hudi-cli.sh using hadoop 3.2.1 , this is a error about guava class conflict

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4100: URL: https://github.com/apache/hudi/pull/4100#issuecomment-977600511 ## CI report: * 7e5a547b09e20b518c10fbeffe44ffc0653adb13 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` r

[GitHub] [hudi] hudi-bot commented on pull request #3889: [HUDI-2443] Hudi KVComparator for all HFile writer usages

2021-11-23 Thread GitBox
hudi-bot commented on pull request #3889: URL: https://github.com/apache/hudi/pull/3889#issuecomment-977600225 ## CI report: * c934b7ef0f4654b9258f82aff03b27910fcf4573 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #3889: [HUDI-2443] Hudi KVComparator for all HFile writer usages

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #3889: URL: https://github.com/apache/hudi/pull/3889#issuecomment-977563513 ## CI report: * 4fb8df69496faf0431e8222b365ca77c7a33e3a0 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] huleilei opened a new pull request #4100: I run hudi-cli.sh using hadoop 3.2.1 , this is a error about guava class conflict

2021-11-23 Thread GitBox
huleilei opened a new pull request #4100: URL: https://github.com/apache/hudi/pull/4100 ## What is the purpose of the pull request When hadoop.version is 3.2.1 and spark.version is 3.1.1, running hudi-cli.sh produces an error: ![image](https://user-images.githubusercontent.com/9431947/14319

[GitHub] [hudi] xiarixiaoyao commented on issue #3981: [SUPPORT] If the HUDI table(MOR) contains only log files, the Spark Datasource cannot obtain data in snapshot mode

2021-11-23 Thread GitBox
xiarixiaoyao commented on issue #3981: URL: https://github.com/apache/hudi/issues/3981#issuecomment-977595302 @JoshuaZhuCN Spark should support reading a pure log table. When you specify the load path, do not use wildcards; just specify the path at the table level. load("hdfs://localhost
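
The advice in the comment above (pass the table-level path, with no wildcards, to `load`) can be sketched as a small path helper. This is only an illustration under stated assumptions: `table_base_path` is a hypothetical function, not part of Hudi or Spark, and the HDFS host and table path in the usage note are made up. It removes trailing glob segments only and leaves the rest of the path untouched.

```python
def table_base_path(load_path: str) -> str:
    """Drop trailing glob segments so the path points at the table root.

    Hypothetical helper for illustration; not a Hudi or Spark API.
    """
    # Split off an optional URI scheme such as "hdfs" so "://" is preserved.
    scheme, sep, rest = load_path.partition("://")
    if not sep:
        # No scheme present: treat the whole string as the path.
        rest = load_path

    segments = rest.rstrip("/").split("/")

    # Remove trailing segments that contain glob metacharacters.
    while segments and any(ch in segments[-1] for ch in "*?[{"):
        segments.pop()

    trimmed = "/".join(segments)
    return f"{scheme}{sep}{trimmed}" if sep else trimmed


if __name__ == "__main__":
    # Assumed example path; the real table location will differ.
    print(table_base_path("hdfs://namenode:8020/warehouse/orders/*/*"))
```

With a Spark session available, the resulting table-level path would then be handed to the reader, for example `spark.read.format("hudi").load(table_base_path(path))`; whether a log-only MOR table is readable this way depends on the Hudi version, as discussed in the issue.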

[GitHub] [hudi] huleilei opened a new issue #4099: [SUPPORT] The guava version should be 27.

2021-11-23 Thread GitBox
huleilei opened a new issue #4099: URL: https://github.com/apache/hudi/issues/4099 When hadoop.version is 3.2.1 and spark.version is 3.1.1, running `hudi-cli.sh` produces an error: ![image](https://user-images.githubusercontent.com/9431947/143191082-2cb6cd1f-d3e3-48c2-bd9f-61b88308d82d.png

[GitHub] [hudi] hudi-bot removed a comment on pull request #4093: [HUDI-2766] Cluster update strategy should not be fenced by write config

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4093: URL: https://github.com/apache/hudi/pull/4093#issuecomment-977555475 ## CI report: * 77989b656e77ca91ddb19972dad197e229b98a52 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4093: [HUDI-2766] Cluster update strategy should not be fenced by write config

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4093: URL: https://github.com/apache/hudi/pull/4093#issuecomment-977592359 ## CI report: * f12a5d8437ffaf61a332078232f6444e98d60275 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4065: [HUDI-2817] Sync the configuration inference for HoodieFlinkStreamer

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4065: URL: https://github.com/apache/hudi/pull/4065#issuecomment-977587469 ## CI report: * 1059b75d6c57334f46dfcb4a0c4fc3ee8c815b02 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4065: [HUDI-2817] Sync the configuration inference for HoodieFlinkStreamer

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4065: URL: https://github.com/apache/hudi/pull/4065#issuecomment-977589109 ## CI report: * 1059b75d6c57334f46dfcb4a0c4fc3ee8c815b02 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4065: [HUDI-2817] Sync the configuration inference for HoodieFlinkStreamer

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4065: URL: https://github.com/apache/hudi/pull/4065#issuecomment-976238479 ## CI report: * 1059b75d6c57334f46dfcb4a0c4fc3ee8c815b02 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4065: [HUDI-2817] Sync the configuration inference for HoodieFlinkStreamer

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4065: URL: https://github.com/apache/hudi/pull/4065#issuecomment-977587469 ## CI report: * 1059b75d6c57334f46dfcb4a0c4fc3ee8c815b02 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] yuzhaojing commented on a change in pull request #4065: [HUDI-2817] Sync the configuration inference for HoodieFlinkStreamer

2021-11-23 Thread GitBox
yuzhaojing commented on a change in pull request #4065: URL: https://github.com/apache/hudi/pull/4065#discussion_r755748144 ## File path: hudi-flink/src/main/java/org/apache/hudi/streamer/HoodieFlinkStreamer.java ## @@ -97,8 +99,21 @@ public static void main(String[] args) thr

[jira] [Assigned] (HUDI-2825) Docs for markers

2021-11-23 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller reassigned HUDI-2825: - Assignee: Kyle Weller > Docs for markers > > > Key: HUDI-2825 >

[jira] [Updated] (HUDI-2825) Docs for markers

2021-11-23 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2825: -- Priority: Blocker (was: Major) > Docs for markers > > > Key: HUDI-2825

[jira] [Updated] (HUDI-2825) Docs for markers

2021-11-23 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2825: -- Fix Version/s: 0.10.0 > Docs for markers > > > Key: HUDI-2825 >

[GitHub] [hudi] prashantwason commented on a change in pull request #4067: [WIP] [HUDI-2763] Metadata table records storage reduction

2021-11-23 Thread GitBox
prashantwason commented on a change in pull request #4067: URL: https://github.com/apache/hudi/pull/4067#discussion_r755744509 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/log/block/HoodieHFileDataBlock.java ## @@ -162,6 +167,18 @@ protected void create

[GitHub] [hudi] prashantwason commented on pull request #4067: [WIP] [HUDI-2763] Metadata table records storage reduction

2021-11-23 Thread GitBox
prashantwason commented on pull request #4067: URL: https://github.com/apache/hudi/pull/4067#issuecomment-977583545 Concept looks good. But why introduce a new block type and not do it for the HoodieHFileDataBlock itself? When the HFile format is used, whether for Metadata Table or e

[GitHub] [hudi] hudi-bot commented on pull request #4096: [HUDI-2847] Flink metadata table supports virtual keys

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4096: URL: https://github.com/apache/hudi/pull/4096#issuecomment-977582358 ## CI report: * 621d58bd83a6cc417c1f7606123c768d9a73e988 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4096: [HUDI-2847] Flink metadata table supports virtual keys

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4096: URL: https://github.com/apache/hudi/pull/4096#issuecomment-977542873 ## CI report: * 621d58bd83a6cc417c1f7606123c768d9a73e988 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] xiarixiaoyao commented on pull request #3865: [HUDI-2005][WIP] Removing direct fs call in HoodieLogFileReader

2021-11-23 Thread GitBox
xiarixiaoyao commented on pull request #3865: URL: https://github.com/apache/hudi/pull/3865#issuecomment-977581730 @nsivabalan I think I have found why the UT failed. Please add this code: **writeStat.setFileSizeInBytes(new File(new Path(basePath.toString(), filePath).toString())

[GitHub] [hudi] danny0405 commented on a change in pull request #4065: [HUDI-2817] Sync the configuration inference for HoodieFlinkStreamer

2021-11-23 Thread GitBox
danny0405 commented on a change in pull request #4065: URL: https://github.com/apache/hudi/pull/4065#discussion_r755743573 ## File path: hudi-flink/src/main/java/org/apache/hudi/streamer/HoodieFlinkStreamer.java ## @@ -97,8 +99,21 @@ public static void main(String[] args) thro

[GitHub] [hudi] prashantwason commented on a change in pull request #4067: [WIP] [HUDI-2763] Metadata table records storage reduction

2021-11-23 Thread GitBox
prashantwason commented on a change in pull request #4067: URL: https://github.com/apache/hudi/pull/4067#discussion_r755742592 ## File path: hudi-common/src/main/avro/HoodieMetadata.avsc ## @@ -23,7 +23,10 @@ "fields": [ { "name": "key", -

[GitHub] [hudi] danny0405 commented on a change in pull request #4037: [HUDI-2794] Guarding table service commits within a single lock to commit to both data table and metadata table

2021-11-23 Thread GitBox
danny0405 commented on a change in pull request #4037: URL: https://github.com/apache/hudi/pull/4037#discussion_r755741499 ## File path: hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/client/SparkRDDWriteClient.java ## @@ -386,9 +396,13 @@ private void completeCl

[jira] [Updated] (HUDI-2727) Add prometheus reporter docs

2021-11-23 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2727: -- Status: Patch Available (was: In Progress) > Add prometheus reporter docs > ---

[jira] [Updated] (HUDI-2727) Add prometheus reporter docs

2021-11-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-2727: - Labels: pull-request-available (was: ) > Add prometheus reporter docs > -

[GitHub] [hudi] kywe665 opened a new pull request #4098: [HUDI-2727] - Docs for Prometheus metrics reporter

2021-11-23 Thread GitBox
kywe665 opened a new pull request #4098: URL: https://github.com/apache/hudi/pull/4098 ## What is the purpose of the pull request Docs for Prometheus metrics reporter ## Brief change log - Added Docs for Prometheus metrics reporter ## Verify this pull request

[GitHub] [hudi] hudi-bot removed a comment on pull request #4083: [HUDI-2837] The original hoodie.table.name should be maintained in Spark SQL

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4083: URL: https://github.com/apache/hudi/pull/4083#issuecomment-977532613 ## CI report: * ac5c1d542215d7893b0546e7158d4afb2fd79ea5 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[GitHub] [hudi] hudi-bot commented on pull request #4083: [HUDI-2837] The original hoodie.table.name should be maintained in Spark SQL

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4083: URL: https://github.com/apache/hudi/pull/4083#issuecomment-977568517 ## CI report: * cf2f24cc1dda0fc070310dc25c831838c51f1777 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #3889: [HUDI-2443] Hudi KVComparator for all HFile writer usages

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #3889: URL: https://github.com/apache/hudi/pull/3889#issuecomment-977562233 ## CI report: * 4fb8df69496faf0431e8222b365ca77c7a33e3a0 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #3889: [HUDI-2443] Hudi KVComparator for all HFile writer usages

2021-11-23 Thread GitBox
hudi-bot commented on pull request #3889: URL: https://github.com/apache/hudi/pull/3889#issuecomment-977563513 ## CI report: * 4fb8df69496faf0431e8222b365ca77c7a33e3a0 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #3889: [HUDI-2443] Hudi KVComparator for all HFile writer usages

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #3889: URL: https://github.com/apache/hudi/pull/3889#issuecomment-976556941 ## CI report: * 4fb8df69496faf0431e8222b365ca77c7a33e3a0 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #3889: [HUDI-2443] Hudi KVComparator for all HFile writer usages

2021-11-23 Thread GitBox
hudi-bot commented on pull request #3889: URL: https://github.com/apache/hudi/pull/3889#issuecomment-977562233 ## CI report: * 4fb8df69496faf0431e8222b365ca77c7a33e3a0 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot commented on pull request #4093: [HUDI-2766] Cluster update strategy should not be fenced by write config

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4093: URL: https://github.com/apache/hudi/pull/4093#issuecomment-977555475 ## CI report: * 77989b656e77ca91ddb19972dad197e229b98a52 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4093: [HUDI-2766] Cluster update strategy should not be fenced by write config

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4093: URL: https://github.com/apache/hudi/pull/4093#issuecomment-977554333 ## CI report: * 77989b656e77ca91ddb19972dad197e229b98a52 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot removed a comment on pull request #4093: [HUDI-2766] Cluster update strategy should not be fenced by write config

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4093: URL: https://github.com/apache/hudi/pull/4093#issuecomment-977526285 ## CI report: * 77989b656e77ca91ddb19972dad197e229b98a52 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4093: [HUDI-2766] Cluster update strategy should not be fenced by write config

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4093: URL: https://github.com/apache/hudi/pull/4093#issuecomment-977554333 ## CI report: * 77989b656e77ca91ddb19972dad197e229b98a52 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[jira] [Closed] (HUDI-2630) Add undocumented features

2021-11-23 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller closed HUDI-2630. - Resolution: Duplicate this was broken down into other tasks > Add undocumented features > ---

[jira] [Commented] (HUDI-2728) Clean up concepts and consolidate from cwiki

2021-11-23 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17448371#comment-17448371 ] Kyle Weller commented on HUDI-2728: --- covered by: https://github.com/apache/hudi/pull/407

[jira] [Updated] (HUDI-2728) Clean up concepts and consolidate from cwiki

2021-11-23 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2728: -- Status: Patch Available (was: In Progress) > Clean up concepts and consolidate from cwiki > ---

[GitHub] [hudi] kywe665 commented on pull request #4097: [WIP] - [HUDI-2806] - Docs for Transformer Utilities

2021-11-23 Thread GitBox
kywe665 commented on pull request #4097: URL: https://github.com/apache/hudi/pull/4097#issuecomment-977551479 @bhasudha let's collab on code samples for this.

[jira] [Updated] (HUDI-2806) Add docs for HoodieTransformer utilities

2021-11-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-2806: - Labels: pull-request-available (was: ) > Add docs for HoodieTransformer utilities > -

[GitHub] [hudi] kywe665 opened a new pull request #4097: [WIP] - [HUDI-2806] - Docs for Transformer Utilities

2021-11-23 Thread GitBox
kywe665 opened a new pull request #4097: URL: https://github.com/apache/hudi/pull/4097 ## What is the purpose of the pull request Adding basic documentation for the Hoodie Transformer Utilities ## Brief change log - Added transforms.md ## Verify this pull reques

[GitHub] [hudi] hudi-bot commented on pull request #4092: [HUDI-2845] Metadata CLI - files/partition file listing fix and new validate option

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4092: URL: https://github.com/apache/hudi/pull/4092#issuecomment-977549187 ## CI report: * de6987e729fcaa6329282370984ae448ce3aa46d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4092: [HUDI-2845] Metadata CLI - files/partition file listing fix and new validate option

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4092: URL: https://github.com/apache/hudi/pull/4092#issuecomment-977471123 ## CI report: * 50ca5024cec7e9b2a0eb0139685e7515c3b651bb Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[GitHub] [hudi] hudi-bot removed a comment on pull request #4096: [HUDI-2847] Flink metadata table supports virtual keys

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4096: URL: https://github.com/apache/hudi/pull/4096#issuecomment-977541684 ## CI report: * 621d58bd83a6cc417c1f7606123c768d9a73e988 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run

[GitHub] [hudi] hudi-bot commented on pull request #4096: [HUDI-2847] Flink metadata table supports virtual keys

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4096: URL: https://github.com/apache/hudi/pull/4096#issuecomment-977542873 ## CI report: * 621d58bd83a6cc417c1f7606123c768d9a73e988 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4079: [HUDI-2834] Validate against supported hive versions

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4079: URL: https://github.com/apache/hudi/pull/4079#issuecomment-977465905 ## CI report: * 3d7bf7f2e4e40da8495a5ffe56241b1c64c5e0eb UNKNOWN * c29be65367748049bb1e96b71723386357c52749 Azure: [FAILURE](https://dev.azure.com/apache-hudi

[GitHub] [hudi] hudi-bot commented on pull request #4096: [HUDI-2847] Flink metadata table supports virtual keys

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4096: URL: https://github.com/apache/hudi/pull/4096#issuecomment-977541684 ## CI report: * 621d58bd83a6cc417c1f7606123c768d9a73e988 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` r

[GitHub] [hudi] hudi-bot commented on pull request #4079: [HUDI-2834] Validate against supported hive versions

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4079: URL: https://github.com/apache/hudi/pull/4079#issuecomment-977541630 ## CI report: * 3d7bf7f2e4e40da8495a5ffe56241b1c64c5e0eb UNKNOWN * 23f7351eed382422b0e057a096b8ec39fb61e4c7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/

[jira] [Updated] (HUDI-2847) Flink metadata table supports virtual keys

2021-11-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-2847: - Labels: pull-request-available (was: ) > Flink metadata table supports virtual keys > ---

[GitHub] [hudi] danny0405 opened a new pull request #4096: [HUDI-2847] Flink metadata table supports virtual keys

2021-11-23 Thread GitBox
danny0405 opened a new pull request #4096: URL: https://github.com/apache/hudi/pull/4096 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the purpo

[jira] [Created] (HUDI-2847) Flink metadata table supports virtual keys

2021-11-23 Thread Danny Chen (Jira)
Danny Chen created HUDI-2847: Summary: Flink metadata table supports virtual keys Key: HUDI-2847 URL: https://issues.apache.org/jira/browse/HUDI-2847 Project: Apache Hudi Issue Type: Bug

[GitHub] [hudi] kywe665 commented on pull request #4095: [WIP] - [HUDI-2830] Docs for Commit Notifications

2021-11-23 Thread GitBox
kywe665 commented on pull request #4095: URL: https://github.com/apache/hudi/pull/4095#issuecomment-977537021 @bhasudha couple questions we can discuss: - How should we document the class_name - We should add simple code sample - Where do you think we should fit this doc? It fel

[jira] [Updated] (HUDI-2830) Docs for commit notifications

2021-11-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-2830: - Labels: pull-request-available (was: ) > Docs for commit notifications >

[GitHub] [hudi] kywe665 opened a new pull request #4095: [WIP] - [HUDI-2830] Docs for Commit Notifications

2021-11-23 Thread GitBox
kywe665 opened a new pull request #4095: URL: https://github.com/apache/hudi/pull/4095 ## What is the purpose of the pull request Adding docs for commit notification callbacks ## Brief change log - added commit_notifications.md ## Verify this pull request

[GitHub] [hudi] hudi-bot commented on pull request #4091: [HUDI-2844][CLI] Fixing archived Timeline crashing if timeline contains REPLACE_COMMIT

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4091: URL: https://github.com/apache/hudi/pull/4091#issuecomment-977535262 ## CI report: * bb7ddb58721bb6837eb73c90c1a0f44f31c98065 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4091: [HUDI-2844][CLI] Fixing archived Timeline crashing if timeline contains REPLACE_COMMIT

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4091: URL: https://github.com/apache/hudi/pull/4091#issuecomment-977453842 ## CI report: * 0260bd53a3ea05d0e8c2024c84c15b100e64fe5d Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[GitHub] [hudi] hudi-bot commented on pull request #4083: [HUDI-2837] The original hoodie.table.name should be maintained in Spark SQL

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4083: URL: https://github.com/apache/hudi/pull/4083#issuecomment-977532613 ## CI report: * ac5c1d542215d7893b0546e7158d4afb2fd79ea5 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #4083: [HUDI-2837] The original hoodie.table.name should be maintained in Spark SQL

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4083: URL: https://github.com/apache/hudi/pull/4083#issuecomment-977531643 ## CI report: * ca0337119a336bdb85d57ead739d744c7f94060d Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot removed a comment on pull request #4083: [HUDI-2837] The original hoodie.table.name should be maintained in Spark SQL

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4083: URL: https://github.com/apache/hudi/pull/4083#issuecomment-977481958 ## CI report: * ca0337119a336bdb85d57ead739d744c7f94060d Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4083: [HUDI-2837] The original hoodie.table.name should be maintained in Spark SQL

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4083: URL: https://github.com/apache/hudi/pull/4083#issuecomment-977531643 ## CI report: * ca0337119a336bdb85d57ead739d744c7f94060d Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] absognety opened a new issue #4094: [SUPPORT] Expose Hudi tables synced with glue catalog in snowflake

2021-11-23 Thread GitBox
absognety opened a new issue #4094: URL: https://github.com/apache/hudi/issues/4094 I would like to know if there are separate instructions to be followed, if I want to expose hudi tables written to S3 and synced with hive metastore and glue catalog in snowflake as external table? I

[GitHub] [hudi] hudi-bot removed a comment on pull request #4093: [HUDI-2766] Cluster update strategy should not be fenced by write config

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4093: URL: https://github.com/apache/hudi/pull/4093#issuecomment-977450232 ## CI report: * 77989b656e77ca91ddb19972dad197e229b98a52 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4093: [HUDI-2766] Cluster update strategy should not be fenced by write config

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4093: URL: https://github.com/apache/hudi/pull/4093#issuecomment-977526285 ## CI report: * 77989b656e77ca91ddb19972dad197e229b98a52 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] kywe665 commented on a change in pull request #4076: [HUDI-2822] Doc for auto file sizing guide

2021-11-23 Thread GitBox
kywe665 commented on a change in pull request #4076: URL: https://github.com/apache/hudi/pull/4076#discussion_r755695067 ## File path: website/docs/file_sizing.md ## @@ -0,0 +1,53 @@ +--- +title: "Auto File Size Management" +toc: true +--- + +This doc will show you how Apache H

[GitHub] [hudi] kywe665 commented on a change in pull request #4075: [HUDI-2629] - Docs to improve Overview page

2021-11-23 Thread GitBox
kywe665 commented on a change in pull request #4075: URL: https://github.com/apache/hudi/pull/4075#discussion_r755693624 ## File path: website/docs/overview.md ## @@ -6,167 +6,63 @@ toc: true last_modified_at: 2019-12-30T15:59:57-04:00 --- -Apache Hudi (pronounced “hoodie”)

[GitHub] [hudi] kywe665 commented on a change in pull request #4075: [HUDI-2629] - Docs to improve Overview page

2021-11-23 Thread GitBox
kywe665 commented on a change in pull request #4075: URL: https://github.com/apache/hudi/pull/4075#discussion_r755693501 ## File path: website/docs/overview.md ## @@ -6,167 +6,63 @@ toc: true last_modified_at: 2019-12-30T15:59:57-04:00 --- -Apache Hudi (pronounced “hoodie”)

[jira] [Updated] (HUDI-2822) Docs for File Sizing

2021-11-23 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2822: -- Status: Patch Available (was: In Progress) > Docs for File Sizing > > >

[jira] [Updated] (HUDI-2846) Docs for file layout concept

2021-11-23 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle Weller updated HUDI-2846: -- Status: Patch Available (was: In Progress) > Docs for file layout concept > ---

[jira] [Commented] (HUDI-2846) Docs for file layout concept

2021-11-23 Thread Kyle Weller (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17448357#comment-17448357 ] Kyle Weller commented on HUDI-2846: --- Patch in: https://github.com/apache/hudi/pull/4075

[jira] [Created] (HUDI-2846) Docs for file layout concept

2021-11-23 Thread Kyle Weller (Jira)
Kyle Weller created HUDI-2846: - Summary: Docs for file layout concept Key: HUDI-2846 URL: https://issues.apache.org/jira/browse/HUDI-2846 Project: Apache Hudi Issue Type: Sub-task Rep

[GitHub] [hudi] hudi-bot commented on pull request #4088: [HUDI-2671][WIP] Making error -> warn logs from timeline server with concurrent writers for inconsistent state

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4088: URL: https://github.com/apache/hudi/pull/4088#issuecomment-977519310 ## CI report: * 1af672ad3ce20459f07161aaf56f1ffc7a8f4e53 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4088: [HUDI-2671][WIP] Making error -> warn logs from timeline server with concurrent writers for inconsistent state

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4088: URL: https://github.com/apache/hudi/pull/4088#issuecomment-977428025 ## CI report: * 24fa1fa2e6d4acfbd209863b3de8af522b328a0f Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[jira] [Updated] (HUDI-2819) Revert "[HUDI-2799] Fix the classloader of flink write task (#4042)"

2021-11-23 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-2819: - Summary: Revert "[HUDI-2799] Fix the classloader of flink write task (#4042)" (was: Set up explicit class

[jira] [Commented] (HUDI-2819) Set up explicit classloader for flink hadoop configuration

2021-11-23 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17448352#comment-17448352 ] Danny Chen commented on HUDI-2819: -- Fixed via master branch: 323be33f185fbb4d3dd51acd5bcc

[GitHub] [hudi] danny0405 merged pull request #4069: Revert "[HUDI-2799] Fix the classloader of flink write task (#4042)"

2021-11-23 Thread GitBox
danny0405 merged pull request #4069: URL: https://github.com/apache/hudi/pull/4069 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubsc

[jira] [Resolved] (HUDI-2819) Set up explicit classloader for flink hadoop configuration

2021-11-23 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen resolved HUDI-2819. -- > Set up explicit classloader for flink hadoop configuration > -

[hudi] branch master updated (0cf2f10 -> 323be33)

2021-11-23 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 0cf2f10 [HUDI-2838] refresh table after drop partition (#4084) add 323be33 Revert "[HUDI-2799] Fix the classl

[GitHub] [hudi] xiarixiaoyao commented on a change in pull request #4060: [HUDI-2814][WIP] Addressing issues w/ Z-order Layout Optimization

2021-11-23 Thread GitBox
xiarixiaoyao commented on a change in pull request #4060: URL: https://github.com/apache/hudi/pull/4060#discussion_r755682161 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/spark/sql/hudi/DataSkippingUtils.scala ## @@ -29,148 +30,186 @@ import org.apa

[jira] [Resolved] (HUDI-2838) refresh table manually after drop partition

2021-11-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu resolved HUDI-2838. -- > refresh table manually after drop partition > --- > >

[jira] [Closed] (HUDI-2838) refresh table manually after drop partition

2021-11-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-2838. > refresh table manually after drop partition > --- > >

[GitHub] [hudi] hudi-bot removed a comment on pull request #4083: [HUDI-2837] The original hoodie.table.name should be maintained in Spark SQL

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4083: URL: https://github.com/apache/hudi/pull/4083#issuecomment-977478615 ## CI report: * ca0337119a336bdb85d57ead739d744c7f94060d Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4083: [HUDI-2837] The original hoodie.table.name should be maintained in Spark SQL

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4083: URL: https://github.com/apache/hudi/pull/4083#issuecomment-977481958 ## CI report: * ca0337119a336bdb85d57ead739d744c7f94060d Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] xiarixiaoyao commented on pull request #4026: [HUDI-2788] Fixing issues w/ Z-order Layout Optimization

2021-11-23 Thread GitBox
xiarixiaoyao commented on pull request #4026: URL: https://github.com/apache/hudi/pull/4026#issuecomment-977481712 @alexeykudinkin It is wrong to convert the not condition of dataskipping, could we use original logic. LGTM to other modify -- This is an automated message from the Apac

[GitHub] [hudi] xushiyan merged pull request #4084: [HUDI-2838] refresh table after drop partition

2021-11-23 Thread GitBox
xushiyan merged pull request #4084: URL: https://github.com/apache/hudi/pull/4084 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr

[hudi] branch master updated (5078d29 -> 0cf2f10)

2021-11-23 Thread xushiyan
This is an automated email from the ASF dual-hosted git repository. xushiyan pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 5078d29 [HUDI-2818] Fix 2to3 upgrade when set `hoodie.table.keygenerator.class` (#4077) add 0cf2f10 [HUDI-283

[GitHub] [hudi] hudi-bot removed a comment on pull request #4083: [HUDI-2837] The original hoodie.table.name should be maintained in Spark SQL

2021-11-23 Thread GitBox
hudi-bot removed a comment on pull request #4083: URL: https://github.com/apache/hudi/pull/4083#issuecomment-976389519 ## CI report: * ca0337119a336bdb85d57ead739d744c7f94060d Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4083: [HUDI-2837] The original hoodie.table.name should be maintained in Spark SQL

2021-11-23 Thread GitBox
hudi-bot commented on pull request #4083: URL: https://github.com/apache/hudi/pull/4083#issuecomment-977478615 ## CI report: * ca0337119a336bdb85d57ead739d744c7f94060d Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu
