[GitHub] [hudi] hudi-bot removed a comment on pull request #4421: [HUDI-3096] fixed the bug that the cow table(contains decimalType) wriite by flink cannot be read by spark.

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4421: URL: https://github.com/apache/hudi/pull/4421#issuecomment-999357891 ## CI report: * 5f28ef6f4a255f6962b0900e5e1dc00e2a98f1b5 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4421: [HUDI-3096] fixed the bug that the cow table(contains decimalType) wriite by flink cannot be read by spark.

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4421: URL: https://github.com/apache/hudi/pull/4421#issuecomment-999363938 ## CI report: * 5f28ef6f4a255f6962b0900e5e1dc00e2a98f1b5 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot commented on pull request #4333: [HUDI-431] Adding support for Parquet in MOR `LogBlock`s

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4333: URL: https://github.com/apache/hudi/pull/4333#issuecomment-999365421 ## CI report: * 286aa8b95627eaaa01114567797186263a830774 UNKNOWN * e722499ee75403ab62f646fdabca1a2c59570164 UNKNOWN * de0d4385394dc5d820964cefc872f099cee7a02b UNKNO

[GitHub] [hudi] hudi-bot removed a comment on pull request #4333: [HUDI-431] Adding support for Parquet in MOR `LogBlock`s

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4333: URL: https://github.com/apache/hudi/pull/4333#issuecomment-999335468 ## CI report: * 286aa8b95627eaaa01114567797186263a830774 UNKNOWN * e722499ee75403ab62f646fdabca1a2c59570164 UNKNOWN * de0d4385394dc5d820964cefc872f099cee7a0

[GitHub] [hudi] hudi-bot commented on pull request #4421: [HUDI-3096] fixed the bug that the cow table(contains decimalType) wriite by flink cannot be read by spark.

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4421: URL: https://github.com/apache/hudi/pull/4421#issuecomment-999365546 ## CI report: * 5f28ef6f4a255f6962b0900e5e1dc00e2a98f1b5 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #4421: [HUDI-3096] fixed the bug that the cow table(contains decimalType) wriite by flink cannot be read by spark.

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4421: URL: https://github.com/apache/hudi/pull/4421#issuecomment-999363938 ## CI report: * 5f28ef6f4a255f6962b0900e5e1dc00e2a98f1b5 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] manojpec commented on a change in pull request #3977: [HUDI-2716] InLineFS support for S3FS logs

2021-12-22 Thread GitBox
manojpec commented on a change in pull request #3977: URL: https://github.com/apache/hudi/pull/3977#discussion_r773675837 ## File path: hudi-common/src/main/java/org/apache/hudi/common/fs/inline/InLineFSUtils.java ## @@ -29,46 +32,58 @@ public class InLineFSUtils { private

[GitHub] [hudi] danny0405 commented on a change in pull request #4421: [HUDI-3096] fixed the bug that the cow table(contains decimalType) wriite by flink cannot be read by spark.

2021-12-22 Thread GitBox
danny0405 commented on a change in pull request #4421: URL: https://github.com/apache/hudi/pull/4421#discussion_r773676079 ## File path: hudi-flink/src/main/java/org/apache/hudi/util/AvroSchemaConverter.java ## @@ -244,10 +244,13 @@ public static Schema convertToSchema(Logical

[GitHub] [hudi] danny0405 commented on a change in pull request #4420: [HUDI-1847] [UNDER_DISCUSSION] Adding async scheduling support for spark datasource path for compaction and clustering

2021-12-22 Thread GitBox
danny0405 commented on a change in pull request #4420: URL: https://github.com/apache/hudi/pull/4420#discussion_r773684448 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/AbstractHoodieWriteClient.java ## @@ -989,12 +989,14 @@ protected void r

[GitHub] [hudi] prashantwason commented on a change in pull request #4336: [HUDI-3032] Do not clean the log files right after compaction for met…

2021-12-22 Thread GitBox
prashantwason commented on a change in pull request #4336: URL: https://github.com/apache/hudi/pull/4336#discussion_r773685014 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java ## @@ -706,7 +706,20 @@ prote

[GitHub] [hudi] danny0405 commented on a change in pull request #4420: [HUDI-1847] [UNDER_DISCUSSION] Adding async scheduling support for spark datasource path for compaction and clustering

2021-12-22 Thread GitBox
danny0405 commented on a change in pull request #4420: URL: https://github.com/apache/hudi/pull/4420#discussion_r773685897 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/AbstractHoodieWriteClient.java ## @@ -460,24 +460,24 @@ protected void p

[GitHub] [hudi] yanenze commented on issue #4419: [SUPPORT] Not An Avro File

2021-12-22 Thread GitBox
yanenze commented on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-999375807 > Thanks, can you make sure the file handle or file name ? ![image](https://user-images.githubusercontent.com/34880077/147059624-c3427709-f7db-4f01-bfaa-7ffbd26d59a5.png) only

[GitHub] [hudi] danny0405 commented on a change in pull request #4420: [HUDI-1847] [UNDER_DISCUSSION] Adding async scheduling support for spark datasource path for compaction and clustering

2021-12-22 Thread GitBox
danny0405 commented on a change in pull request #4420: URL: https://github.com/apache/hudi/pull/4420#discussion_r773687455 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieClusteringConfig.java ## @@ -143,6 +143,12 @@ .withDocument

[GitHub] [hudi] danny0405 commented on a change in pull request #4420: [HUDI-1847] [UNDER_DISCUSSION] Adding async scheduling support for spark datasource path for compaction and clustering

2021-12-22 Thread GitBox
danny0405 commented on a change in pull request #4420: URL: https://github.com/apache/hudi/pull/4420#discussion_r773688261 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieCompactionConfig.java ## @@ -90,6 +90,12 @@ .withDocumentat

[GitHub] [hudi] yanenze edited a comment on issue #4419: [SUPPORT] Not An Avro File

2021-12-22 Thread GitBox
yanenze edited a comment on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-999375807 > Thanks, can you make sure the file handle or file name ? ![image](https://user-images.githubusercontent.com/34880077/14706-a4643daf-2b78-4a7c-9573-d322714ed68a.png)

[GitHub] [hudi] yanenze edited a comment on issue #4419: [SUPPORT] Not An Avro File

2021-12-22 Thread GitBox
yanenze edited a comment on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-999375807 > Thanks, can you make sure the file handle or file name ? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and us

[GitHub] [hudi] yanenze removed a comment on issue #4419: [SUPPORT] Not An Avro File

2021-12-22 Thread GitBox
yanenze removed a comment on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-999375807 > Thanks, can you make sure the file handle or file name ? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and u

[jira] [Assigned] (HUDI-3066) Very slow file listing after enabling metadata for existing tables in 0.10.0 release

2021-12-22 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy reassigned HUDI-3066: Assignee: sivabalan narayanan (was: Manoj Govindassamy) > Very slow file listing a

[jira] [Commented] (HUDI-3066) Very slow file listing after enabling metadata for existing tables in 0.10.0 release

2021-12-22 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17463639#comment-17463639 ] Manoj Govindassamy commented on HUDI-3066: -- Assigning the ticket to [~shivnarayan

[GitHub] [hudi] xiarixiaoyao commented on a change in pull request #4421: [HUDI-3096] fixed the bug that the cow table(contains decimalType) wriite by flink cannot be read by spark.

2021-12-22 Thread GitBox
xiarixiaoyao commented on a change in pull request #4421: URL: https://github.com/apache/hudi/pull/4421#discussion_r773694729 ## File path: hudi-flink/src/main/java/org/apache/hudi/util/AvroSchemaConverter.java ## @@ -244,10 +244,13 @@ public static Schema convertToSchema(Logi

[GitHub] [hudi] danny0405 commented on a change in pull request #4336: [HUDI-3032] Do not clean the log files right after compaction for met…

2021-12-22 Thread GitBox
danny0405 commented on a change in pull request #4336: URL: https://github.com/apache/hudi/pull/4336#discussion_r773696840 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java ## @@ -706,7 +706,20 @@ protected

[jira] [Created] (HUDI-3097) Address dependency issue with hudi-trino-bundle in connector

2021-12-22 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-3097: --- Summary: Address dependency issue with hudi-trino-bundle in connector Key: HUDI-3097 URL: https://issues.apache.org/jira/browse/HUDI-3097 Project: Apache Hudi Issue T

[jira] [Updated] (HUDI-3097) Address dependency issue with hudi-trino-bundle in connector

2021-12-22 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-3097: Fix Version/s: 0.10.1 > Address dependency issue with hudi-trino-bundle in connector > -

[jira] [Updated] (HUDI-3097) Address dependency issue with hudi-trino-bundle in connector

2021-12-22 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-3097: Priority: Blocker (was: Major) > Address dependency issue with hudi-trino-bundle in connector > ---

[jira] [Updated] (HUDI-3097) Address dependency issue with hudi-trino-bundle in connector

2021-12-22 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-3097: Fix Version/s: 0.11.0 > Address dependency issue with hudi-trino-bundle in connector > -

[jira] [Assigned] (HUDI-3097) Address dependency issue with hudi-trino-bundle in connector

2021-12-22 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-3097: --- Assignee: Ethan Guo > Address dependency issue with hudi-trino-bundle in connector >

[jira] [Updated] (HUDI-3097) Address dependency issue with hudi-trino-bundle in connector

2021-12-22 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-3097: Status: In Progress (was: Open) > Address dependency issue with hudi-trino-bundle in connector > --

[GitHub] [hudi] yanenze commented on issue #4419: [SUPPORT] Not An Avro File

2021-12-22 Thread GitBox
yanenze commented on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-999386836 > Not an Avro data file 2021-12-22 16:32:03 org.apache.flink.util.FlinkException: Global failure triggered by OperatorCoordinator for 'hoodie_stream_write' (operator 3753fee1b

[GitHub] [hudi] danny0405 commented on a change in pull request #4421: [HUDI-3096] fixed the bug that the cow table(contains decimalType) wriite by flink cannot be read by spark.

2021-12-22 Thread GitBox
danny0405 commented on a change in pull request #4421: URL: https://github.com/apache/hudi/pull/4421#discussion_r773698637 ## File path: hudi-flink/src/main/java/org/apache/hudi/util/AvroSchemaConverter.java ## @@ -244,10 +244,13 @@ public static Schema convertToSchema(Logical

[jira] [Updated] (HUDI-3097) Address dependency issue with hudi-trino-bundle in connector

2021-12-22 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-3097: Story Points: 1 > Address dependency issue with hudi-trino-bundle in connector > ---

[jira] [Updated] (HUDI-2518) Implement stats/range tracking as a part of Metadata table

2021-12-22 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy updated HUDI-2518: - Priority: Blocker (was: Critical) > Implement stats/range tracking as a part of Metadata

[GitHub] [hudi] danny0405 commented on issue #4419: [SUPPORT] Not An Avro File

2021-12-22 Thread GitBox
danny0405 commented on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-999389224 These file seems normal, is the `xxx.deltacommit` file a normal json meta file ? -- This is an automated message from the Apache Git Service. To respond to the message, please log o

[jira] [Updated] (HUDI-2714) Benchmark MetaIndex performance w/ bloom and column stat metadata

2021-12-22 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy updated HUDI-2714: - Priority: Blocker (was: Critical) > Benchmark MetaIndex performance w/ bloom and column s

[jira] [Closed] (HUDI-2587) Impl metadata table based bloom index

2021-12-22 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy closed HUDI-2587. Resolution: Duplicate Closing this as a DUP of HUDI-1295 > Impl metadata table based bloom

[jira] [Updated] (HUDI-2518) Implement stats/range tracking as a part of Metadata table

2021-12-22 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy updated HUDI-2518: - Status: In Progress (was: Open) > Implement stats/range tracking as a part of Metadata ta

[jira] [Updated] (HUDI-2714) Benchmark MetaIndex performance w/ bloom and column stat metadata

2021-12-22 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy updated HUDI-2714: - Status: In Progress (was: Open) > Benchmark MetaIndex performance w/ bloom and column sta

[jira] [Closed] (HUDI-2588) Test metadata based bloom index

2021-12-22 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy closed HUDI-2588. Resolution: Duplicate Closing this as DUP of HUDI-2584 > Test metadata based bloom index >

[jira] [Updated] (HUDI-2584) Test Bloom filter based out of metadata table.

2021-12-22 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy updated HUDI-2584: - Priority: Blocker (was: Critical) > Test Bloom filter based out of metadata table. > ---

[GitHub] [hudi] hudi-bot commented on pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4404: URL: https://github.com/apache/hudi/pull/4404#issuecomment-999390839 ## CI report: * fbca0322a0dc67f938d3489536fd660c70a4b506 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[jira] [Updated] (HUDI-2584) Test Bloom filter based out of metadata table.

2021-12-22 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy updated HUDI-2584: - Status: In Progress (was: Open) > Test Bloom filter based out of metadata table. > -

[GitHub] [hudi] hudi-bot removed a comment on pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4404: URL: https://github.com/apache/hudi/pull/4404#issuecomment-997874729 ## CI report: * fbca0322a0dc67f938d3489536fd660c70a4b506 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[jira] [Updated] (HUDI-2584) Unit tests for bloom filter index based out of metadata table.

2021-12-22 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy updated HUDI-2584: - Summary: Unit tests for bloom filter index based out of metadata table. (was: Test Bloom

[GitHub] [hudi] xiarixiaoyao commented on a change in pull request #4421: [HUDI-3096] fixed the bug that the cow table(contains decimalType) wriite by flink cannot be read by spark.

2021-12-22 Thread GitBox
xiarixiaoyao commented on a change in pull request #4421: URL: https://github.com/apache/hudi/pull/4421#discussion_r773703134 ## File path: hudi-flink/src/main/java/org/apache/hudi/util/AvroSchemaConverter.java ## @@ -244,10 +244,13 @@ public static Schema convertToSchema(Logi

[GitHub] [hudi] hudi-bot commented on pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4404: URL: https://github.com/apache/hudi/pull/4404#issuecomment-999392623 ## CI report: * fbca0322a0dc67f938d3489536fd660c70a4b506 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4404: URL: https://github.com/apache/hudi/pull/4404#issuecomment-999390839 ## CI report: * fbca0322a0dc67f938d3489536fd660c70a4b506 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] yanenze commented on issue #4419: [SUPPORT] Not An Avro File

2021-12-22 Thread GitBox
yanenze commented on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-999394463 -- 原始邮件 -- 发件人: "apache/hudi"

[GitHub] [hudi] hudi-bot commented on pull request #4420: [HUDI-1847] [UNDER_DISCUSSION] Adding async scheduling support for spark datasource path for compaction and clustering

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4420: URL: https://github.com/apache/hudi/pull/4420#issuecomment-999394613 ## CI report: * 066bfb55adff0afd0e0b0e86c3c42d88f842e37c Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] xiarixiaoyao merged pull request #4308: [HUDI-3008] Fixing HoodieFileIndex partition column parsing for nested fields

2021-12-22 Thread GitBox
xiarixiaoyao merged pull request #4308: URL: https://github.com/apache/hudi/pull/4308 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4420: [HUDI-1847] [UNDER_DISCUSSION] Adding async scheduling support for spark datasource path for compaction and clustering

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4420: URL: https://github.com/apache/hudi/pull/4420#issuecomment-999351864 ## CI report: * 066bfb55adff0afd0e0b0e86c3c42d88f842e37c Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] yanenze commented on issue #4419: [SUPPORT] Not An Avro File

2021-12-22 Thread GitBox
yanenze commented on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-999394854 > These file seems normal, is the `xxx.deltacommit` file a normal json meta file ? yes, it is normal,i have sent the file to your mailbox -- This is an automated message from t

[hudi] branch master updated: [HUDI-3008] Fixing HoodieFileIndex partition column parsing for nested fields

2021-12-22 Thread mengtao
This is an automated email from the ASF dual-hosted git repository. mengtao pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 7d046f9 [HUDI-3008] Fixing HoodieFileIndex parti

[GitHub] [hudi] danny0405 commented on issue #4419: [SUPPORT] Not An Avro File

2021-12-22 Thread GitBox
danny0405 commented on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-999396419 Does the table has concurrent reader there ? If the file is a complete json file, it seems an issue caused by concurrent modification. -- This is an automated message from the Apach

[GitHub] [hudi] nsivabalan commented on issue #2936: [SUPPORT] OverwriteNonDefaultsWithLatestAvroPayload not work in mor table

2021-12-22 Thread GitBox
nsivabalan commented on issue #2936: URL: https://github.com/apache/hudi/issues/2936#issuecomment-999397069 @shenbinglife : I could not reproduce this. Here is what I tried. Inserted a bunch of records to a new hudi MOR table. Generated updates and marked one of the column as null (col

[GitHub] [hudi] yanenze commented on issue #4419: [SUPPORT] Not An Avro File

2021-12-22 Thread GitBox
yanenze commented on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-999397802 > These file seems normal, is the `xxx.deltacommit` file a normal json meta file ? now my flink mission like this it can also checkpoint sussessful, but the exception occured whe

[GitHub] [hudi] hudi-bot commented on pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4404: URL: https://github.com/apache/hudi/pull/4404#issuecomment-999398343 ## CI report: * fbca0322a0dc67f938d3489536fd660c70a4b506 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4421: [HUDI-3096] fixed the bug that the cow table(contains decimalType) wriite by flink cannot be read by spark.

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4421: URL: https://github.com/apache/hudi/pull/4421#issuecomment-999365546 ## CI report: * 5f28ef6f4a255f6962b0900e5e1dc00e2a98f1b5 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[GitHub] [hudi] hudi-bot commented on pull request #4421: [HUDI-3096] fixed the bug that the cow table(contains decimalType) wriite by flink cannot be read by spark.

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4421: URL: https://github.com/apache/hudi/pull/4421#issuecomment-999398407 ## CI report: * 5f28ef6f4a255f6962b0900e5e1dc00e2a98f1b5 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4404: URL: https://github.com/apache/hudi/pull/4404#issuecomment-999392623 ## CI report: * fbca0322a0dc67f938d3489536fd660c70a4b506 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] harsh1231 commented on a change in pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
harsh1231 commented on a change in pull request #4404: URL: https://github.com/apache/hudi/pull/4404#discussion_r773711405 ## File path: hudi-client/hudi-spark-client/src/main/scala/org/apache/spark/sql/hudi/HoodieComparator.scala ## @@ -0,0 +1,30 @@ +/* + * Licensed to the Ap

[GitHub] [hudi] hudi-bot commented on pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4404: URL: https://github.com/apache/hudi/pull/4404#issuecomment-999400297 ## CI report: * cb1cc9010c0c9d572bc5db55ee67b51df735ee59 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4404: URL: https://github.com/apache/hudi/pull/4404#issuecomment-999398343 ## CI report: * fbca0322a0dc67f938d3489536fd660c70a4b506 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] yanenze commented on issue #4419: [SUPPORT] Not An Avro File

2021-12-22 Thread GitBox
yanenze commented on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-999400428 > Does the table has concurrent reader there ? If the file is a complete json file, it seems an issue caused by concurrent modification. there is only one mission to write and rea

[GitHub] [hudi] nsivabalan commented on issue #3429: [SUPPORT] Upserting timestamp with microseconds precision truncate the microseconds part

2021-12-22 Thread GitBox
nsivabalan commented on issue #3429: URL: https://github.com/apache/hudi/issues/3429#issuecomment-999413753 got it. I could able to repro. @cdmikechen : Do you know if there is a way we can get around this. -- This is an automated message from the Apache Git Service. To respond to th

[GitHub] [hudi] nsivabalan opened a new pull request #4422: [MINOR] Fixing dynamoDbLockConfig required prop check

2021-12-22 Thread GitBox
nsivabalan opened a new pull request #4422: URL: https://github.com/apache/hudi/pull/4422 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the purp

[GitHub] [hudi] nsivabalan commented on issue #4414: [SUPPORT] Required params in DynamoDBBasedLockProvider

2021-12-22 Thread GitBox
nsivabalan commented on issue #4414: URL: https://github.com/apache/hudi/issues/4414#issuecomment-999418710 https://github.com/apache/hudi/pull/4422 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [hudi] hudi-bot commented on pull request #4422: [MINOR] Fixing dynamoDbLockConfig required prop check

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4422: URL: https://github.com/apache/hudi/pull/4422#issuecomment-999420410 ## CI report: * 65ea08c06490604b1a05bf22a5de5f34767ee01c UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` r

[jira] [Created] (HUDI-3098) Billion mode in dynamo db based lock configs has a default value, but still checks for mandatory setting

2021-12-22 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-3098: - Summary: Billion mode in dynamo db based lock configs has a default value, but still checks for mandatory setting Key: HUDI-3098 URL: https://issues.apache.org/jira/brow

[GitHub] [hudi] nsivabalan commented on issue #4414: [SUPPORT] Required params in DynamoDBBasedLockProvider

2021-12-22 Thread GitBox
nsivabalan commented on issue #4414: URL: https://github.com/apache/hudi/issues/4414#issuecomment-999421567 Filed a tracking jira too. https://issues.apache.org/jira/browse/HUDI-3098 -- This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [hudi] nsivabalan closed issue #4414: [SUPPORT] Required params in DynamoDBBasedLockProvider

2021-12-22 Thread GitBox
nsivabalan closed issue #4414: URL: https://github.com/apache/hudi/issues/4414 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...

[jira] [Assigned] (HUDI-3098) Billion mode in dynamo db based lock configs has a default value, but still checks for mandatory setting

2021-12-22 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-3098: - Assignee: sivabalan narayanan > Billion mode in dynamo db based lock configs has

[GitHub] [hudi] hudi-bot removed a comment on pull request #4422: [HUDI-3098] Fixing dynamoDbLockConfig required prop check

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4422: URL: https://github.com/apache/hudi/pull/4422#issuecomment-999420410 ## CI report: * 65ea08c06490604b1a05bf22a5de5f34767ee01c UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run

[GitHub] [hudi] hudi-bot commented on pull request #4422: [HUDI-3098] Fixing dynamoDbLockConfig required prop check

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4422: URL: https://github.com/apache/hudi/pull/4422#issuecomment-999422510 ## CI report: * 65ea08c06490604b1a05bf22a5de5f34767ee01c Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] zhangyue19921010 commented on a change in pull request #4078: [HUDI-2833] Clean up unused archive files instead of expanding indefinitely.

2021-12-22 Thread GitBox
zhangyue19921010 commented on a change in pull request #4078: URL: https://github.com/apache/hudi/pull/4078#discussion_r773735747 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieCompactionConfig.java ## @@ -249,6 +249,21 @@ +

[jira] [Updated] (HUDI-3098) Billion mode in dynamo db based lock configs has a default value, but still checks for mandatory setting

2021-12-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3098: - Labels: pull-request-available (was: ) > Billion mode in dynamo db based lock configs has a defau

[GitHub] [hudi] hudi-bot commented on pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4404: URL: https://github.com/apache/hudi/pull/4404#issuecomment-999434326 ## CI report: * cb1cc9010c0c9d572bc5db55ee67b51df735ee59 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4404: URL: https://github.com/apache/hudi/pull/4404#issuecomment-999400297 ## CI report: * cb1cc9010c0c9d572bc5db55ee67b51df735ee59 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[GitHub] [hudi] hudi-bot removed a comment on pull request #4421: [HUDI-3096] fixed the bug that the cow table(contains decimalType) wriite by flink cannot be read by spark.

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4421: URL: https://github.com/apache/hudi/pull/4421#issuecomment-999398407 ## CI report: * 5f28ef6f4a255f6962b0900e5e1dc00e2a98f1b5 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[GitHub] [hudi] hudi-bot commented on pull request #4421: [HUDI-3096] fixed the bug that the cow table(contains decimalType) wriite by flink cannot be read by spark.

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4421: URL: https://github.com/apache/hudi/pull/4421#issuecomment-999438005 ## CI report: * 3a87fd4953ed883b8c7041f50d57423d04955bf4 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] guanziyue removed a comment on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2021-12-22 Thread GitBox
guanziyue removed a comment on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-997980848 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the sp

[GitHub] [hudi] hudi-bot removed a comment on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-998061031 ## CI report: * 74beb1e38e63aff2a08e71243da1202b5473d7a9 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-999449513 ## CI report: * 74beb1e38e63aff2a08e71243da1202b5473d7a9 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot commented on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-999451578 ## CI report: * 74beb1e38e63aff2a08e71243da1202b5473d7a9 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-999449513 ## CI report: * 74beb1e38e63aff2a08e71243da1202b5473d7a9 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] codope merged pull request #4334: [HUDI-3011] Adding ability to read entire data with HoodieIncrSource with empty checkpoint

2021-12-22 Thread GitBox
codope merged pull request #4334: URL: https://github.com/apache/hudi/pull/4334 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr..

[hudi] branch master updated (b5890cd -> 1a5f869)

2021-12-22 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from b5890cd Merge pull request #4308 from harsh1231/HUDI-3008 add 1a5f869 [HUDI-3011] Adding ability to read entire

[GitHub] [hudi] hudi-bot commented on pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4404: URL: https://github.com/apache/hudi/pull/4404#issuecomment-999474624 ## CI report: * 71505f07ad3215a36e8a617b98c2e372abd8d2a5 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4404: [HUDI-2558] Fixing Clustering w/ sort columns with null values fails

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4404: URL: https://github.com/apache/hudi/pull/4404#issuecomment-999434326 ## CI report: * cb1cc9010c0c9d572bc5db55ee67b51df735ee59 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[GitHub] [hudi] hudi-bot removed a comment on pull request #4422: [HUDI-3098] Fixing dynamoDbLockConfig required prop check

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4422: URL: https://github.com/apache/hudi/pull/4422#issuecomment-999422510 ## CI report: * 65ea08c06490604b1a05bf22a5de5f34767ee01c Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4422: [HUDI-3098] Fixing dynamoDbLockConfig required prop check

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4422: URL: https://github.com/apache/hudi/pull/4422#issuecomment-999476448 ## CI report: * 65ea08c06490604b1a05bf22a5de5f34767ee01c Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] leesf merged pull request #4364: [HUDI-3060] drop table for spark sql

2021-12-22 Thread GitBox
leesf merged pull request #4364: URL: https://github.com/apache/hudi/pull/4364 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...

[hudi] branch master updated (1a5f869 -> 5d93edc)

2021-12-22 Thread leesf
This is an automated email from the ASF dual-hosted git repository. leesf pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 1a5f869 [HUDI-3011] Adding ability to read entire data with HoodieIncrSource with empty checkpoint (#4334) add 5d

[GitHub] [hudi] hudi-bot removed a comment on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-999451578 ## CI report: * 74beb1e38e63aff2a08e71243da1202b5473d7a9 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4264: [HUDI-2875] Make HoodieParquetWriter Thread safe and memory executor …

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4264: URL: https://github.com/apache/hudi/pull/4264#issuecomment-999496794 ## CI report: * b5c58235a5582e2edcbdaebc00e9774de08686a5 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] leesf commented on pull request #4418: [HUDI-3087] Fix DedupeSparkJob typo

2021-12-22 Thread GitBox
leesf commented on pull request #4418: URL: https://github.com/apache/hudi/pull/4418#issuecomment-999497213 @Aimiyoo hi, you would just use `[MINOR] description ` for the typo fix instead of creating a jira ticket. -- This is an automated message from the Apache Git Service. To respond t

[GitHub] [hudi] hudi-bot commented on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-999513228 ## CI report: * 70b4892db2c44f4a03e5747a28b28e6dc761f959 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-998736509 ## CI report: * 70b4892db2c44f4a03e5747a28b28e6dc761f959 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-999514948 ## CI report: * 70b4892db2c44f4a03e5747a28b28e6dc761f959 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-999513228 ## CI report: * 70b4892db2c44f4a03e5747a28b28e6dc761f959 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

[GitHub] [hudi] hudi-bot commented on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2021-12-22 Thread GitBox
hudi-bot commented on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-999518712 ## CI report: * 70b4892db2c44f4a03e5747a28b28e6dc761f959 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?bu

[GitHub] [hudi] hudi-bot removed a comment on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2021-12-22 Thread GitBox
hudi-bot removed a comment on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-999514948 ## CI report: * 70b4892db2c44f4a03e5747a28b28e6dc761f959 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/re

  1   2   3   >