[jira] [Updated] (HUDI-2833) Clean up unused archive files instead of expanding indefinitely

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2833: - Remaining Estimate: 0.5h Original Estimate: 0.5h > Clean up unused archive files instead of expanding

[jira] [Updated] (HUDI-2917) Rollback may be incorrect for canIndexLogFile index

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2917: - Remaining Estimate: 1h Original Estimate: 1h > Rollback may be incorrect for canIndexLogFile index >

[jira] [Updated] (HUDI-3267) On-call team to triage GH issues, PRs, and JIRAs

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3267: - Remaining Estimate: 8h (was: 16h) Original Estimate: 8h (was: 16h) > On-call team to triage GH issu

[GitHub] [hudi] hudi-bot commented on pull request #4450: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single target table from multiple source tables.

2022-01-18 Thread GitBox
hudi-bot commented on pull request #4450: URL: https://github.com/apache/hudi/pull/4450#issuecomment-1016165488 ## CI report: * 1bbf4fd0d9d9d5aa5521cce9800c69cf8ca87795 UNKNOWN * 5cfd689bb854805b86e40f7b607d4815f0745a4a Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org

[GitHub] [hudi] hudi-bot removed a comment on pull request #4450: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single target table from multiple source tables.

2022-01-18 Thread GitBox
hudi-bot removed a comment on pull request #4450: URL: https://github.com/apache/hudi/pull/4450#issuecomment-1001849593 ## CI report: * 1bbf4fd0d9d9d5aa5521cce9800c69cf8ca87795 UNKNOWN * 5cfd689bb854805b86e40f7b607d4815f0745a4a Azure: [FAILURE](https://dev.azure.com/apache-hud

[GitHub] [hudi] hudi-bot commented on pull request #3745: [HUDI-2514] Fix delete exception for Spark SQL when sync Hive

2022-01-18 Thread GitBox
hudi-bot commented on pull request #3745: URL: https://github.com/apache/hudi/pull/3745#issuecomment-1016165115 ## CI report: * 071b13bfe3ebee1875d12432e550c8718566bfd4 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #3745: [HUDI-2514] Fix delete exception for Spark SQL when sync Hive

2022-01-18 Thread GitBox
hudi-bot removed a comment on pull request #3745: URL: https://github.com/apache/hudi/pull/3745#issuecomment-1016163282 ## CI report: * 071b13bfe3ebee1875d12432e550c8718566bfd4 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[jira] [Updated] (HUDI-2941) Show _hoodie_operation in spark sql results

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2941: - Sprint: Cont' improve - 2021/01/10, Cont' improve - 2021/01/24 (was: Cont' improve - 2021/01/10, Cont'

[jira] [Updated] (HUDI-2514) Add default hiveTableSerdeProperties for Spark SQL when sync Hive

2022-01-18 Thread Jira
[ https://issues.apache.org/jira/browse/HUDI-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] 董可伦 updated HUDI-2514: -- Attachment: (was: t3.png) > Add default hiveTableSerdeProperties for Spark SQL when sync Hive >

[jira] [Updated] (HUDI-2514) Add default hiveTableSerdeProperties for Spark SQL when sync Hive

2022-01-18 Thread Jira
[ https://issues.apache.org/jira/browse/HUDI-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] 董可伦 updated HUDI-2514: -- Description: If do not add the default hiveTableSerdeProperties,Spark SQL will not work properly For example,update:

[GitHub] [hudi] hudi-bot removed a comment on pull request #3745: [HUDI-2514] Fix delete exception for Spark SQL when sync Hive

2022-01-18 Thread GitBox
hudi-bot removed a comment on pull request #3745: URL: https://github.com/apache/hudi/pull/3745#issuecomment-1016072357 ## CI report: * 071b13bfe3ebee1875d12432e550c8718566bfd4 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[GitHub] [hudi] hudi-bot commented on pull request #3745: [HUDI-2514] Fix delete exception for Spark SQL when sync Hive

2022-01-18 Thread GitBox
hudi-bot commented on pull request #3745: URL: https://github.com/apache/hudi/pull/3745#issuecomment-1016163282 ## CI report: * 071b13bfe3ebee1875d12432e550c8718566bfd4 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[jira] [Updated] (HUDI-2514) Add default hiveTableSerdeProperties for Spark SQL when sync Hive

2022-01-18 Thread Jira
[ https://issues.apache.org/jira/browse/HUDI-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] 董可伦 updated HUDI-2514: -- Description: If do not add the default hiveTableSerdeProperties,Spark SQL will not work properly For example,update:

[jira] [Updated] (HUDI-3200) File Index config affects partition fields shown in printSchema results

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3200: - Sprint: Cont' improve - 2021/01/24 (was: Cont' improve - 2021/01/18) > File Index config affects partit

[GitHub] [hudi] yuzhaojing opened a new issue #4636: [SUPPORT] Sync timeline from embedded timeline server in flink pipline

2022-01-18 Thread GitBox
yuzhaojing opened a new issue #4636: URL: https://github.com/apache/hudi/issues/4636 At present, in the Flink-Hudi pipeline, each task will scan the meta directory to obtain the latest timeline, which will cause frequent get listing operations on HDFS and cause a lot of pressure. A prop

[GitHub] [hudi] xiarixiaoyao commented on issue #4609: [SUPPORT] Got exception while using clustering with z-order

2022-01-18 Thread GitBox
xiarixiaoyao commented on issue #4609: URL: https://github.com/apache/hudi/issues/4609#issuecomment-1016141358 @xushiyan @ravs11 since this problem has solved, let's close it. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [hudi] ChangbingChen closed issue #4618: [SUPPORT] When querying a hudi table in hive, there have duplicated records.

2022-01-18 Thread GitBox
ChangbingChen closed issue #4618: URL: https://github.com/apache/hudi/issues/4618 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr

[hudi] branch master updated (4bea758 -> 7647562)

2022-01-18 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 4bea758 [HUDI-3191] Rebasing Hive's FileInputFormat onto `AbstractHoodieTableFileIndex` (#4531) add 7647562 [HUD

[GitHub] [hudi] yihua merged pull request #4078: [HUDI-2833][Design] Merge small archive files instead of expanding indefinitely.

2022-01-18 Thread GitBox
yihua merged pull request #4078: URL: https://github.com/apache/hudi/pull/4078 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...

[GitHub] [hudi] hudi-bot commented on pull request #4078: [HUDI-2833][Design] Merge small archive files instead of expanding indefinitely.

2022-01-18 Thread GitBox
hudi-bot commented on pull request #4078: URL: https://github.com/apache/hudi/pull/4078#issuecomment-1016130852 ## CI report: * 5ba0b03136e6ecf6199c88363f3a99cd60207b9f Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #4078: [HUDI-2833][Design] Merge small archive files instead of expanding indefinitely.

2022-01-18 Thread GitBox
hudi-bot removed a comment on pull request #4078: URL: https://github.com/apache/hudi/pull/4078#issuecomment-1016090319 ## CI report: * 14a63372e1dbe358fef2bd9d8033adc4997d7767 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[GitHub] [hudi] york-yu-ctw commented on issue #4583: [SUPPORT] Got NoSuchElementException while using hudi 0.10.0 and Flink (COW)

2022-01-18 Thread GitBox
york-yu-ctw commented on issue #4583: URL: https://github.com/apache/hudi/issues/4583#issuecomment-1016116684 @xushiyan Not yet, since this error not happen every day, I will keep looking into it -- This is an automated message from the Apache Git Service. To respond to the message, p

[jira] [Assigned] (HUDI-2597) Improve code quality around Generics with Java 8

2022-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2597: --- Assignee: Raymond Xu (was: Ethan Guo) > Improve code quality around Generics with Java 8 > -

[jira] [Assigned] (HUDI-2439) Refactor table.action.commit package (CommitActionExecutors) in hudi-client module

2022-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2439: --- Assignee: Raymond Xu (was: Ethan Guo) > Refactor table.action.commit package (CommitActionExecutors)

[jira] [Assigned] (HUDI-2596) Make class names consistent in hudi-client

2022-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2596: --- Assignee: Raymond Xu (was: Ethan Guo) > Make class names consistent in hudi-client > ---

[jira] [Assigned] (HUDI-2598) Redesign record payload class to decouple HoodieRecordPayload from Avro

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-2598: Assignee: Alexey Kudinkin (was: Ethan Guo) > Redesign record payload class to decouple Hoo

[jira] [Assigned] (HUDI-2656) Generalize HoodieIndex for flexible record data type

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-2656: Assignee: Raymond Xu (was: Ethan Guo) > Generalize HoodieIndex for flexible record data ty

[jira] [Updated] (HUDI-2596) Make class names consistent in hudi-client

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2596: - Sprint: Hudi-Sprint-Jan-18 > Make class names consistent in hudi-client >

[jira] [Updated] (HUDI-2439) Refactor table.action.commit package (CommitActionExecutors) in hudi-client module

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2439: - Sprint: Hudi-Sprint-Jan-18 > Refactor table.action.commit package (CommitActionExecutors) in hudi-

[jira] [Updated] (HUDI-2656) Generalize HoodieIndex for flexible record data type

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2656: - Sprint: Hudi-Sprint-Jan-18 > Generalize HoodieIndex for flexible record data type > --

[jira] [Updated] (HUDI-2751) To avoid the duplicates for streaming read MOR table

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2751: - Sprint: Hudi-Sprint-Jan-18 > To avoid the duplicates for streaming read MOR table > --

[jira] [Updated] (HUDI-2751) To avoid the duplicates for streaming read MOR table

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2751: - Story Points: 4 > To avoid the duplicates for streaming read MOR table > -

[jira] [Assigned] (HUDI-2751) To avoid the duplicates for streaming read MOR table

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-2751: Assignee: Alexey Kudinkin > To avoid the duplicates for streaming read MOR table >

[jira] [Assigned] (HUDI-2749) Improve the streaming read for hudi

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-2749: Assignee: Alexey Kudinkin > Improve the streaming read for hudi > -

[jira] [Updated] (HUDI-2749) Improve the streaming read for hudi

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2749: - Priority: Blocker (was: Major) > Improve the streaming read for hudi > --

[jira] [Updated] (HUDI-3247) Support incremental queries in AbstractHoodieTableFileIndex

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3247: - Sprint: Hudi-Sprint-Jan-18 > Support incremental queries in AbstractHoodieTableFileIndex > ---

[jira] [Updated] (HUDI-3276) Make HoodieParquetInputFormat extend MapredParquetInputFormat again

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3276: - Sprint: Hudi-Sprint-Jan-18 > Make HoodieParquetInputFormat extend MapredParquetInputFormat again >

[jira] [Updated] (HUDI-3239) Convert AbstractHoodieTableFileIndex to Java

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3239: - Sprint: Hudi-Sprint-Jan-18 > Convert AbstractHoodieTableFileIndex to Java > --

[jira] [Assigned] (HUDI-2987) event time not recorded in commit metadata when insert or bulk insert

2022-01-18 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-2987: - Assignee: sivabalan narayanan (was: Raymond Xu) > event time not recorded in com

[jira] [Created] (HUDI-3276) Make HoodieParquetInputFormat extend MapredParquetInputFormat again

2022-01-18 Thread Vinoth Chandar (Jira)
Vinoth Chandar created HUDI-3276: Summary: Make HoodieParquetInputFormat extend MapredParquetInputFormat again Key: HUDI-3276 URL: https://issues.apache.org/jira/browse/HUDI-3276 Project: Apache Hudi

[jira] [Updated] (HUDI-3247) Support incremental queries in AbstractHoodieTableFileIndex

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3247: - Fix Version/s: 0.11.0 > Support incremental queries in AbstractHoodieTableFileIndex >

[jira] [Updated] (HUDI-3247) Support incremental queries in AbstractHoodieTableFileIndex

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3247: - Priority: Blocker (was: Major) > Support incremental queries in AbstractHoodieTableFileIndex > --

[jira] [Updated] (HUDI-1847) Add ability to decouple configs for scheduling inline and running async

2022-01-18 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1847: -- Remaining Estimate: 2h (was: 0h) > Add ability to decouple configs for scheduling inlin

[jira] [Updated] (HUDI-3072) AutoCommit misses to detect write conflicts during concurrent transactions

2022-01-18 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-3072: -- Remaining Estimate: 1h (was: 0h) > AutoCommit misses to detect write conflicts during c

[GitHub] [hudi] hudi-bot commented on pull request #4587: [HUDI-3236] use fields'comments persisted in catalog to fill in schema

2022-01-18 Thread GitBox
hudi-bot commented on pull request #4587: URL: https://github.com/apache/hudi/pull/4587#issuecomment-1016095785 ## CI report: * 4ca67729c349de96fe204323b20ce98137c434cb Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #4587: [HUDI-3236] use fields'comments persisted in catalog to fill in schema

2022-01-18 Thread GitBox
hudi-bot removed a comment on pull request #4587: URL: https://github.com/apache/hudi/pull/4587#issuecomment-1016063475 ## CI report: * dec3b88d3cd1ea9475ce35b8789a721d940ae3f2 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[jira] [Updated] (HUDI-3166) Implement new HoodieIndex based on metadata indices

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3166: - Summary: Implement new HoodieIndex based on metadata indices (was: Implement new HoodieIndex typ

[jira] [Updated] (HUDI-2589) RFC: Metadata based index for bloom filter and column stats

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2589: - Story Points: 1 (was: 2) > RFC: Metadata based index for bloom filter and column stats >

[jira] [Updated] (HUDI-3143) Support multiple file groups for metadata table index partitions

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3143: - Sprint: Hudi-Sprint-Jan-18 > Support multiple file groups for metadata table index partitions > --

[jira] [Updated] (HUDI-3273) Performance: Metadata table log file scanning and base file merging are repeated for each keys lookup request

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3273: - Sprint: Hudi-Sprint-Jan-18 > Performance: Metadata table log file scanning and base file merging a

[jira] [Updated] (HUDI-3144) Make metadata table getRecordsByKeys() operations more performant by doing range reads

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3144: - Sprint: Hudi-Sprint-Jan-18 > Make metadata table getRecordsByKeys() operations more performant by

[jira] [Updated] (HUDI-2973) Rewrite/re-publish RFC for Data skipping index

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2973: - Sprint: Hudi-Sprint-Jan-18 > Rewrite/re-publish RFC for Data skipping index >

[jira] [Updated] (HUDI-2589) RFC: Metadata based index for bloom filter and column stats

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2589: - Sprint: Hudi-Sprint-Jan-18 > RFC: Metadata based index for bloom filter and column stats > ---

[jira] [Updated] (HUDI-1180) Upgrade HBase to 2.x

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1180: - Sprint: Hudi-Sprint-Jan-18 > Upgrade HBase to 2.x > > > Key:

[jira] [Updated] (HUDI-1370) Scoping work needed to support bootstrap and RFC-15 together

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1370: - Sprint: Hudi-Sprint-Jan-18 > Scoping work needed to support bootstrap and RFC-15 together > --

[GitHub] [hudi] hudi-bot commented on pull request #4078: [HUDI-2833][Design] Merge small archive files instead of expanding indefinitely.

2022-01-18 Thread GitBox
hudi-bot commented on pull request #4078: URL: https://github.com/apache/hudi/pull/4078#issuecomment-1016090319 ## CI report: * 14a63372e1dbe358fef2bd9d8033adc4997d7767 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #4078: [HUDI-2833][Design] Merge small archive files instead of expanding indefinitely.

2022-01-18 Thread GitBox
hudi-bot removed a comment on pull request #4078: URL: https://github.com/apache/hudi/pull/4078#issuecomment-1016085460 ## CI report: * 5ba0b03136e6ecf6199c88363f3a99cd60207b9f UNKNOWN * 14a63372e1dbe358fef2bd9d8033adc4997d7767 Azure: [SUCCESS](https://dev.azure.com/apache-hud

[jira] [Assigned] (HUDI-2708) Support boostrapping of metadata table even when async table service is in progress

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-2708: Assignee: Sagar Sumit (was: Vinoth Chandar) > Support boostrapping of metadata table even

[jira] [Updated] (HUDI-2708) Support boostrapping of metadata table even when async table service is in progress

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2708: - Priority: Blocker (was: Critical) > Support boostrapping of metadata table even when async table

[jira] [Assigned] (HUDI-2458) Relax compaction in metadata being fenced based on inflight requests in data table

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-2458: Assignee: Ethan Guo (was: sivabalan narayanan) > Relax compaction in metadata being fenced

[jira] [Updated] (HUDI-2458) Relax compaction in metadata being fenced based on inflight requests in data table

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2458: - Priority: Blocker (was: Major) > Relax compaction in metadata being fenced based on inflight requ

[jira] [Closed] (HUDI-2005) Audit and remove references of fs.listStatus() and fs.getFileStatus() or fs.exists()

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar closed HUDI-2005. Fix Version/s: 0.10.0 (was: 0.11.0) Resolution: Fixed > Audit and remo

[jira] [Updated] (HUDI-1492) Enhance DeltaWriteStat with block level metadata correctly for storage schemes that support appends

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1492: - Priority: Blocker (was: Critical) > Enhance DeltaWriteStat with block level metadata correctly fo

[jira] [Updated] (HUDI-1492) Enhance DeltaWriteStat with block level metadata correctly for storage schemes that support appends

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1492: - Epic Link: HUDI-1822 (was: HUDI-1292) > Enhance DeltaWriteStat with block level metadata correctl

[jira] [Assigned] (HUDI-1370) Scoping work needed to support bootstrap and RFC-15 together

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-1370: Assignee: Ethan Guo (was: Manoj Govindassamy) > Scoping work needed to support bootstrap a

[jira] [Closed] (HUDI-2472) Tests failure follow up when metadata is enabled by default

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar closed HUDI-2472. Resolution: Fixed > Tests failure follow up when metadata is enabled by default > --

[jira] [Assigned] (HUDI-1180) Upgrade HBase to 2.x

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-1180: Assignee: Ethan Guo (was: Vinoth Chandar) > Upgrade HBase to 2.x > >

[jira] [Assigned] (HUDI-3208) Come up with rollout plan for enabling metadata table by default in 0.11

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-3208: Assignee: Ethan Guo (was: Manoj Govindassamy) > Come up with rollout plan for enabling met

[jira] [Updated] (HUDI-1822) [Umbrella] Multi Modal Indexing

2022-01-18 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy updated HUDI-1822: - Priority: Blocker (was: Major) > [Umbrella] Multi Modal Indexing > --

[jira] [Updated] (HUDI-1292) [Umbrella] RFC-15 : Metadata Table for File Listing and other table metadata

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1292: - Summary: [Umbrella] RFC-15 : Metadata Table for File Listing and other table metadata (was: [Umbr

[GitHub] [hudi] hudi-bot removed a comment on pull request #4078: [HUDI-2833][Design] Merge small archive files instead of expanding indefinitely.

2022-01-18 Thread GitBox
hudi-bot removed a comment on pull request #4078: URL: https://github.com/apache/hudi/pull/4078#issuecomment-1016083856 ## CI report: * 5ba0b03136e6ecf6199c88363f3a99cd60207b9f UNKNOWN * Unknown: [CANCELED](TBD) * 14a63372e1dbe358fef2bd9d8033adc4997d7767 UNKNOWN

[GitHub] [hudi] hudi-bot commented on pull request #4078: [HUDI-2833][Design] Merge small archive files instead of expanding indefinitely.

2022-01-18 Thread GitBox
hudi-bot commented on pull request #4078: URL: https://github.com/apache/hudi/pull/4078#issuecomment-1016085460 ## CI report: * 5ba0b03136e6ecf6199c88363f3a99cd60207b9f UNKNOWN * 14a63372e1dbe358fef2bd9d8033adc4997d7767 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org

[jira] [Updated] (HUDI-1292) [Umbrella] RFC-15 : File Listing and Query Planning Optimizations

2022-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-1292: Priority: Blocker (was: Critical) > [Umbrella] RFC-15 : File Listing and Query Planning Optimizations > --

[jira] [Updated] (HUDI-3274) Support Time travel

2022-01-18 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3274: - Fix Version/s: 0.11.0 > Support Time travel > --- > > Key: HUDI-3274 >

[jira] [Assigned] (HUDI-3221) Support querying a table as of a savepoint

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-3221: Assignee: Raymond Xu > Support querying a table as of a savepoint > ---

[GitHub] [hudi] hudi-bot commented on pull request #4083: [HUDI-2837] Add support for using database name in incremental query

2022-01-18 Thread GitBox
hudi-bot commented on pull request #4083: URL: https://github.com/apache/hudi/pull/4083#issuecomment-1016083898 ## CI report: * 00221c82e8b1693280fd72625eafcd503d54323c UNKNOWN * 46053bb143d1fd1274ac466197cc9361708e738b UNKNOWN * 2722bfcfd29a95f27338c1c8b026185472eefba0 UNKN

[GitHub] [hudi] hudi-bot removed a comment on pull request #4083: [HUDI-2837] Add support for using database name in incremental query

2022-01-18 Thread GitBox
hudi-bot removed a comment on pull request #4083: URL: https://github.com/apache/hudi/pull/4083#issuecomment-1016025436 ## CI report: * 00221c82e8b1693280fd72625eafcd503d54323c UNKNOWN * 46053bb143d1fd1274ac466197cc9361708e738b UNKNOWN * 2722bfcfd29a95f27338c1c8b026185472eef

[GitHub] [hudi] hudi-bot commented on pull request #4078: [HUDI-2833][Design] Merge small archive files instead of expanding indefinitely.

2022-01-18 Thread GitBox
hudi-bot commented on pull request #4078: URL: https://github.com/apache/hudi/pull/4078#issuecomment-1016083856 ## CI report: * 5ba0b03136e6ecf6199c88363f3a99cd60207b9f UNKNOWN * Unknown: [CANCELED](TBD) * 14a63372e1dbe358fef2bd9d8033adc4997d7767 UNKNOWN Bot

[GitHub] [hudi] hudi-bot removed a comment on pull request #4078: [HUDI-2833][Design] Merge small archive files instead of expanding indefinitely.

2022-01-18 Thread GitBox
hudi-bot removed a comment on pull request #4078: URL: https://github.com/apache/hudi/pull/4078#issuecomment-1016071152 ## CI report: * 5ba0b03136e6ecf6199c88363f3a99cd60207b9f UNKNOWN * Unknown: [CANCELED](TBD) Bot commands @hudi-bot supports the followin

[jira] [Updated] (HUDI-3221) Support querying a table as of a savepoint

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3221: - Epic Link: HUDI-3274 (was: HUDI-3220) > Support querying a table as of a savepoint >

[jira] [Deleted] (HUDI-3220) [UMBRELLA] Hudi Query Improvements

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar deleted HUDI-3220: - > [UMBRELLA] Hudi Query Improvements > -- > > Key: HU

[jira] [Assigned] (HUDI-3220) [UMBRELLA] Hudi Query Improvements

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-3220: Assignee: Vinoth Chandar > [UMBRELLA] Hudi Query Improvements > ---

[jira] [Updated] (HUDI-2235) [UMBRELLA] Keys support in Hudi

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2235: - Fix Version/s: 0.12.0 (was: 0.11.0) > [UMBRELLA] Keys support in Hudi > ---

[GitHub] [hudi] VIKASPATID opened a new issue #4635: [SUPPORT] Bulk writing failing due to hudi timeline archive exception

2022-01-18 Thread GitBox
VIKASPATID opened a new issue #4635: URL: https://github.com/apache/hudi/issues/4635 Seeing repetitive error when bulk writing to cow table, error message not very clear. Please note that we were able to write bunch of files to hudi successfully and started getting this error. **Con

[hudi] branch asf-site updated (187159c -> 67f0096)

2022-01-18 Thread github-bot
This is an automated email from the ASF dual-hosted git repository. github-bot pushed a change to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git. from 187159c [HUDI-3151] Docs for Spark SQL type support 0.10.0+ (#4634) add 67f0096 GitHub Actions build asf-s

[hudi] branch asf-site updated (cf61e5a -> 187159c)

2022-01-18 Thread xushiyan
This is an automated email from the ASF dual-hosted git repository. xushiyan pushed a change to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git. from cf61e5a GitHub Actions build asf-site add 187159c [HUDI-3151] Docs for Spark SQL type support 0.10.0+ (#4634

[jira] [Assigned] (HUDI-1859) [UMBRELLA] RFC - 14 : JDBC incremental puller

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-1859: Assignee: Harshal Patil (was: Sagar Sumit) > [UMBRELLA] RFC - 14 : JDBC incremental puller

[jira] [Updated] (HUDI-2757) Support AWS Glue API for metastore sync

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2757: - Reviewers: Vinoth Chandar (was: Rajesh Mahindra) > Support AWS Glue API for metastore sync >

[jira] [Updated] (HUDI-2687) [UMBRELLA] A new Trino connector for Hudi

2022-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-2687: Priority: Blocker (was: Major) > [UMBRELLA] A new Trino connector for Hudi > --

[jira] [Updated] (HUDI-2687) [UMBRELLA] A new Trino connector for Hudi

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2687: - Fix Version/s: 0.11.0 > [UMBRELLA] A new Trino connector for Hudi > --

[jira] [Commented] (HUDI-2968) Support Delete/Update using non-pk fields

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17478332#comment-17478332 ] Vinoth Chandar commented on HUDI-2968: -- [~biyan900...@gmail.com] merge works already

[jira] [Updated] (HUDI-2531) [UMBRELLA] Support Dataset APIs in writer paths

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2531: - Fix Version/s: 0.12.0 (was: 0.11.0) > [UMBRELLA] Support Dataset APIs in wr

[jira] [Resolved] (HUDI-3151) Docs for Spark SQL type support

2022-01-18 Thread Forward Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Forward Xu resolved HUDI-3151. -- > Docs for Spark SQL type support > --- > > Key: HUDI-3151 >

[jira] [Updated] (HUDI-2531) [UMBRELLA] Support Dataset APIs in writer paths

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2531: - Fix Version/s: 0.11.0 > [UMBRELLA] Support Dataset APIs in writer paths >

[jira] [Updated] (HUDI-1297) [Umbrella] Spark Datasource Support

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1297: - Fix Version/s: 0.11.0 > [Umbrella] Spark Datasource Support > ---

[jira] [Updated] (HUDI-1658) [UMBRELLA] Spark Sql Support For Hudi

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1658: - Fix Version/s: 0.11.0 > [UMBRELLA] Spark Sql Support For Hudi > --

[jira] [Updated] (HUDI-3081) Revisiting Read Path Infra across Query Engines

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-3081: - Fix Version/s: 0.11.0 > Revisiting Read Path Infra across Query Engines >

[GitHub] [hudi] hudi-bot commented on pull request #3745: [HUDI-2514] Add default hiveTableSerdeProperties for Spark SQL when sync Hive

2022-01-18 Thread GitBox
hudi-bot commented on pull request #3745: URL: https://github.com/apache/hudi/pull/3745#issuecomment-1016072357 ## CI report: * 071b13bfe3ebee1875d12432e550c8718566bfd4 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?b

[GitHub] [hudi] hudi-bot removed a comment on pull request #3745: [HUDI-2514] Add default hiveTableSerdeProperties for Spark SQL when sync Hive

2022-01-18 Thread GitBox
hudi-bot removed a comment on pull request #3745: URL: https://github.com/apache/hudi/pull/3745#issuecomment-1016024041 ## CI report: * 08a1b1dcd8d9def2dd0367fab3c39980a14ac310 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/r

[jira] [Updated] (HUDI-1822) [Umbrella] Multi Modal Indexing

2022-01-18 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1822: - Fix Version/s: 0.11.0 > [Umbrella] Multi Modal Indexing > --- > >

  1   2   3   4   5   6   7   >