[jira] [Updated] (HUDI-2646) Unify configurations for clustering execution strategy and layout optimization

2022-01-18 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-2646: -- Reviewers: sivabalan narayanan (was: Y Ethan Guo) > Unify configurations for clustering executi

[jira] [Updated] (HUDI-2872) Enable data skipping index even for sort based clustering

2022-01-18 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-2872: -- Reviewers: sivabalan narayanan (was: Y Ethan Guo) > Enable data skipping index even for sort ba

[jira] [Resolved] (HUDI-3179) Extract common Hudi Table File Index implementation

2022-01-18 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin resolved HUDI-3179. --- > Extract common Hudi Table File Index implementation > -

[jira] [Updated] (HUDI-3239) Convert AbstractHoodieTableFileIndex to Java

2022-01-18 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3239: -- Priority: Blocker (was: Major) > Convert AbstractHoodieTableFileIndex to Java > ---

[jira] [Updated] (HUDI-3239) Convert AbstractHoodieTableFileIndex to Java

2022-01-18 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3239: -- Fix Version/s: 0.11.0 > Convert AbstractHoodieTableFileIndex to Java > -

[jira] [Assigned] (HUDI-3239) Convert AbstractHoodieTableFileIndex to Java

2022-01-18 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin reassigned HUDI-3239: - Assignee: Alexey Kudinkin > Convert AbstractHoodieTableFileIndex to Java > --

[jira] [Created] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-19 Thread Alexey Kudinkin (Jira)
Alexey Kudinkin created HUDI-3279: - Summary: Metadata table stores incorrect file sizes after Restore Key: HUDI-3279 URL: https://issues.apache.org/jira/browse/HUDI-3279 Project: Apache Hudi

[jira] [Updated] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3279: -- Attachment: Screen Shot 2022-01-19 at 12.18.27 PM.png Description: While working on [https:

[jira] [Updated] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3279: -- Fix Version/s: 0.11.0 > Metadata table stores incorrect file sizes after Restore > -

[jira] [Updated] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3279: -- Description: While working on https://github.com/apache/hudi/pull/4556, I have stumbled upon

[jira] [Assigned] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin reassigned HUDI-3279: - Assignee: Alexey Kudinkin > Metadata table stores incorrect file sizes after Restore > --

[jira] [Updated] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3279: -- Description: While working on https://github.com/apache/hudi/pull/4556, I have stumbled upon

[jira] [Updated] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3279: -- Description: While working on https://github.com/apache/hudi/pull/4556, I have stumbled upon

[jira] [Closed] (HUDI-3191) Rebase Hive's FileInputFormat onto AbstractHoodieTableFileIndex

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin closed HUDI-3191. - Resolution: Fixed > Rebase Hive's FileInputFormat onto AbstractHoodieTableFileIndex >

[jira] [Closed] (HUDI-3179) Extract common Hudi Table File Index implementation

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin closed HUDI-3179. - Resolution: Fixed > Extract common Hudi Table File Index implementation > ---

[jira] [Closed] (HUDI-3094) Unify Hive `FileInputFormat` implementations

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin closed HUDI-3094. - Resolution: Fixed > Unify Hive `FileInputFormat` implementations > ---

[jira] [Created] (HUDI-3280) Clean up unused/deprecated methods

2022-01-19 Thread Alexey Kudinkin (Jira)
Alexey Kudinkin created HUDI-3280: - Summary: Clean up unused/deprecated methods Key: HUDI-3280 URL: https://issues.apache.org/jira/browse/HUDI-3280 Project: Apache Hudi Issue Type: Task

[jira] [Updated] (HUDI-3280) Clean up unused/deprecated methods

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3280: -- Priority: Blocker (was: Major) > Clean up unused/deprecated methods > -

[jira] [Updated] (HUDI-3280) Clean up unused/deprecated methods

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3280: -- Fix Version/s: 0.11.0 > Clean up unused/deprecated methods > --

[jira] [Assigned] (HUDI-3280) Clean up unused/deprecated methods

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin reassigned HUDI-3280: - Assignee: Alexey Kudinkin > Clean up unused/deprecated methods >

[jira] [Updated] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3279: -- Description: While working on https://github.com/apache/hudi/pull/4556, I have stumbled upon

[jira] [Commented] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17479086#comment-17479086 ] Alexey Kudinkin commented on HUDI-3279: --- This is an example of test failing in CI:

[jira] [Updated] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-19 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3279: -- Attachment: Screen Shot 2022-01-19 at 7.56.37 PM.png > Metadata table stores incorrect file size

[jira] [Created] (HUDI-3302) Re-evaluate handling of LogBlock appends when Compaction is pending

2022-01-21 Thread Alexey Kudinkin (Jira)
Alexey Kudinkin created HUDI-3302: - Summary: Re-evaluate handling of LogBlock appends when Compaction is pending Key: HUDI-3302 URL: https://issues.apache.org/jira/browse/HUDI-3302 Project: Apache Hud

[jira] [Closed] (HUDI-2816) Unify file listing method of Spark/Flink/Hive

2022-01-21 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin closed HUDI-2816. - Resolution: Duplicate > Unify file listing method of Spark/Flink/Hive > --

[jira] [Updated] (HUDI-3276) Make HoodieParquetInputFormat extend MapredParquetInputFormat again

2022-01-21 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3276: -- Reviewers: Vinoth Chandar, Y Ethan Guo > Make HoodieParquetInputFormat extend MapredParquetInput

[jira] [Updated] (HUDI-3276) Make HoodieParquetInputFormat extend MapredParquetInputFormat again

2022-01-21 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3276: -- Status: Patch Available (was: In Progress) > Make HoodieParquetInputFormat extend MapredParquet

[jira] [Updated] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-24 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3279: -- Sprint: Hudi-Sprint-Jan-18 > Metadata table stores incorrect file sizes after Restore >

[jira] [Updated] (HUDI-2751) To avoid the duplicates for streaming read MOR table

2022-01-24 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-2751: -- Status: In Progress (was: Open) > To avoid the duplicates for streaming read MOR table > --

[jira] [Updated] (HUDI-2928) Evaluate rebasing Hudi's default compression from Gzip to Zstd

2022-01-24 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-2928: -- Sprint: Hudi-Sprint-Jan-10 (was: Hudi-Sprint-Jan-10, Hudi-Sprint-Jan-18) > Evaluate rebasing Hu

[jira] [Updated] (HUDI-3247) Support incremental queries in AbstractHoodieTableFileIndex

2022-01-24 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3247: -- Sprint: (was: Hudi-Sprint-Jan-18) > Support incremental queries in AbstractHoodieTableFileInde

[jira] [Updated] (HUDI-3218) Upgrade Avro to 1.10.2

2022-01-24 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3218: -- Sprint: Hudi-Sprint-Jan-10 (was: Hudi-Sprint-Jan-10, Hudi-Sprint-Jan-18) > Upgrade Avro to 1.10

[jira] [Updated] (HUDI-3239) Convert AbstractHoodieTableFileIndex to Java

2022-01-24 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3239: -- Story Points: 0 (was: 3) > Convert AbstractHoodieTableFileIndex to Java > -

[jira] [Updated] (HUDI-3276) Make HoodieParquetInputFormat extend MapredParquetInputFormat again

2022-01-24 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3276: -- Story Points: 0 (was: 2) > Make HoodieParquetInputFormat extend MapredParquetInputFormat again

[jira] [Updated] (HUDI-3280) Clean up unused/deprecated methods

2022-01-24 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3280: -- Sprint: Hudi-Sprint-Jan-25 > Clean up unused/deprecated methods > --

[jira] [Assigned] (HUDI-3318) Write RFC regarding proposed changes to the RecordPayload hierarchy

2022-01-24 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin reassigned HUDI-3318: - Assignee: Alexey Kudinkin > Write RFC regarding proposed changes to the RecordPayload hie

[jira] [Created] (HUDI-3318) Write RFC regarding proposed changes to the RecordPayload hierarchy

2022-01-24 Thread Alexey Kudinkin (Jira)
Alexey Kudinkin created HUDI-3318: - Summary: Write RFC regarding proposed changes to the RecordPayload hierarchy Key: HUDI-3318 URL: https://issues.apache.org/jira/browse/HUDI-3318 Project: Apache Hud

[jira] [Updated] (HUDI-3318) Write RFC regarding proposed changes to the RecordPayload hierarchy

2022-01-24 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3318: -- Fix Version/s: 0.11.0 > Write RFC regarding proposed changes to the RecordPayload hierarchy > --

[jira] [Updated] (HUDI-3318) Write RFC regarding proposed changes to the RecordPayload hierarchy

2022-01-24 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3318: -- Sprint: Hudi-Sprint-Jan-25 > Write RFC regarding proposed changes to the RecordPayload hierarchy

[jira] [Updated] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-24 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3279: -- Story Points: 2 > Metadata table stores incorrect file sizes after Restore > ---

[jira] [Created] (HUDI-3322) Rollback of Delta Commits performed incorrectly for MOR tables

2022-01-25 Thread Alexey Kudinkin (Jira)
Alexey Kudinkin created HUDI-3322: - Summary: Rollback of Delta Commits performed incorrectly for MOR tables Key: HUDI-3322 URL: https://issues.apache.org/jira/browse/HUDI-3322 Project: Apache Hudi

[jira] [Updated] (HUDI-3322) Rollback of Delta Commits performed incorrectly for MOR tables

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Description: Diving deeper into the issue of HUDI-3279, I've realized that the root cause of th

[jira] [Updated] (HUDI-3322) Rollback of Delta Commits performed incorrectly for MOR tables

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Priority: Blocker (was: Major) > Rollback of Delta Commits performed incorrectly for MOR tables

[jira] [Updated] (HUDI-3322) Rollback of Delta Commits performed incorrectly for MOR tables

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Fix Version/s: 0.11.0 > Rollback of Delta Commits performed incorrectly for MOR tables > ---

[jira] [Updated] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3279: -- Status: Open (was: In Progress) > Metadata table stores incorrect file sizes after Restore > --

[jira] [Commented] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17482123#comment-17482123 ] Alexey Kudinkin commented on HUDI-3279: --- Given the findings of HUDI-3322, this actua

[jira] [Updated] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly for MOR tables

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Summary: Rollback Plan for Delta Commits constructed incorrectly for MOR tables (was: Rollback

[jira] [Updated] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Description: Diving deeper into the issue of HUDI-3279, I've realized that the root cause of th

[jira] [Updated] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Summary: Rollback Plan for Delta Commits constructed incorrectly (was: Rollback Plan for Delta

[jira] [Updated] (HUDI-3217) RFC-XX: Revisit Record Payload handling

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3217: -- Labels: hudi-umbrellas (was: ) > RFC-XX: Revisit Record Payload handling >

[jira] [Updated] (HUDI-3217) RFC-XX: Revisit Record Payload handling

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3217: -- Summary: RFC-XX: Revisit Record Payload handling (was: Revisit Record Payload handling) > RFC-

[jira] [Updated] (HUDI-3217) RFC-46: Revisit Record Payload handling

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3217: -- Summary: RFC-46: Revisit Record Payload handling (was: RFC-XX: Revisit Record Payload handling)

[jira] [Updated] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Description: Diving deeper into the issue of HUDI-3279, I've realized that the root cause of th

[jira] [Updated] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Description: Diving deeper into the issue of HUDI-3279, I've realized that the root cause of th

[jira] [Updated] (HUDI-3217) RFC-46: Optimize Record Payload handling

2022-01-25 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3217: -- Summary: RFC-46: Optimize Record Payload handling (was: RFC-46: Revisit Record Payload handling

[jira] [Updated] (HUDI-2751) To avoid the duplicates for streaming read MOR table

2022-01-26 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-2751: -- Status: Open (was: In Progress) > To avoid the duplicates for streaming read MOR table > --

[jira] [Updated] (HUDI-3318) Write RFC regarding proposed changes to the RecordPayload hierarchy

2022-01-26 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3318: -- Status: In Progress (was: Open) > Write RFC regarding proposed changes to the RecordPayload hie

[jira] [Assigned] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly

2022-01-26 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin reassigned HUDI-3322: - Assignee: Alexey Kudinkin > Rollback Plan for Delta Commits constructed incorrectly > ---

[jira] [Updated] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly

2022-01-27 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Sprint: Hudi-Sprint-Jan-24 > Rollback Plan for Delta Commits constructed incorrectly > -

[jira] [Updated] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly

2022-01-27 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Story Points: 2 > Rollback Plan for Delta Commits constructed incorrectly >

[jira] [Updated] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly

2022-01-27 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Epic Link: HUDI-3081 > Rollback Plan for Delta Commits constructed incorrectly > ---

[jira] [Created] (HUDI-3337) ParquetUtils fails extracting Parquet Column Range Metadata

2022-01-27 Thread Alexey Kudinkin (Jira)
Alexey Kudinkin created HUDI-3337: - Summary: ParquetUtils fails extracting Parquet Column Range Metadata Key: HUDI-3337 URL: https://issues.apache.org/jira/browse/HUDI-3337 Project: Apache Hudi

[jira] [Updated] (HUDI-3337) ParquetUtils fails extracting Parquet Column Range Metadata

2022-01-27 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3337: -- Sprint: Hudi-Sprint-Jan-24 > ParquetUtils fails extracting Parquet Column Range Metadata > -

[jira] [Assigned] (HUDI-3337) ParquetUtils fails extracting Parquet Column Range Metadata

2022-01-27 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin reassigned HUDI-3337: - Assignee: Alexey Kudinkin > ParquetUtils fails extracting Parquet Column Range Metadata >

[jira] [Updated] (HUDI-3337) ParquetUtils fails extracting Parquet Column Range Metadata

2022-01-27 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3337: -- Fix Version/s: 0.11.0 > ParquetUtils fails extracting Parquet Column Range Metadata > --

[jira] [Updated] (HUDI-3342) MOR Delta Block rollbacks is not working correctly if lazy Block reading is disabled

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3342: -- Priority: Blocker (was: Major) > MOR Delta Block rollbacks is not working correctly if lazy Blo

[jira] [Created] (HUDI-3342) MOR Delta Block rollbacks is not working correctly if lazy Block reading is disabled

2022-01-28 Thread Alexey Kudinkin (Jira)
Alexey Kudinkin created HUDI-3342: - Summary: MOR Delta Block rollbacks is not working correctly if lazy Block reading is disabled Key: HUDI-3342 URL: https://issues.apache.org/jira/browse/HUDI-3342 Pr

[jira] [Updated] (HUDI-3342) MOR Delta Block rollbacks is not working correctly if lazy Block reading is disabled

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3342: -- Fix Version/s: 0.11.0 > MOR Delta Block rollbacks is not working correctly if lazy Block reading

[jira] [Updated] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Story Points: 4 (was: 2) > Rollback Plan for Delta Commits constructed incorrectly > --

[jira] [Updated] (HUDI-3342) MOR Delta Block rollbacks is not working correctly if lazy Block reading is disabled

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3342: -- Description: While working on HUDI-3322, I've spotted the following contraption: When we are rollin

[jira] [Updated] (HUDI-3342) MOR Delta Block Rollbacks not applied if Lazy Block reading is disabled

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3342: -- Summary: MOR Delta Block Rollbacks not applied if Lazy Block reading is disabled (was: MOR Delt

[jira] [Updated] (HUDI-3342) MOR Delta Block Rollbacks not applied if Lazy Block reading is disabled

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3342: -- Description: While working on HUDI-3322, I've spotted the following contraption: When we are rollin

[jira] [Reopened] (HUDI-3180) Include only files belonging to completed commits while bootstrapping metadata table

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin reopened HUDI-3180: --- > Include only files belonging to completed commits while bootstrapping > metadata table > --

[jira] [Commented] (HUDI-3180) Include only files belonging to completed commits while bootstrapping metadata table

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17484010#comment-17484010 ] Alexey Kudinkin commented on HUDI-3180: --- [~shivnarayan] this is not working correctl

[jira] [Comment Edited] (HUDI-3180) Include only files belonging to completed commits while bootstrapping metadata table

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17484010#comment-17484010 ] Alexey Kudinkin edited comment on HUDI-3180 at 1/28/22, 11:08 PM: --

[jira] [Created] (HUDI-3343) Metadata Table includes Uncommitted Log Files during Bootstrap

2022-01-28 Thread Alexey Kudinkin (Jira)
Alexey Kudinkin created HUDI-3343: - Summary: Metadata Table includes Uncommitted Log Files during Bootstrap Key: HUDI-3343 URL: https://issues.apache.org/jira/browse/HUDI-3343 Project: Apache Hudi

[jira] [Updated] (HUDI-3343) Metadata Table includes Uncommitted Log Files during Bootstrap

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3343: -- Priority: Blocker (was: Major) > Metadata Table includes Uncommitted Log Files during Bootstrap

[jira] [Updated] (HUDI-3343) Metadata Table includes Uncommitted Log Files during Bootstrap

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3343: -- Fix Version/s: 0.11.0 > Metadata Table includes Uncommitted Log Files during Bootstrap > ---

[jira] [Assigned] (HUDI-3343) Metadata Table includes Uncommitted Log Files during Bootstrap

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin reassigned HUDI-3343: - Assignee: Alexey Kudinkin > Metadata Table includes Uncommitted Log Files during Bootstra

[jira] [Updated] (HUDI-3343) Metadata Table includes Uncommitted Log Files during Bootstrap

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3343: -- Story Points: 2 > Metadata Table includes Uncommitted Log Files during Bootstrap > -

[jira] [Updated] (HUDI-3343) Metadata Table includes Uncommitted Log Files during Bootstrap

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3343: -- Sprint: Hudi-Sprint-Jan-24 > Metadata Table includes Uncommitted Log Files during Bootstrap > --

[jira] [Updated] (HUDI-3343) Metadata Table includes Uncommitted Log Files during Bootstrap

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3343: -- Reviewers: sivabalan narayanan, Y Ethan Guo > Metadata Table includes Uncommitted Log Files duri

[jira] [Updated] (HUDI-3343) Metadata Table includes Uncommitted Log Files during Bootstrap

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3343: -- Status: In Progress (was: Open) > Metadata Table includes Uncommitted Log Files during Bootstra

[jira] [Updated] (HUDI-3343) Metadata Table includes Uncommitted Log Files during Bootstrap

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3343: -- Status: Patch Available (was: In Progress) > Metadata Table includes Uncommitted Log Files duri

[jira] [Updated] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Status: Patch Available (was: In Progress) > Rollback Plan for Delta Commits constructed incorr

[jira] [Updated] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Reviewers: sivabalan narayanan, Y Ethan Guo > Rollback Plan for Delta Commits constructed incorr

[jira] [Updated] (HUDI-3318) Write RFC regarding proposed changes to the RecordPayload hierarchy

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3318: -- Status: Patch Available (was: In Progress) > Write RFC regarding proposed changes to the Record

[jira] [Updated] (HUDI-3337) ParquetUtils fails extracting Parquet Column Range Metadata

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3337: -- Status: Patch Available (was: In Progress) > ParquetUtils fails extracting Parquet Column Range

[jira] [Closed] (HUDI-3279) Metadata table stores incorrect file sizes after Restore

2022-01-28 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin closed HUDI-3279. - Resolution: Duplicate > Metadata table stores incorrect file sizes after Restore > ---

[jira] [Commented] (HUDI-2762) Ensure hive can query insert only logs in MOR

2022-01-31 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17484984#comment-17484984 ] Alexey Kudinkin commented on HUDI-2762: --- To reproduce: Just follow Kafka Connect gui

[jira] [Updated] (HUDI-1127) Handling late arriving Deletes

2022-01-31 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-1127: -- Status: Open (was: In Progress) > Handling late arriving Deletes >

[jira] [Updated] (HUDI-1127) Handling late arriving Deletes

2022-01-31 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-1127: -- Sprint: Hudi-Sprint-Jan-24 > Handling late arriving Deletes > -- > >

[jira] [Updated] (HUDI-3239) Convert AbstractHoodieTableFileIndex to Java

2022-01-31 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3239: -- Reviewers: Y Ethan Guo > Convert AbstractHoodieTableFileIndex to Java >

[jira] [Updated] (HUDI-3337) ParquetUtils fails extracting Parquet Column Range Metadata

2022-01-31 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3337: -- Reviewers: Manoj Govindassamy, sivabalan narayanan > ParquetUtils fails extracting Parquet Colum

[jira] [Updated] (HUDI-3318) Write RFC regarding proposed changes to the RecordPayload hierarchy

2022-01-31 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3318: -- Story Points: 0 (was: 5) > Write RFC regarding proposed changes to the RecordPayload hierarchy

[jira] [Updated] (HUDI-3322) Rollback Plan for Delta Commits constructed incorrectly

2022-01-31 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3322: -- Story Points: 0 (was: 4) > Rollback Plan for Delta Commits constructed incorrectly > --

[jira] [Updated] (HUDI-3337) ParquetUtils fails extracting Parquet Column Range Metadata

2022-01-31 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3337: -- Story Points: 0 (was: 1) > ParquetUtils fails extracting Parquet Column Range Metadata > --

[jira] [Updated] (HUDI-3343) Metadata Table includes Uncommitted Log Files during Bootstrap

2022-01-31 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-3343: -- Story Points: 0 (was: 2) > Metadata Table includes Uncommitted Log Files during Bootstrap > ---

[jira] [Created] (HUDI-3348) HoodieRealtimeFileSplit losing info when serialized/deserialized

2022-01-31 Thread Alexey Kudinkin (Jira)
Alexey Kudinkin created HUDI-3348: - Summary: HoodieRealtimeFileSplit losing info when serialized/deserialized Key: HUDI-3348 URL: https://issues.apache.org/jira/browse/HUDI-3348 Project: Apache Hudi

[jira] [Assigned] (HUDI-3348) HoodieRealtimeFileSplit losing info when serialized/deserialized

2022-01-31 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin reassigned HUDI-3348: - Assignee: sivabalan narayanan > HoodieRealtimeFileSplit losing info when serialized/deser
