[jira] [Updated] (HUDI-8855) Add bucket properties for spark bucket index query pruning

2025-01-10 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-8855: - Description: we support bucket index pruning since HUDI-6207, but the configurations of bucket index doesn

[jira] [Updated] (HUDI-8855) Add bucket properties for spark bucket index query pruning

2025-01-10 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-8855: - Summary: Add bucket properties for spark bucket index query pruning (was: Support spark sql bucket index

[jira] [Updated] (HUDI-8855) Support spark sql bucket index query pruning

2025-01-10 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-8855: - Description: we support bucket index pruning by since HUDI-6207, but the configurations of bucket index do

[jira] [Created] (HUDI-8855) Support spark sql bucket index query pruning

2025-01-10 Thread xi chaomin (Jira)
xi chaomin created HUDI-8855: Summary: Support spark sql bucket index query pruning Key: HUDI-8855 URL: https://issues.apache.org/jira/browse/HUDI-8855 Project: Apache Hudi Issue Type: Bug

[jira] [Assigned] (HUDI-8328) Filegroup name seems incorrect for log file created with NBCC

2024-10-14 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin reassigned HUDI-8328: Assignee: xi chaomin > Filegroup name seems incorrect for log file created with NBCC >

[jira] [Updated] (HUDI-8198) DELETE_PARTITION does not work when table is partitioned by multiple fields

2024-09-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-8198: - Description: {code:java} fhiveMap writeOptions = getWriterOptions(); writeOptions.put(DataSourceWriteOptio

[jira] [Commented] (HUDI-8198) DELETE_PARTITION does not work when table is partitioned by multiple fields

2024-09-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17882599#comment-17882599 ] xi chaomin commented on HUDI-8198: -- Have you set hive_style_partitioning to True? > DELE

[jira] [Updated] (HUDI-7804) Improve flink bucket index partitioner

2024-05-28 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-7804: - Description: https://github.com/apache/hudi/issues/11288 > Improve flink bucket index partitioner > --

[jira] [Created] (HUDI-7804) Improve flink bucket index partitioner

2024-05-28 Thread xi chaomin (Jira)
xi chaomin created HUDI-7804: Summary: Improve flink bucket index partitioner Key: HUDI-7804 URL: https://issues.apache.org/jira/browse/HUDI-7804 Project: Apache Hudi Issue Type: Bug

[jira] [Created] (HUDI-7715) Partition TTL for Flink

2024-05-05 Thread xi chaomin (Jira)
xi chaomin created HUDI-7715: Summary: Partition TTL for Flink Key: HUDI-7715 URL: https://issues.apache.org/jira/browse/HUDI-7715 Project: Apache Hudi Issue Type: Improvement Reporte

[jira] [Created] (HUDI-7714) Partition TTL for Flink

2024-05-05 Thread xi chaomin (Jira)
xi chaomin created HUDI-7714: Summary: Partition TTL for Flink Key: HUDI-7714 URL: https://issues.apache.org/jira/browse/HUDI-7714 Project: Apache Hudi Issue Type: Improvement Reporte

[jira] [Updated] (HUDI-7685) Fix delete partition instant commit in partition TTL

2024-05-05 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-7685: - Description: After cherry pick [https://github.com/apache/hudi/pull/9723] I find the delete partition act

[jira] [Updated] (HUDI-7685) Fix bug in partition TTL

2024-04-28 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-7685: - Description: After cherry pick [https://github.com/apache/hudi/pull/9723] I find the delete partition act

[jira] [Updated] (HUDI-7685) Fix bug in partition TTL

2024-04-28 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-7685: - Description: After cherry pick [https://github.com/apache/hudi/pull/9723] I find the delete partition acti

[jira] [Updated] (HUDI-7685) Fix bug in partition TTL

2024-04-28 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-7685: - Description: After cherry pick [https://github.com/apache/hudi/pull/9723] I find the delete partition acti

[jira] [Created] (HUDI-7685) Fix bug in partition TTL

2024-04-28 Thread xi chaomin (Jira)
xi chaomin created HUDI-7685: Summary: Fix bug in partition TTL Key: HUDI-7685 URL: https://issues.apache.org/jira/browse/HUDI-7685 Project: Apache Hudi Issue Type: Bug Reporter: xi c

[jira] [Created] (HUDI-7514) Update Manifest file after the parquet writer closed in LSMTimelineWriter

2024-03-19 Thread xi chaomin (Jira)
xi chaomin created HUDI-7514: Summary: Update Manifest file after the parquet writer closed in LSMTimelineWriter Key: HUDI-7514 URL: https://issues.apache.org/jira/browse/HUDI-7514 Project: Apache Hudi

[jira] [Updated] (HUDI-7513) Add jackson-module-scala to spark bundle

2024-03-17 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-7513: - Description: When I do spark stream reading, get NoClassDefFoundError. {code:java} // code placeholder 24

[jira] [Updated] (HUDI-7513) Add jackson-module-scala to spark bundle

2024-03-17 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-7513: - Affects Version/s: 0.14.1 > Add jackson-module-scala to spark bundle > ---

[jira] [Updated] (HUDI-7513) Add jackson-module-scala to spark bundle

2024-03-17 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-7513: - Description: When I do spark stream read, get NoClassDefFoundError. {code:java} // code placeholder 24/03

[jira] [Created] (HUDI-7513) Add jackson-module-scala to spark bundle

2024-03-17 Thread xi chaomin (Jira)
xi chaomin created HUDI-7513: Summary: Add jackson-module-scala to spark bundle Key: HUDI-7513 URL: https://issues.apache.org/jira/browse/HUDI-7513 Project: Apache Hudi Issue Type: Bug

[jira] [Comment Edited] (HUDI-6946) Data Duplicates with range pruning while using hoodie.bloom.index.use.metadata

2023-10-20 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1063#comment-1063 ] xi chaomin edited comment on HUDI-6946 at 10/20/23 7:38 AM: Th

[jira] [Commented] (HUDI-6946) Data Duplicates with range pruning while using hoodie.bloom.index.use.metadata

2023-10-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1063#comment-1063 ] xi chaomin commented on HUDI-6946: -- This is caused by the format of the value in record k

[jira] [Updated] (HUDI-6946) Data Duplicates with range pruning while using hoodie.bloom.index.use.metadata

2023-10-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-6946: - Attachment: WX20231019-094414.png > Data Duplicates with range pruning while using hoodie.bloom.index.use.

[jira] [Updated] (HUDI-6946) Data Duplicates with range pruning while using hoodie.bloom.index.use.metadata

2023-10-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-6946: - Attachment: (was: WX20231019-094414.png) > Data Duplicates with range pruning while using hoodie.bloom

[jira] [Assigned] (HUDI-6946) Data Duplicates with range pruning while using hoodie.bloom.index.use.metadata

2023-10-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin reassigned HUDI-6946: Assignee: xi chaomin > Data Duplicates with range pruning while using hoodie.bloom.index.use.metada

[jira] [Updated] (HUDI-6946) Data Duplicates with range pruning while using hoodie.bloom.index.use.metadata

2023-10-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-6946: - Attachment: WX20231019-094414.png > Data Duplicates with range pruning while using hoodie.bloom.index.use.

[jira] [Created] (HUDI-6309) Fix bug when hive queries Array type

2023-06-01 Thread xi chaomin (Jira)
xi chaomin created HUDI-6309: Summary: Fix bug when hive queries Array type Key: HUDI-6309 URL: https://issues.apache.org/jira/browse/HUDI-6309 Project: Apache Hudi Issue Type: Bug Re

[jira] [Updated] (HUDI-6307) Sync TIMESTAMP_MILLIS to hive

2023-06-01 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-6307: - Description: https://github.com/apache/hudi/issues/8844#issuecomment-1571299404 > Sync TIMESTAMP_MILLIS t

[jira] [Created] (HUDI-6307) Sync TIMESTAMP_MILLIS to hive

2023-06-01 Thread xi chaomin (Jira)
xi chaomin created HUDI-6307: Summary: Sync TIMESTAMP_MILLIS to hive Key: HUDI-6307 URL: https://issues.apache.org/jira/browse/HUDI-6307 Project: Apache Hudi Issue Type: Improvement R

[jira] [Assigned] (HUDI-5018) Make user-provided copyOnWriteRecordSizeEstimate first precedence

2023-05-29 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin reassigned HUDI-5018: Assignee: (was: xi chaomin) > Make user-provided copyOnWriteRecordSizeEstimate first precedence

[jira] [Commented] (HUDI-6144) [Spark][Flink]bucket index and then insert data in bulk, the correct file cannot be created

2023-05-06 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17720215#comment-17720215 ] xi chaomin commented on HUDI-6144: -- Currently bucket index doesn't support bulk insert. T

[jira] [Created] (HUDI-6080) Use hoodie.properties to determine if the table exists

2023-04-14 Thread xi chaomin (Jira)
xi chaomin created HUDI-6080: Summary: Use hoodie.properties to determine if the table exists Key: HUDI-6080 URL: https://issues.apache.org/jira/browse/HUDI-6080 Project: Apache Hudi Issue Type:

[jira] [Closed] (HUDI-6058) Bump Curator referred in spark3 to 2.13.0

2023-04-10 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin closed HUDI-6058. Resolution: Not A Bug > Bump Curator referred in spark3 to 2.13.0 >

[jira] [Updated] (HUDI-6058) Bump Curator referred in spark3 to 2.13.0

2023-04-10 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-6058: - Summary: Bump Curator referred in spark3 to 2.13.0 (was: Curator in hudi conflicts with spark3) > Bump C

[jira] [Reopened] (HUDI-6058) Curator in hudi conflicts with spark3

2023-04-10 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin reopened HUDI-6058: -- Assignee: xi chaomin > Curator in hudi conflicts with spark3 > - >

[jira] [Closed] (HUDI-6058) Curator in hudi conflicts with spark3

2023-04-10 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin closed HUDI-6058. Resolution: Not A Bug > Curator in hudi conflicts with spark3 > - > >

[jira] [Updated] (HUDI-6058) Curator in hudi conflicts with spark3

2023-04-10 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-6058: - Summary: Curator in hudi conflicts with spark3 (was: curator-client in hudi conflicts with spark3) > Cur

[jira] [Created] (HUDI-6058) curator-client in hudi conflicts with spark3

2023-04-10 Thread xi chaomin (Jira)
xi chaomin created HUDI-6058: Summary: curator-client in hudi conflicts with spark3 Key: HUDI-6058 URL: https://issues.apache.org/jira/browse/HUDI-6058 Project: Apache Hudi Issue Type: Bug

[jira] [Updated] (HUDI-5921) Partition path should be considered in BucketIndexConcurrentFileWritesConflictResolutionStrategy

2023-03-12 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5921: - Summary: Partition path should be considered in BucketIndexConcurrentFileWritesConflictResolutionStrategy

[jira] [Updated] (HUDI-5921) Partition path should be considered BucketIndexConcurrentFileWritesConflictResolutionStrategy

2023-03-12 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5921: - Summary: Partition path should be considered BucketIndexConcurrentFileWritesConflictResolutionStrategy (

[jira] [Created] (HUDI-5921) Partition path should be considered BucketIndexConcurrentFileWritesConflictResolutionStrategy should

2023-03-12 Thread xi chaomin (Jira)
xi chaomin created HUDI-5921: Summary: Partition path should be considered BucketIndexConcurrentFileWritesConflictResolutionStrategy should Key: HUDI-5921 URL: https://issues.apache.org/jira/browse/HUDI-5921

[jira] [Created] (HUDI-5911) SimpleTransactionDirectMarkerBasedDetectionStrategy can't work with none-partitioned table

2023-03-09 Thread xi chaomin (Jira)
xi chaomin created HUDI-5911: Summary: SimpleTransactionDirectMarkerBasedDetectionStrategy can't work with none-partitioned table Key: HUDI-5911 URL: https://issues.apache.org/jira/browse/HUDI-5911 Projec

[jira] [Closed] (HUDI-5661) Add ConflictResolutionStrategy for bucket index

2023-02-05 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin closed HUDI-5661. Resolution: Duplicate > Add ConflictResolutionStrategy for bucket index > --

[jira] [Closed] (HUDI-5561) The preCombine method of PartialUpdateAvroPayload is not called

2023-02-01 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin closed HUDI-5561. Resolution: Duplicate > The preCombine method of PartialUpdateAvroPayload is not called > --

[jira] [Updated] (HUDI-5660) Support bucket index for spark bulk_insert

2023-01-31 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5660: - Issue Type: Improvement (was: Bug) > Support bucket index for spark bulk_insert > ---

[jira] [Updated] (HUDI-5661) Add ConflictResolutionStrategy for bucket index

2023-01-31 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5661: - Issue Type: Bug (was: Improvement) > Add ConflictResolutionStrategy for bucket index > --

[jira] [Updated] (HUDI-5660) Support bucket index for spark bulk_insert

2023-01-31 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5660: - Issue Type: Bug (was: Improvement) > Support bucket index for spark bulk_insert > ---

[jira] [Created] (HUDI-5661) Add ConflictResolutionStrategy for bucket index

2023-01-30 Thread xi chaomin (Jira)
xi chaomin created HUDI-5661: Summary: Add ConflictResolutionStrategy for bucket index Key: HUDI-5661 URL: https://issues.apache.org/jira/browse/HUDI-5661 Project: Apache Hudi Issue Type: Improve

[jira] [Closed] (HUDI-5069) TestInlineCompaction.testSuccessfulCompactionBasedOnNumAndTime is flaky

2023-01-16 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin closed HUDI-5069. Resolution: Not A Problem > TestInlineCompaction.testSuccessfulCompactionBasedOnNumAndTime is flaky > --

[jira] [Created] (HUDI-5561) The preCombine method of PartialUpdateAvroPayload is not called

2023-01-15 Thread xi chaomin (Jira)
xi chaomin created HUDI-5561: Summary: The preCombine method of PartialUpdateAvroPayload is not called Key: HUDI-5561 URL: https://issues.apache.org/jira/browse/HUDI-5561 Project: Apache Hudi Is

[jira] [Closed] (HUDI-5047) Add partition value in HoodieLogRecordReader when hoodie.datasource.write.drop.partition.columns=true

2022-12-07 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin closed HUDI-5047. Resolution: Won't Fix > Add partition value in HoodieLogRecordReader when > hoodie.datasource.write.drop.pa

[jira] [Updated] (HUDI-5308) Hive query returns null when the where condition has a partition field

2022-12-04 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5308: - Summary: Hive query returns null when the where condition has a partition field (was: The hive query retu

[jira] [Updated] (HUDI-5308) Hive query returns null when the where clause has a partition field

2022-12-04 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5308: - Summary: Hive query returns null when the where clause has a partition field (was: Hive query returns nul

[jira] [Updated] (HUDI-5308) The hive query returns null when the where condition has a partition field

2022-12-01 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5308: - Summary: The hive query returns null when the where condition has a partition field (was: The hive query

[jira] [Created] (HUDI-5308) The hive query returns null when the where condition has a partitioned field

2022-12-01 Thread xi chaomin (Jira)
xi chaomin created HUDI-5308: Summary: The hive query returns null when the where condition has a partitioned field Key: HUDI-5308 URL: https://issues.apache.org/jira/browse/HUDI-5308 Project: Apache Hudi

[jira] [Updated] (HUDI-5189) Make HiveAvroSerializer compatible with hive3

2022-11-10 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5189: - Description: Compilation failure: Compilation failure: [ERROR] /Users/xi/work/github/hudi/hudi-hadoop-mr/

[jira] [Created] (HUDI-5189) Make HiveAvroSerializer compatible with hive3

2022-11-10 Thread xi chaomin (Jira)
xi chaomin created HUDI-5189: Summary: Make HiveAvroSerializer compatible with hive3 Key: HUDI-5189 URL: https://issues.apache.org/jira/browse/HUDI-5189 Project: Apache Hudi Issue Type: Bug

[jira] [Assigned] (HUDI-5185) Compaction run fails with --hoodieConfigs

2022-11-09 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin reassigned HUDI-5185: Assignee: xi chaomin > Compaction run fails with --hoodieConfigs >

[jira] [Updated] (HUDI-5185) Compaction run fails with --hoodieConfigs

2022-11-08 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5185: - Description: compaction run --schemaFilePath /tmp/compaction.schema --hoodieConfigs hoodie.embed.timeline

[jira] [Updated] (HUDI-5185) Compaction run fails with --hoodieConfigs

2022-11-08 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5185: - Description: compaction run --schemaFilePath /tmp/compaction.schema --hoodieConfigs hoodie.embed.timeline

[jira] [Created] (HUDI-5185) Compaction run fails with --hoodieConfigs

2022-11-08 Thread xi chaomin (Jira)
xi chaomin created HUDI-5185: Summary: Compaction run fails with --hoodieConfigs Key: HUDI-5185 URL: https://issues.apache.org/jira/browse/HUDI-5185 Project: Apache Hudi Issue Type: Bug

[jira] [Commented] (HUDI-5018) Make user-provided copyOnWriteRecordSizeEstimate first precedence

2022-11-06 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17629576#comment-17629576 ] xi chaomin commented on HUDI-5018: -- Hi [~xushiyan], I can do this improvement, but I hav

[jira] [Updated] (HUDI-5096) boolean param is broken in HiveSyncTool

2022-10-27 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5096: - Summary: boolean param is broken in HiveSyncTool (was: boolean params is broken in HiveSyncTool) > boole

[jira] [Created] (HUDI-5096) boolean params is broken in HiveSyncTool

2022-10-26 Thread xi chaomin (Jira)
xi chaomin created HUDI-5096: Summary: boolean params is broken in HiveSyncTool Key: HUDI-5096 URL: https://issues.apache.org/jira/browse/HUDI-5096 Project: Apache Hudi Issue Type: Bug

[jira] [Updated] (HUDI-5069) TestInlineCompaction.testSuccessfulCompactionBasedOnNumAndTime is flaky

2022-10-25 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5069: - Description: {code:java} org.opentest4j.AssertionFailedError: Expect baseInstant to be less than or equal

[jira] [Created] (HUDI-5069) TestInlineCompaction.testSuccessfulCompactionBasedOnNumAndTime is flaky

2022-10-21 Thread xi chaomin (Jira)
xi chaomin created HUDI-5069: Summary: TestInlineCompaction.testSuccessfulCompactionBasedOnNumAndTime is flaky Key: HUDI-5069 URL: https://issues.apache.org/jira/browse/HUDI-5069 Project: Apache Hudi

[jira] [Updated] (HUDI-5047) Add partition value in HoodieLogRecordReader when hoodie.datasource.write.drop.partition.columns=true

2022-10-21 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5047: - Description: When I sync hive with hoodie.datasource.write.drop.partition.columns=false, the query with p

[jira] [Updated] (HUDI-5047) Add partition value in HoodieLogRecordReader when hoodie.datasource.write.drop.partition.columns=true

2022-10-21 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5047: - Description: When I sync hive with hoodie.datasource.write.drop.partition.columns=false, the query with p

[jira] [Updated] (HUDI-5047) Add partition value in HoodieLogRecordReader when hoodie.datasource.write.drop.partition.columns=true

2022-10-21 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5047: - Description: When I sync hive with hoodie.datasource.write.drop.partition.columns=false, the query with p

[jira] [Updated] (HUDI-5047) Add partition value in HoodieLogRecordReader when hoodie.datasource.write.drop.partition.columns=true

2022-10-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5047: - Description: When I sync hive with hoodie.datasource.write.drop.partition.columns=false, the query with p

[jira] [Updated] (HUDI-5047) Add partition value in HoodieLogRecordReader when hoodie.datasource.write.drop.partition.columns=true

2022-10-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5047: - Description: When I sync hive with hoodie.datasource.write.drop.partition.columns=false, the query with p

[jira] [Updated] (HUDI-5047) Add partition value in HoodieLogRecordReader when hoodie.datasource.write.drop.partition.columns=true

2022-10-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5047: - Summary: Add partition value in HoodieLogRecordReader when hoodie.datasource.write.drop.partition.columns=

[jira] [Updated] (HUDI-5047) Add partition field in HoodieLogRecordReader when hoodie.datasource.write.drop.partition.columns=true

2022-10-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5047: - Summary: Add partition field in HoodieLogRecordReader when hoodie.datasource.write.drop.partition.columns=

[jira] [Updated] (HUDI-5047) Set hoodie.datasource.write.drop.partition.columns=true, the update record cann't be read in mor table.

2022-10-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5047: - Description: When I sync hive with hoodie.datasource.write.drop.partition.columns=false, the query with p

[jira] [Updated] (HUDI-5047) Set hoodie.datasource.write.drop.partition.columns=true, the update record cann't be read in mor table.

2022-10-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5047: - Description: When I sync hive with hoodie.datasource.write.drop.partition.columns=false, the query with p

[jira] [Updated] (HUDI-5047) Set hoodie.datasource.write.drop.partition.columns=true, the update record cann't be read in mor table.

2022-10-18 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-5047: - Description: When I sync hive with hoodie.datasource.write.drop.partition.columns=false, the query with p

[jira] [Created] (HUDI-5047) Set hoodie.datasource.write.drop.partition.columns=true, the update record cann't be read in mor table.

2022-10-18 Thread xi chaomin (Jira)
xi chaomin created HUDI-5047: Summary: Set hoodie.datasource.write.drop.partition.columns=true, the update record cann't be read in mor table. Key: HUDI-5047 URL: https://issues.apache.org/jira/browse/HUDI-5047

[jira] [Updated] (HUDI-4998) Inference of META_SYNC_PARTITION_EXTRACTOR_CLASS does not work

2022-10-09 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-4998: - Issue Type: Bug (was: Improvement) > Inference of META_SYNC_PARTITION_EXTRACTOR_CLASS does not work > ---

[jira] [Created] (HUDI-4998) Inference of META_SYNC_PARTITION_EXTRACTOR_CLASS does not work

2022-10-09 Thread xi chaomin (Jira)
xi chaomin created HUDI-4998: Summary: Inference of META_SYNC_PARTITION_EXTRACTOR_CLASS does not work Key: HUDI-4998 URL: https://issues.apache.org/jira/browse/HUDI-4998 Project: Apache Hudi Iss

[jira] [Updated] (HUDI-4996) Update cleaning doc

2022-10-08 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-4996: - Description: The parameter *--hoodieConfigs* in "cleans run" is a String array, the value should be separa

[jira] [Created] (HUDI-4996) Update cleaning doc

2022-10-08 Thread xi chaomin (Jira)
xi chaomin created HUDI-4996: Summary: Update cleaning doc Key: HUDI-4996 URL: https://issues.apache.org/jira/browse/HUDI-4996 Project: Apache Hudi Issue Type: Improvement Components: c

[jira] [Updated] (HUDI-4902) Set default partitioner for SIMPLE BUCKET index

2022-09-22 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-4902: - Summary: Set default partitioner for SIMPLE BUCKET index (was: Add default partitioner for SIMPLE BUCKET

[jira] [Updated] (HUDI-4902) Add default partitioner for SIMPLE BUCKET index

2022-09-22 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-4902: - Description: https://github.com/apache/hudi/issues/6723 > Add default partitioner for SIMPLE BUCKET index

[jira] [Created] (HUDI-4902) Add default partitioner for SIMPLE BUCKET index

2022-09-22 Thread xi chaomin (Jira)
xi chaomin created HUDI-4902: Summary: Add default partitioner for SIMPLE BUCKET index Key: HUDI-4902 URL: https://issues.apache.org/jira/browse/HUDI-4902 Project: Apache Hudi Issue Type: Bug

[jira] [Commented] (HUDI-3983) ClassNotFoundException when using hudi-spark-bundle to write table with hbase index

2022-09-19 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17606468#comment-17606468 ] xi chaomin commented on HUDI-3983: -- Hi, [~codope] I pushed a PR to remove the default va

[jira] [Comment Edited] (HUDI-3983) ClassNotFoundException when using hudi-spark-bundle to write table with hbase index

2022-09-16 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17605706#comment-17605706 ] xi chaomin edited comment on HUDI-3983 at 9/16/22 9:25 AM: --- I tr

[jira] [Comment Edited] (HUDI-3983) ClassNotFoundException when using hudi-spark-bundle to write table with hbase index

2022-09-16 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17605704#comment-17605704 ] xi chaomin edited comment on HUDI-3983 at 9/16/22 9:18 AM: --- [~co

[jira] [Commented] (HUDI-3983) ClassNotFoundException when using hudi-spark-bundle to write table with hbase index

2022-09-16 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17605739#comment-17605739 ] xi chaomin commented on HUDI-3983: -- [~codope]  Also check with you, have you encountered

[jira] [Comment Edited] (HUDI-3983) ClassNotFoundException when using hudi-spark-bundle to write table with hbase index

2022-09-16 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17605706#comment-17605706 ] xi chaomin edited comment on HUDI-3983 at 9/16/22 9:14 AM: --- I tr

[jira] [Commented] (HUDI-3983) ClassNotFoundException when using hudi-spark-bundle to write table with hbase index

2022-09-16 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17605706#comment-17605706 ] xi chaomin commented on HUDI-3983: -- I tried the method mentioned in [https://github.com/a

[jira] [Commented] (HUDI-3983) ClassNotFoundException when using hudi-spark-bundle to write table with hbase index

2022-09-16 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17605704#comment-17605704 ] xi chaomin commented on HUDI-3983: -- [~codope] Thanks for your help, ClassNotFoundExceptio

[jira] [Closed] (HUDI-4820) ORC dependency conflicts with spark 3.1

2022-09-11 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin closed HUDI-4820. Resolution: Fixed duplicate to HUDI-4496 > ORC dependency conflicts with spark 3.1 > --

[jira] [Commented] (HUDI-4821) Presto query for bootstrapped table fails due to IOException

2022-09-09 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17602224#comment-17602224 ] xi chaomin commented on HUDI-4821: -- Could you set dfs.client.use.legacy.blockreader to fa

[jira] [Updated] (HUDI-4820) ORC dependency conflicts with spark 3.1

2022-09-09 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-4820: - Description: Set _*hoodie.table.base.file.format*_ to *ORC,* I get an Exception   {code:java} java.lang.N

[jira] [Updated] (HUDI-4820) ORC dependency conflicts with spark 3.1

2022-09-09 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-4820: - Description: Set _*hoodie.table.base.file.format*_ to *ORC,* I get an Exception   {code:java} java.lang.N

[jira] [Updated] (HUDI-4820) ORC dependency conflicts with spark 3.1

2022-09-09 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-4820: - Description: Set _*hoodie.table.base.file.format*_ to *ORC,* I get an Exception   {code:java} java.lang.N

[jira] [Updated] (HUDI-4820) ORC dependency conflicts with spark 3.1

2022-09-08 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-4820: - Summary: ORC dependency conflicts with spark 3.1 (was: ORC dependency conflict with spark 3.1) > ORC dep

[jira] [Updated] (HUDI-4820) ORC dependency conflict with spark 3.1

2022-09-08 Thread xi chaomin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xi chaomin updated HUDI-4820: - Description: Set _*hoodie.table.base.file.format*_ to *ORC,* I get an Exception   {code:java} java.lang.N

[jira] [Created] (HUDI-4820) ORC dependency conflict with spark 3.1

2022-09-08 Thread xi chaomin (Jira)
xi chaomin created HUDI-4820: Summary: ORC dependency conflict with spark 3.1 Key: HUDI-4820 URL: https://issues.apache.org/jira/browse/HUDI-4820 Project: Apache Hudi Issue Type: Bug
