Hi,
I wanted to share some important information regarding the CDP Hive version
that I am currently using, which is Hive 3.1.3000.7.1.7.0-551.

In CDP Hive version 3.x, a significant change has been introduced regarding
managed tables. By default, managed tables in Hive 3.x are considered
transactional. This means that if you create a managed table explicitly
specifying it as transactional with a value of "false," it will be treated
as an external table instead.
Thank you for your attention to this matter, and I appreciate your
continued support.
https://www.thecodersstop.com/hadoop/apache-hive-3-changes-in-cdp-upgrade-part-1/
Best regards,
Joice

On Thu, Oct 5, 2023 at 7:01 PM Mingyu Chen <morning...@163.com> wrote:

> I saw that your hive table is with property "transactional" = "true",
> And Doris only support ACID table with Hive 3, not support with Hive 2.x.
> So you may need to create non-transactional hive table for Doris to visit.
>
>
> --
> Best Regards
> Mingyu Chen
>
> Email:
> morning...@apache.org
>
>
> 在 2023-10-05 13:30:25,"Joice Jacob" <joicejacob1...@gmail.com> 写道:
>
> Dear Doris Community & Jiafeng.Zhang
> ,
> Thank you for your prompt response and your willingness to assist with the
> issue I've been encountering with the integration of Doris and Cloudera's
> Hive.
>
> To provide you with the necessary information, here are the details you
> requested:
>
> *1. Hive Catalog Creation Statement:*
>
> My Hive catalog creation statement is as follows:
>
> CREATE CATALOG hive PROPERTIES (
>     'type'='hms',
>     'hive.metastore.uris' = 'thrift://10.128.0.4:9083',
> 'hive.version' = '3.1.3',
>     'hive.metastore.sasl.enabled' = 'true',
>     'hive.metastore.kerberos.principal' = 'hive/instanc...@hadoop.com',
>     'hadoop.security.authentication' = 'kerberos',
>     'hadoop.kerberos.keytab' = '/home/techuser/doris/hive.keytab',
>     'hadoop.kerberos.principal' = 'hive/instanc...@hadoop.com',
>     'yarn.resourcemanager.principal' = 'yarn/instanc...@hadoop.com'
> );
>
> *2. CDP Hive Version:*
>
> The CDP Hive version I am using is: Hive 3.1.3000.7.1.7.0-551
>
> *3. Configuration Attempt:*
>
> I have tried specifying the Hive version in my catalog configuration with
> the following statement:
> "hive.version=2.1.0"
> Despite attempting to set the Hive version to 2.1.0, I continue to
> experience the same issue, which is detailed in my previous emails.
>
> *4. Logs: *
> I will attach both the fe.log and fe.warn logs to this email for your
> reference. These logs should provide additional context regarding the issue
> I'm facing.
>
> Once again, I want to express my gratitude for your assistance and support
> in resolving this matter. I look forward to your insights and
> recommendations based on the provided logs and catalog configuration.
>
> Please feel free to let me know if you require any further information or
> if there are additional steps I should take to assist in diagnosing and
> resolving the issue.
>
> [image: doris_error.JPG]
>  fe.log log at hive catlog creation time
> [image: fe_log_at_hivecatalog.JPG]
> Hive managed table script
> [image: image.png]
>
>
> Best regards,
> Joice
>
> On Thu, Oct 5, 2023 at 8:10 AM Jiafeng.Zhang <zhang...@gmail.com> wrote:
>
>> Can you provide your hive catalog creation statement, your fe.log log at
>> that time, and your cdp hive version? This will help us locate the
>> problem,
>> thank you.
>> You can also try specifying your hive version in your catalog statement:
>> "hive.version=2.1.0"
>>
>> Joice Jacob <joicejacob1...@gmail.com> 于2023年10月5日周四 00:45写道:
>>
>> > I am reaching out once again to seek assistance and share a specific
>> issue
>> > I've encountered while integrating Doris with Cloudera-flavored Hive.
>> The
>> > error message I'm facing is as follows:
>> > detailMessage = get file split failed for table: baby1, err:
>> > java.lang.Exception: Hive 2.x versioned full-acid tables need to run
>> major
>> > compaction.
>> >
>> > This error message appears when I attempt to query a Hive table from
>> > Doris, and it seems to be related to Hive 2.x versioned full-ACID tables
>> > requiring a major compaction.
>> > I would like to ask the Doris community for guidance on how to handle
>> this
>> > issue effectively. Specifically, I am interested in understanding the
>> best
>> > practices and steps to follow when dealing with Cloudera-flavored Hive
>> > tables that require major compaction for Doris integration.
>> >
>> > If anyone in the community has successfully addressed this issue or can
>> > provide insights into how to configure and manage Cloudera-flavored Hive
>> > tables for integration with Doris, your expertise would be highly
>> > appreciated.
>> >
>> > Thank you for your time and support, and I look forward to receiving
>> your
>> > valuable input.
>> > Joice
>> >
>> > On Wed, Oct 4, 2023 at 9:23 PM Joice Jacob <joicejacob1...@gmail.com>
>> > wrote:
>> >
>> >> Dear Doris Community,
>> >> I have an update on the issue I previously mentioned regarding the
>> >> integration of Hive on a CDP distribution with Doris 2.0.1.1.
>> >>
>> >> After further investigation, I have identified that the issue is
>> related
>> >> to Hive managed tables being transactional by default. This appears to
>> be
>> >> causing the error I encountered earlier.
>> >>
>> >> To address this issue, I am seeking guidance from the community on any
>> >> specific configurations or settings that need to be adjusted for Hive
>> >> transactional tables when using Hive as the catalog in Doris. Are
>> there any
>> >> recommended configurations or best practices that I should follow to
>> ensure
>> >> smooth integration and query execution?
>> >>
>> >> Any insights or recommendations from the Doris community would be
>> greatly
>> >> appreciated. Your expertise and guidance will be instrumental in
>> helping me
>> >> resolve this challenge.
>> >>
>> >> Thank you for your continued support, and I look forward to your
>> valuable
>> >> input.
>> >>
>> >> Best regards,
>> >> Joice
>> >>
>> >> On Wed, Oct 4, 2023 at 6:54 PM Joice Jacob <joicejacob1...@gmail.com>
>> >> wrote:
>> >>
>> >>> Dear Doris Community,
>> >>>
>> >>> I am reaching out to the community to seek assistance with an
>> >>> integration issue I've encountered while trying to use Hive on a CDP
>> >>> distribution with Doris 2.0.1.1.
>> >>>
>> >>> Here are the details of my setup:
>> >>>
>> >>> Doris Version: 2.0.1.1
>> >>> Hive Version: 3.1.3
>> >>> Cluster Security: Kerberized
>> >>>
>> >>> I have successfully created a Hive catalog in Doris and have been able
>> >>> to set up the integration between Hive and Doris. However, when I
>> attempt
>> >>> to query a Hive table using Doris, I encounter the following error:
>> >>>
>> >>> ERROR 1105 (HY000): errCode = 2, detailMessage = get file split failed
>> >>> for table: baby1, err: java.lang.Exception: Hive 2.x versioned
>> full-acid
>> >>> tables need to run major compaction.
>> >>>
>> >>> I have already performed a major compaction as recommended, but I am
>> >>> still encountering the same error.
>> >>>
>> >>> I would greatly appreciate any insights, guidance, or solutions that
>> the
>> >>> Doris community can offer to help me resolve this issue. If anyone has
>> >>> encountered a similar problem or has expertise in integrating Hive
>> with
>> >>> Doris, your assistance would be invaluable.
>> >>>
>> >>> Thank you in advance for your time and support. I look forward to
>> >>> hearing from the community and working together to find a solution to
>> this
>> >>> challenge.
>> >>>
>> >>> [image: baby_table.JPG]
>> >>> [image: doris_hive_catlog_result.JPG]
>> >>>
>> >>> [image: doris_fe_log.JPG]
>> >>>
>> >>> [image: hive_metastore_error.JPG]
>> >>>
>> >>> Thanks
>> >>> Joice
>> >>>
>> >>>
>> >>>
>> >>>
>> >>>
>>
>> --
>> 张家峰
>>
>

Reply via email to