That's great news! If you need anything from me, just ask...

We should also check and update other non-Hive third-party libraries with
high/critical vulnerabilities, as someone mentioned in another email thread.

Since this is a major change, I think we should leave it for Spark 4.1.
What do you think?

El lun, 24 mar 2025 a las 0:59, Mich Talebzadeh (<mich.talebza...@gmail.com>)
escribió:

> For now, I am testing apache-hive-4.0.1-bin which is the latest release
> version  from
>
> https://dlcdn.apache.org/hive/hive-4.0.1/
>
> apache-hive-4.0.1-bin.tar
>
> My metastore is Oracle and upgrade scripts are provided..
>
> My previous version is Hive 3.1.1 and the metastore upgrade went OK
> without any major headache.
>
> Now I just need to customise various files under $HIVE_HOME/conf and then
> I will have some testing underway.
>
> HTH
>
> Dr Mich Talebzadeh,
> Architect | Data Science | Financial Crime | Forensic Analysis | GDPR
>
>    view my Linkedin profile
> <https://www.linkedin.com/in/mich-talebzadeh-ph-d-5205b2/>
>
>
>
>
>
> On Sun, 23 Mar 2025 at 17:13, Ángel Álvarez Pascua <
> angel.alvarez.pas...@gmail.com> wrote:
>
>> Well ... and then? When are we going to tackle this? I could help.
>>
>> El mié, 12 mar 2025, 15:50, Mich Talebzadeh <mich.talebza...@gmail.com>
>> escribió:
>>
>>> Agreed. Hive upgrade is more time consuming as it involves backing up
>>> Hive schema on your metastore and then running Hive provided upgrade schema
>>> scripts against Hive schema that could be problematic,but needs to be done
>>> one way or another.
>>>
>>> HTH
>>>
>>> Dr Mich Talebzadeh,
>>> Architect | Data Science | Financial Crime | Forensic Analysis | GDPR
>>>
>>>    view my Linkedin profile
>>> <https://www.linkedin.com/in/mich-talebzadeh-ph-d-5205b2/>
>>>
>>>
>>>
>>>
>>>
>>> On Wed, 12 Mar 2025 at 12:21, Ángel <angel.alvarez.pas...@gmail.com>
>>> wrote:
>>>
>>>> Not an easy task, I guess, but I'm totally for it too.
>>>>
>>>> The issue SPARK-49910
>>>> <https://issues.apache.org/jira/browse/SPARK-49910> is related to this.
>>>>
>>>> El mar, 11 mar 2025 a las 23:06, Mich Talebzadeh (<
>>>> mich.talebza...@gmail.com>) escribió:
>>>>
>>>>> Yes I am all  for it, as I use Hive with Oracle as its metastore
>>>>> extensively.
>>>>>
>>>>> Case in point, on 6th March A Hive user
>>>>> <https://lists.apache.org/thread/vhgxt1cj2ppc862j0lwxl63j6nfc7khh>
>>>>> alluded to it and I quote
>>>>>
>>>>> "I just wanted to highlight that Hive 3.x line is EOL. It has various
>>>>> known security vulnerabilities, many serious bugs (including wrong results
>>>>> and data corruption), and lacks lots of improvements and major features
>>>>> that are available in Hive 4. Upgrading is the right path forward."
>>>>>
>>>>>  In summary, Hive 4.x likely includes performance improvements, new
>>>>> features, and bug fixes. Compiling against it would allow Spark to take
>>>>> advantage of these. Plus using the latest versions of both Spark and Hive
>>>>> is important for maintaining a secure data platform.
>>>>>
>>>>> HTH
>>>>>
>>>>> Dr Mich Talebzadeh,
>>>>> Architect | Data Science | Financial Crime | Forensic Analysis | GDPR
>>>>>
>>>>>    view my Linkedin profile
>>>>> <https://www.linkedin.com/in/mich-talebzadeh-ph-d-5205b2/>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> On Tue, 11 Mar 2025 at 19:08, Rozov, Vlad <vro...@amazon.com.invalid>
>>>>> wrote:
>>>>>
>>>>>> Hi All,
>>>>>>
>>>>>> As Apache Hive announced EOL for Hive 2.x [1] and 3.x [2], should
>>>>>> Spark be compiled against Hive 4.x and use it as default?
>>>>>>
>>>>>> Thank you,
>>>>>>
>>>>>> Vlad
>>>>>>
>>>>>> [1] https://lists.apache.org/thread/4ctrzfw60jkhc0hq2xoh1jpqxgt2zd93
>>>>>> [2] https://lists.apache.org/thread/99h6wr7nk4684r6tkcbm8ydfytgqy6f3
>>>>>> [3] https://github.com/apache/spark/pull/50213
>>>>>>
>>>>>

Reply via email to