Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-17 Thread Dennis Huo
A lot of the concerns seem to arise from the fact that Polaris essentially has two distinct deployment modes: 1. Provide a highly isolated multi-tenant (multi-realm) Iceberg REST service with a tightly curated set of secure integrations to external dependencies 2. Provide a turnkey Iceberg REST se

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-15 Thread Pooja Nilangekar
Sure, I will add it to the community sync agrenda. My concern here is that the other IRC will still need to re-implement significant portions of Polaris logic, some important blocks of the top of the list: Auth token management, entity persistence, principals management, entity access API; which i

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-15 Thread Robert Stupp
I think this deserves a discussion in the next community sync, feel free to add it to the agenda. >From what I read in this thread: nobody is against the idea of being able to federate to HMS. However, technical concerns should really be considered, and I think that it is a much safer option for

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-14 Thread Pooja Nilangekar
Hi everyone, I believe Hive Metastore (HMS) federation is a critical feature for the success and widespread adoption of the project. The reality of the current data lake ecosystem is that a vast number of organizations have significant footprint in HMS. For Polaris to be the Iceberg Rest Catal

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-10 Thread Yufei Gu
It’s a valid point that Polaris needs to support multi-tenancy, or even across different external catalogs (such as remote HMS) within a single realm. Unfortunately, Kerberos isn’t compatible with this model, as it requires global configuration per JVM, making it inherently single-tenant. So I’d s

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-09 Thread Robert Stupp
Following up on my email: Polaris would really benefit from supporting HMS and other catalog types. And the way I see to get there is to have a "HMS only" IRC service, which can be legibly built on Java 11, use Kerberos, etc. Polaris can then federate to that HMS catalog. AFAIU clients can authent

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-09 Thread Robert Stupp
Let's recap what Polaris offers: 1. Multi tenancy via realms 2. Multiple catalogs per realm 3. OAuth/OIDC Adding Kerberos is global per JVM, making #1 impossible and likely also not suitable for #2, plus adding another complicated and complex auth mechanism. If Kerberos is a strong concern, I prop

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-08 Thread Yufei Gu
HMS integration is a key step toward one of Polaris’s critical missions: helping users move off HMS. It brings clear value by aligning with our long-term direction. I’m not too concerned about hive.xml, most of its configurations can be dynamically injected at runtime. The real challenge lies in K

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-07 Thread Russell Spitzer
I think having some integration with HMS is definitely a good idea. We've already seen users build this in the wild on top of Polaris showing that there is definitely a demand. I'm still a strong believer that we should be helping users get to Polaris from whatever systems they are currently using

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-07 Thread Eric Maynard
1. We (Polaris) can provide end users a way to migrate off of these catalogs that the Iceberg project no longer wants to invest into. Implementing HMS federation is in service to the goal of removing non-Iceberg catalogs, not in contradiction to it. 2. This does not seem like a user-centered conce

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-04 Thread Robert Stupp
I'd really prefer to not add "anything Hive" to Polaris itself, and I'd really like to see Hadoop being removed entirely from the Polaris code base. There are multiple reasons for this: 1. The Iceberg project would rather like to remove all catalogs except the REST catalog. (That's at least wh

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-02 Thread Eric Maynard
I think this is orthogonal to Dmitri's note above but my understanding is the following: When the auth type is NONE, the config doesn't tell Polaris which (or any) hive-site.xml to pick up. This is essentially just a test mode or a mode for when the Polaris service is deployed as a wrapper around

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-02 Thread Pooja Nilangekar
Hey Eric, The part about other auth types has a few aspects we should consider: 1. We can reliably support HMS instances that have (Simple, ie. NO_AUTH) and LDAP (with some implementation of LDAPAuthenticationType). 2. We can't support Kerberos because it requires that the user via the Polaris

Re: [DISCUSS] Hive Catalog federation in Polaris

2025-07-02 Thread Eric Maynard
IMO 2a should be off the table; the person creating an EXTERNAL catalog does not necessarily have permission to access the path that a hive-site.xml is as (whether local to the Polaris catalog service or in object storage) or to even know what paths the catalog has access to. It's the same problem