I’m aligned with the phased approach you’ve outlined: supporting password
resets for existing principals now, while steadily enhancing IdP support
and making it the default over time
I will go ahead with making changes based on the discussions
Thank you for the suggestions


On Tue, Jul 15, 2025 at 10:12 PM Dmitri Bourlatchkov <di...@apache.org>
wrote:

> I support Robert's proposal to leverage external IdPs in Polaris.
>
> However, Arun's use case also has merit, IMHO, because it enables existing
> deployments to migrate to newer Polaris versions, while keeping the old
> Polaris Principals.
>
> Migrating to an external IdP, while valuable from a security perspective,
> is a big deployment change.
>
> Perhaps we could implement the password reset request for existing Polaris
> Principals, while gradually enhancing support for external IdPs and making
> it default at some point.
>
> Cheers,
> Dmitri.
>
> On Tue, Jul 15, 2025 at 6:07 AM Robert Stupp <sn...@snazy.de> wrote:
>
> > Hi all,
> >
> > Writing the following with my "nasty security guy" hat on:
> >
> > Generally speaking, storing secrets is a quite sensitive topic that
> > deserves a lot of attention upfront, during the initial implementation
> > and for all changes related to it. What we already have in Polaris is
> > IMHO strictly speaking not enough, because it lacks the ability to
> > automatically rotate secrets and targeted logging/auditing
> > functionality. It's overall okay-ish. With the ability to provide
> > secrets comes another requirement to ensure that secrets are strong
> > enough and new permission checks and feature flags, which make the
> > code more complex.
> >
> > The mentioned requirements are the core domain of what IdPs do (for
> > example the OSS projects Keycloak, Authelia).
> >
> > From the Polaris's project PoV I think we should not go down that
> > route but leverage IdPs to do this for us. Mistakes and oversights in
> > the domain of secrets are common sources for vulnerabilities.
> >
> > On Mon, Jul 14, 2025 at 5:42 PM Dmitri Bourlatchkov <di...@apache.org>
> > wrote:
> > >
> > > Hi Arun,
> > >
> > > After writing my previous email I realised that we might be able to
> > support
> > > your use case via REST API by implementing admin-base password reset
> > [624].
> > >
> > > The flow would be:
> > > 1) create a principal using current ID (random client ID/secret)
> > > 2) set your own client ID / secret via the password reset API (new)
> > >
> > > Would this work from your POV?
> > >
> > > As for me, it helps to isolate rare admin-level principal manipulation
> > from
> > > the mainstream principal API, which IMHO, is a good idea from a general
> > > security perspective.
> > >
> > > Also, since we allow user-provided client IDs (after this change), it
> > would
> > > make sense to allow resetting them without re-creating the principal,
> > hence
> > > the new "reset" API.
> > >
> > > [624] https://github.com/apache/polaris/issues/624
> > >
> > > Cheers,
> > > Dmitri.
> > >
> > > On Mon, Jul 14, 2025 at 11:29 AM Dmitri Bourlatchkov <di...@apache.org
> >
> > > wrote:
> > >
> > > > Thanks for volunteering to make a PR for this, Arun! Looking forward
> > to it.
> > > >
> > > > As discussed, please consider securing the extra capability via (new)
> > > > permission checks. I'd think it might be worth it to also have a
> > feature
> > > > flag to control the new functionality.
> > > >
> > > > Re: External IdP - most of the authentication code is already in
> > `main`.
> > > > There are a few remaining dangling pieces related to connecting
> > external
> > > > users to Polaris roles, though, IIRC.
> > > >
> > > > Cheers,
> > > > Dmitri.
> > > >
> > > > On Mon, Jul 14, 2025 at 6:34 AM Arun Suri <arun.s...@fivetran.com
> > .invalid>
> > > > wrote:
> > > >
> > > >> Thanks for the detailed response Dimitri and Yufei!
> > > >>
> > > >> I agree with making the PR to support user-defined client ID and
> > secret
> > > >> via
> > > >> the REST API, along with appropriate access checks and possibly
> > > >> introducing
> > > >> a new permission type/config. I will work on this
> > > >>
> > > >> REST fits better with our tooling as it has fewer dependencies and
> > > >> complications compared to the JAR-based Admin CLI. We also believe
> > > >> building
> > > >> our migration logic in a neutral way (e.g., using the REST API) is
> > more
> > > >> robust—no matter how the tools evolve, the API remains the stable
> > > >> contract.
> > > >>
> > > >> As for external IdP delegation, it's something we're open to
> exploring
> > > >> down
> > > >> the line, though we understand it's still relatively new in Polaris
> > > >>
> > > >> Thanks,
> > > >>
> > > >> On Sun, Jul 13, 2025 at 1:28 AM Yufei Gu <flyrain...@gmail.com>
> > wrote:
> > > >>
> > > >> > The use case of passing secrets via REST is definitely valid, and
> > this
> > > >> use
> > > >> > case is not the only one we should be considering. Other concrete
> > > >> scenarios
> > > >> > include:
> > > >> >
> > > >> >    1. Catalog federation, where Polaris needs to store credentials
> > to
> > > >> >    connect to remote catalogs (e.g., Hive, Glue, Unity Catalog).
> > > >> >    2. S3-compatible storage without STS support, where Polaris
> must
> > > >> persist
> > > >> >    static access keys and secrets to enable read/write operations.
> > > >> >
> > > >> > Given these needs, I think it's the right time to formalize our
> > > >> approach to
> > > >> > secret management by integrating Polaris with established secret
> > > >> managers
> > > >> > such as HashiCorp Vault, AWS Secrets Manager, Azure Key Vault, and
> > > >> Google
> > > >> > Cloud Secret Manager.
> > > >> >
> > > >> > While using the admin tool to inject secrets is a workable
> > short-term
> > > >> > solution, it’s best treated as a stopgap.
> > > >> >
> > > >> > The good news is that the secret management interface was
> > introduced in
> > > >> the
> > > >> > Polari core already,
> > > >> >
> > > >> >
> > > >>
> >
> https://github.com/apache/polaris/blob/main/polaris-core/src/main/java/org/apache/polaris/core/secrets/UserSecretsManager.java
> > > >> > ,
> > > >> > we may just need to provide wrapper implementations for different
> > secret
> > > >> > managers.
> > > >> >
> > > >> >
> > > >> > Yufei
> > > >> >
> > > >> >
> > > >> > On Thu, Jul 10, 2025 at 12:52 PM Dmitri Bourlatchkov <
> > di...@apache.org>
> > > >> > wrote:
> > > >> >
> > > >> > > Thanks for providing more context, Arun!
> > > >> > >
> > > >> > > I do not object to adding user-provided client ID and secret to
> > the
> > > >> REST
> > > >> > > API. However, I personally maintain my opinion that this kind of
> > > >> > operation
> > > >> > > fits better with the Admin Tool, given the current state of the
> > > >> project.
> > > >> > I
> > > >> > > wonder what other community members think on this topic, too.
> > > >> > >
> > > >> > > If we go with updating the current REST API, then limiting
> access
> > to
> > > >> > > explicit client ID and secret parameters via access checks will
> > > >> certainly
> > > >> > > make sense. We may need a new permission type for this, I guess.
> > > >> > >
> > > >> > > Do you have the capacity to make a PR for this?
> > > >> > >
> > > >> > > Regarding the Admin Tool, is the difficulty in the fact that it
> > is a
> > > >> CLI
> > > >> > > tool that requires a JVM and your existing tooling is based on
> > > >> HTTP/REST
> > > >> > > and is not written in java? Just trying to understand the
> overall
> > use
> > > >> > case
> > > >> > > better.
> > > >> > >
> > > >> > > Your point about the external vault makes me wonder whether you
> > might
> > > >> be
> > > >> > > interested in running an IdP server (e.g. keycloak) in your
> infra
> > and
> > > >> > > making Polaris delegate user management to that system. There's
> > some
> > > >> > > existing support for that, but I'm not sure if anyone tried it
> > > >> end-to-end
> > > >> > > without any custom code on the server side (it is certainly
> > possible
> > > >> with
> > > >> > > custom code).
> > > >> > >
> > > >> > > Thanks,
> > > >> > > Dmitri.
> > > >> > >
> > > >> > > On Thu, Jul 10, 2025 at 3:30 PM Arun Suri <
> arun.s...@fivetran.com
> > > >> > .invalid>
> > > >> > > wrote:
> > > >> > >
> > > >> > > >  Hey Dmitri and Robert,
> > > >> > > >
> > > >> > > >
> > > >> > > > *To clarify our use case further:*
> > > >> > > >
> > > >> > > >
> > > >> > > > This isn't a one-time migration for us. We're migrating our
> > > >> *customers*
> > > >> > > > from
> > > >> > > > Polaris 0.9 (EclipseLink) to Polaris 1.0 (JDBC) gradually.
> > During
> > > >> this
> > > >> > > > process, we’ll be *running both catalog servers in parallel*,
> > with
> > > >> 1.0
> > > >> > > > acting as a *secondary/fallback* catalog.
> > > >> > > >
> > > >> > > > Our strategy involves:
> > > >> > > >
> > > >> > > >    - Registering the same tables in both 0.9 and 1.0
> > > >> > > >    - Using the *same *clientId* and *clientSecret in both
> > catalogs
> > > >> to
> > > >> > > >    ensure clients can authenticate seamlessly
> > > >> > > >    - Allowing us to *switch traffic between the two catalogs*,
> > and
> > > >> roll
> > > >> > > >    back instantly if needed
> > > >> > > >
> > > >> > > > This setup requires credential continuity — not just for
> > migration,
> > > >> but
> > > >> > > to
> > > >> > > > enable *safe rollback and zero-downtime cutover*. Using
> > different
> > > >> > > > credentials across catalogs versions would break this flow and
> > > >> require
> > > >> > > deep
> > > >> > > > client coordination to rotate secrets, which is not feasible
> at
> > > >> scale.
> > > >> > > >
> > > >> > > >
> > > >> > > > *Regarding your question, Dmitri: *I wonder how your tooling
> > could
> > > >> > obtain
> > > >> > > > Principals' secrets from the old
> > > >> > > >
> > > >> > > > Polaris instance for use as the new Principal creation request
> > > >> > parameter
> > > >> > > >
> > > >> > > > - We store the credentials in an external Vault as well. So we
> > are
> > > >> not
> > > >> > > > reading them from the old Polaris instance, but do have access
> > to
> > > >> them.
> > > >> > > >
> > > >> > > >
> > > >> > > > We did consider raw table copying, but the differences in
> > schema and
> > > >> > > > hashing logic between 0.9 and 1.0 make that risky — and harder
> > to
> > > >> > > > validate/test it completely due to unknown risks.
> > > >> > > >
> > > >> > > > So our goal with this proposal is to:
> > > >> > > >
> > > >> > > >    - Enable a *safe, service-admin only way* to inject known
> > > >> > credentials
> > > >> > > >    via the API during the transition phase with validations of
> > > >> course
> > > >> > > >    - Keep this functionality configurable.
> > > >> > > >
> > > >> > > > We’re not trying to expand Polaris into a full IdP — just to
> > > >> provide a
> > > >> > > > secure and practical bridge between versions. So the change
> > seems
> > > >> fine
> > > >> > to
> > > >> > > > us.
> > > >> > > >
> > > >> > > > Happy to iterate on the proposal  in a future sync
> > > >> > > >
> > > >> > > > On Wed, Jul 9, 2025 at 3:49 AM Dmitri Bourlatchkov <
> > > >> di...@apache.org>
> > > >> > > > wrote:
> > > >> > > >
> > > >> > > > > Hi Arun,
> > > >> > > > >
> > > >> > > > > Thank you for starting this discussion!
> > > >> > > > >
> > > >> > > > > I did some poking about Keycloak and it looks like Keycloak
> > allows
> > > >> > > > > user-provided Client IDs.
> > > >> > > > >
> > > >> > > > > I think it should be fine for Polaris to accept
> user-provided
> > > >> Client
> > > >> > > IDs
> > > >> > > > in
> > > >> > > > > the Principal management API. I suppose we may want to
> impose
> > some
> > > >> > > > > restrictions in terms of special characters, but in general
> a
> > > >> > > > > previous Polaris Client ID should be valid as an input
> > parameter
> > > >> when
> > > >> > > > > creating a new Principal.
> > > >> > > > >
> > > >> > > > > I think it should also be fine for Polaris to accept
> > user-provided
> > > >> > > Client
> > > >> > > > > Secrets (passwords) when creating Principals.
> > > >> > > > >
> > > >> > > > > That said, from my POV using the Admin Tool is still
> > preferable
> > > >> for
> > > >> > > > > migration use cases.
> > > >> > > > >
> > > >> > > > > My main argument in favour of the Admin Tool is that the
> whole
> > > >> > > migration
> > > >> > > > > process is a deployment type of activity when the Polaris
> > service
> > > >> is
> > > >> > > > > configured for the first time. Ideally, Polaris data would
> > follow
> > > >> a
> > > >> > > > backup
> > > >> > > > > / restore process (not currently implemented) where the old
> > > >> > instance's
> > > >> > > > data
> > > >> > > > > is exported into a file, which is then imported into the new
> > > >> instance
> > > >> > > > > before it is started for the first time.
> > > >> > > > >
> > > >> > > > > I wonder how your tooling could obtain Principals' secrets
> > from
> > > >> the
> > > >> > old
> > > >> > > > > Polaris instance for use as the new Principal creation
> request
> > > >> > > parameter.
> > > >> > > > >
> > > >> > > > > Cheers,
> > > >> > > > > Dmitri.
> > > >> > > > >
> > > >> > > > > On Tue, Jul 8, 2025 at 9:02 AM Arun Suri <
> > arun.s...@fivetran.com
> > > >> > > > .invalid>
> > > >> > > > > wrote:
> > > >> > > > >
> > > >> > > > > > Hi all,
> > > >> > > > > >
> > > >> > > > > > Following up on the suggestion from the discussion here
> > > >> > > > > > <
> > > >> > >
> > https://github.com/apache/polaris/issues/1929#issuecomment-3045487786
> > > >> > > > >
> > > >> > > > > > — thank you for the feedback so far.
> > > >> > > > > >
> > > >> > > > > > We’re currently migrating our self-hosted Polaris service
> > from
> > > >> > > version
> > > >> > > > > 0.9
> > > >> > > > > > (EclipseLink-based metastore) to version 1.0 (JDBC-based
> > > >> > metastore).
> > > >> > > As
> > > >> > > > > > part of this transition, we need to preserve the existing
> > > >> > `clientId`
> > > >> > > > and
> > > >> > > > > > `clientSecret` credentials for registered principals.
> > > >> > > > > > These credentials are already embedded in customer
> > workflows.
> > > >> > > Rotating
> > > >> > > > > them
> > > >> > > > > > during migration would create disruptions and require
> > cross-team
> > > >> > > > > > coordination with clients — making both rollout and
> rollback
> > > >> > > > > significantly
> > > >> > > > > > more complex.
> > > >> > > > > >
> > > >> > > > > > We understand the security implications of allowing
> > arbitrary
> > > >> > > > credentials
> > > >> > > > > > to be passed in an API request. That said, we believe this
> > > >> > capability
> > > >> > > > can
> > > >> > > > > > be introduced safely and in a tightly controlled manner.
> For
> > > >> > example:
> > > >> > > > > >
> > > >> > > > > > - Restricting this functionality to service admin only.
> > > >> > > > > > - Ensuring all credential transmissions occur only over
> > HTTPS
> > > >> > > > > > - Clearly documenting that this is strictly for
> > > >> > *migration/bootstrap
> > > >> > > > use
> > > >> > > > > > cases*, not for production use
> > > >> > > > > > - Disabling this functionality by default in publicly
> hosted
> > > >> > > > deployments
> > > >> > > > > > - Ensuring credentials are never logged (e.g., in
> > observability
> > > >> > > systems
> > > >> > > > > > like logs or traces)
> > > >> > > > > >
> > > >> > > > > > Our goal is not to weaken the system's security
> guarantees,
> > but
> > > >> to
> > > >> > > > > provide
> > > >> > > > > > a practical and secure migration path where credential
> > > >> continuity
> > > >> > is
> > > >> > > > > > essential. Since there are a lot of DB schema changes
> > involved,
> > > >> > > Manual
> > > >> > > > > > insertion into the metastore isn't ideal either, due to
> > > >> potential
> > > >> > > > > > inconsistencies in hashing or salting logic across
> versions
> > —
> > > >> > > > increasing
> > > >> > > > > > operational risk.
> > > >> > > > > >
> > > >> > > > > > *### Proposal*
> > > >> > > > > >
> > > >> > > > > > We propose extending the existing `createPrincipal` API to
> > > >> > optionally
> > > >> > > > > > accept `clientId` and `clientSecret` fields, with the
> above
> > > >> > > safeguards
> > > >> > > > in
> > > >> > > > > > place.
> > > >> > > > > >
> > > >> > > > > > We would appreciate your feedback on this proposal and are
> > > >> happy to
> > > >> > > > > > contribute a patch once there’s alignment. We’re also open
> > to
> > > >> > > > discussing
> > > >> > > > > > this during the next Polaris Community Sync if helpful.
> > > >> > > > > >
> > > >> > > > > > Arun Suri
> > > >> > > > > >
> > > >> > > > > > Senior Software Engineer
> > > >> > > > > >
> > > >> > > > > > He/him/his
> > > >> > > > > >
> > > >> > > > > > Engineering | Fivetran
> > > >> > > > > > arun.s...@fivetran.com
> > > >> > > > > > fivetran.com <//fivetran.com>
> > > >> > > > > > <http://www.fivetran.com>
> > > >> > > > > > [image: facebook] <https://www.facebook.com/Fivetran/>
> > [image:
> > > >> > > > twitter]
> > > >> > > > > > <
> > > >> > > > > >
> > > >> > > > >
> > > >> > > >
> > > >> > >
> > > >> >
> > > >>
> >
> https://twitter.com/fivetran?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor
> > > >> > > > > > >
> > > >> > > > > > [image:
> > > >> > > > > > linkedin] <https://www.linkedin.com/company/fivetran>
> > [image:
> > > >> > > > instagram]
> > > >> > > > > > <https://www.instagram.com/fivetran_ig/>
> > > >> > > > > >
> > > >> > > > >
> > > >> > > >
> > > >> > > >
> > > >> > > > --
> > > >> > > > Arun Suri
> > > >> > > >
> > > >> > > > Senior Software Engineer
> > > >> > > >
> > > >> > > > He/him/his
> > > >> > > >
> > > >> > > > Engineering | Fivetran
> > > >> > > > arun.s...@fivetran.com
> > > >> > > > fivetran.com <//fivetran.com>
> > > >> > > > <http://www.fivetran.com>
> > > >> > > > [image: facebook] <https://www.facebook.com/Fivetran/>
> [image:
> > > >> > twitter]
> > > >> > > > <
> > > >> > > >
> > > >> > >
> > > >> >
> > > >>
> >
> https://twitter.com/fivetran?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor
> > > >> > > > >
> > > >> > > > [image:
> > > >> > > > linkedin] <https://www.linkedin.com/company/fivetran> [image:
> > > >> > instagram]
> > > >> > > > <https://www.instagram.com/fivetran_ig/>
> > > >> > > >
> > > >> > >
> > > >> >
> > > >>
> > > >>
> > > >> --
> > > >> Arun Suri
> > > >>
> > > >> Senior Software Engineer
> > > >>
> > > >> He/him/his
> > > >>
> > > >> Engineering | Fivetran
> > > >> arun.s...@fivetran.com
> > > >> fivetran.com <//fivetran.com>
> > > >> <http://www.fivetran.com>
> > > >> [image: facebook] <https://www.facebook.com/Fivetran/> [image:
> > twitter]
> > > >> <
> > > >>
> >
> https://twitter.com/fivetran?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor
> > > >> >
> > > >> [image:
> > > >> linkedin] <https://www.linkedin.com/company/fivetran> [image:
> > instagram]
> > > >> <https://www.instagram.com/fivetran_ig/>
> > > >>
> > > >
> >
>


-- 
Arun Suri

Senior Software Engineer

He/him/his

Engineering | Fivetran
arun.s...@fivetran.com
fivetran.com <//fivetran.com>
<http://www.fivetran.com>
[image: facebook] <https://www.facebook.com/Fivetran/> [image: twitter]
<https://twitter.com/fivetran?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor>
[image:
linkedin] <https://www.linkedin.com/company/fivetran> [image: instagram]
<https://www.instagram.com/fivetran_ig/>

Reply via email to