Hi,

As there has been no further input from the community, I created a PR
to remove the unused files under the `deploy` dir:
https://github.com/apache/cloudberry/pull/1090. Please help review it.

I kept the `deploy/build` files unremoved as a revision base in the
coming months to avoid creating them from scratch.

Best,
Dianjin Wang

On Fri, Apr 25, 2025 at 6:01 PM Ed Espino <esp...@apache.org> wrote:
>
> Hi Dianjin,
>
> Thanks for the detailed context and thoughtful suggestions.
>
> I agree with your assessment that much of the content under
> deploy/—particularly
> docker/, k8s/, script/, vagrant, cbdb_deploy.sh, and cbdb_env.sh—is
> outdated and not aligned with Cloudberry's current or future direction as a
> community-led Apache project.
>
> Given that these artifacts rely on internal or deprecated setups and are
> not part of any active public workflow, I propose we proceed with deleting
> the entire deploy/ directory. This would remove ambiguity for new
> contributors and make the codebase cleaner and more compliant with ASF’s
> principles of openness and transparency.
>
> We can leave a placeholder issue or GitHub Discussion open to encourage
> community contributions toward a modern, accessible deployment workflow.
> Docker Compose seems like a natural starting point, and it would be great
> to see contributors take the lead in shaping tools that better reflect
> real-world usage scenarios across platforms.
>
> Let’s continue to gather input from others, but unless there are
> objections, we could aim to move forward with the cleanup in the next
> couple of weeks.
>
> Best,
>
> -=e
>
> -=e
> --
> Ed Espino
> Apache Cloudberry (Incubating) & MADlib
>
>
> On Fri, Apr 25, 2025 at 1:28 AM Dianjin Wang <wangdian...@gmail.com> wrote:
>
> > Hi Ed,
> >
> > Thanks for bringing this up — I’d like to contribute some context and
> > suggestions as follows.
> >
> > Best,
> > Dianjin Wang
> >
> >
> > On Fri, Apr 25, 2025 at 5:32 AM Ed Espino <esp...@apache.org> wrote:
> > >
> > > - Is the `deploy/` directory still in active use or maintained?
> >
> > Files in `deploy/build/` were originally scattered under the top-level
> > dir in Greenplum to guide users in building the project. We moved them
> > here for better organization. These scripts and files are used to help
> > users build Cloudberry on Linux and macOS. However, they do need
> > updates — including bash compatibility, dependency management, and OS
> > support. We might cherry-pick updates from Greenplum or revise them
> > ourselves.
> >
> > The `vagrant/` directory, previously under `src/tools/vagrant/` in
> > Greenplum, was also moved here. However, these files have not been
> > maintained upstream for over 5–6 years and use VirtualBox as the
> > provider, which has poor support for Apple silicon in my previous
> > testing (I'm not sure how it goes now).
> >
> > > - Was it contributed primarily to support internal workflows, or is it
> > > intended to serve as a general deployment guide for the community?
> >
> > Files under `k8s/`, `docker/`, `script/`, as well as `cbdb_deploy.sh`
> > and `cbdb_env.sh`, were initially to support internal workflows. As
> > you can see, some internal URLs referenced in the files are not usable
> > by the general public.
> >
> > To my knowledge, these are also not used in the public Cloudberry
> > workflows.
> >
> > > - If it isn’t currently usable without access to internal resources,
> > should
> > > we clarify that in the README or revisit how it’s presented?
> >
> > For legacy and unused files, I propose we remove them entirely:
> > - `docker/`
> > - `k8s/`
> > - `script`
> > - `vagrant`
> > - `cbdb_deploy.sh`
> > - `cbdb_env.sh`
> >
> > > - Would it be valuable to provide a community-accessible deployment path
> > > using standard open tools (e.g., Docker Compose, Helm charts, Terraform)?
> >
> > +1. Starting with Docker Compose would be a great first step. It could
> > help both new users and contributors get started more easily.
> >
> > > - Does the current setup align with ASF expectations around transparency
> > > and community-driven development?
> >
> > As suggested above, once we clean up the internal and deprecated
> > files, it should be more aligned with ASF’s expectations.
> >
> > > While it’s entirely understandable that early artifacts might reflect
> > their
> > > original environment, continued reliance on internal infrastructure could
> > > make onboarding harder for external users — and may not reflect the
> > Apache
> > > spirit of openness and independence.
> >
> > Agree, I acknowledged with the vendor engineers that these files are
> > not used anymore in their internal workflows. So we can clean up these
> > files for better openness and public access.
> >
> > ~~~
> >
> > BTW, I started a GitHub Discussion on cleanup of `deploy` dir[1] back
> > in Feb, but we can continue the deep discussion on this mailing thread
> > to gather more feedback and suggestions.
> >
> > [1] https://github.com/apache/cloudberry/discussions/948
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: dev-unsubscr...@cloudberry.apache.org
> > For additional commands, e-mail: dev-h...@cloudberry.apache.org
> >
> >

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@cloudberry.apache.org
For additional commands, e-mail: dev-h...@cloudberry.apache.org

Reply via email to