+1, I remember exploring this while exploring a way for compaction for iceberg tables for a Hive usecase, got some good pointers for cleaning up orphan files, I think it was using a pretty old version of Hive(3.1.1 I believe), so couldn't pull it in as dependency in Hive master branch itself, which was my initial plan.
But overall, it was some good code. Good Luck!!! -Ayush On Fri, 23 Feb 2024 at 14:15, Justin Mclean <jus...@classsoftware.com> wrote: > Hi, > > I would like to propose a new project to the ASF incubator - Apache Amoro. > I’m one of the mentors, but there are a lot of other people involved who > have done all of the hard work. > > Amoro is a Lakehouse management system built on open data lake formats > like Apache Iceberg and Apache Paimon (Incubating). Working with compute > engines including Apache Flink, Apache Spark, and Trino, Amoro brings > pluggable and self-managed features for Lakehouse to provide out-of-the-box > data warehouse experience, and helps data platforms or products easily > build infra-decoupled, stream-and-batch-fused and lake-native architecture. > You can find the proposal here. [1] > > We are looking forward to anyone's feedback or questions. > > Thanks, > Justin > > [1] https://cwiki.apache.org/confluence/display/INCUBATOR/AmoroProposal > --------------------------------------------------------------------- > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org > For additional commands, e-mail: general-h...@incubator.apache.org > >