Thanks Walaa for driving it forward, looking forward to thinking about implementation of Materialized Views.
I see Jan's point, the PR spec change is similar but does not seem to be completely aligned with the Draft Spec in the design doc: https://docs.google.com/document/d/1UnhldHhe3Grz8JBngwXPA6ZZord1xMedY5ukEhZYF-A/ . I left my comments on PR of those sections with the links to the difference. I think most of those Draft Spec proposal is still applicable after the decision to have separate Table and View objects It will be interesting to at least see drill a bit further why we did not choose the approach in the Draft Spec and chose another way. Thanks Szehon On Wed, May 8, 2024 at 4:48 AM Jan Kaul <jank...@mailbox.org.invalid> wrote: > Well, everybody that actively contributed to the discussion on the > original google doc was in consensus. That's why I brought up the topic at > the Community Sync on the 2024-02-14 (https://youtu.be/uAQVGd5zV4I?t=890) > to raise the awareness of the broader community. After which the discussion > about the storage model started. I don't think that the discussion about a > single aspect of a proposal should invalidate all other aspects of the > proposal. > > Regardless, the state of the proposal from the original google doc > contains a lot of valuable contributions from Micah, Szehon, Jack, Dan, > yourself and others and it should at least provide the basis for any > further discussion. I don't think it's effective to start with a completely > different design because we are bound to have the same discussions all over > again. > > Thanks, Jan > On 08.05.24 12:11, Walaa Eldin Moustafa wrote: > > The only consensus the community had was on the object model through the > most recent voting thread [1]. This kind of consensus was not present > during the doc discussions, and this should be evident from the fact the > last doc state listed 5 alternatives with no particular conclusion. I am > not quite sure what type of consensus we are referring to here given all > the follow up discussions, alternatives, etc. > > Due to the separate object model, the PR is fundamentally different from > the doc in the sense it does not propose a new metadata model but rather > formalizes some new table and view properties related to MVs. That is also > one reason there are no repeated discussions. That said, if you feel there > is a repeated discussion (which I do not see so far), it would be best to > link the relevant discussion from the doc in a comment. > > Happy to move the discussion elsewhere if there is sufficient support for > this idea, but as things stand, I do not see this as an efficient way to > make progress. It sounds we have been re-emphasizing the same points in the > last two replies, so I will let others chime in at this point. > > [1] https://lists.apache.org/thread/rotmqzmwk5jrcsyxhzjhrvcjs5v3yjcc > > Thanks, > Walaa. > > > On Wed, May 8, 2024 at 2:31 AM Jan Kaul <jank...@mailbox.org.invalid> > <jank...@mailbox.org.invalid> wrote: > >> The original google doc >> <https://docs.google.com/document/d/1UnhldHhe3Grz8JBngwXPA6ZZord1xMedY5ukEhZYF-A/edit?usp=sharing> >> discussed multiple aspects of the Materialized View spec. One was the >> storage model while others were related to the metadata. After we (Micah, >> Szehon, you, me) reached consensus in the google doc, Jack raised his >> concern about the storage model and the long discussion about the storage >> model started. Now we truly reached consensus about the storage model, >> which is now also reflected in the google doc. All other aspects from the >> google doc about the metadata weren't questioned and still represent the >> consensus. >> >> I would like to *avoid repeating the discussions* in your PR that we >> already had in the google doc. Especially since we reached consensus which >> took a considerable amount of time. >> >> Thanks, Jan >> On 08.05.24 10:21, Walaa Eldin Moustafa wrote: >> >> Thanks Jan. I think we moved on to more alignment steps beyond that doc a >> while ago. After that doc, we have discussed the topic further in 2 dev >> list threads and one more doc >> <https://docs.google.com/document/d/1zg0wQ5bVKTckf7-K_cdwF4mlRi6sixLcyEh6jErpGYY/edit?pli=1> >> (with strictly two options for the storage model to consider). Moreover, >> the original doc grew to 14 pages long with one section comparing 5 design >> alternatives, which made things harder to reach consensus. The lack of >> consensus is what partly led up to the subsequent discussions and call for >> a more focused approach to reach consensus. If we already have a consensus >> on the storage model (separate tables and views), I think we should take >> things further and have continued focused discussions on the specific >> metadata in the form of a PR. I have included all previous discussions >> including the original doc and issue as references in the PR description. >> Please let me know if this works. Happy to hear others' thoughts on the >> best way to move forward. >> >> Thanks, >> Walaa. >> >> >> On Wed, May 8, 2024 at 12:56 AM Jan Kaul <jank...@mailbox.org.invalid> >> <jank...@mailbox.org.invalid> wrote: >> >>> Thanks Walaa for trying to move things along. However I don't think it's >>> a good idea to start a separate discussion about the metadata for >>> materialized views because we already had this discussion and reached >>> consensus in this google doc: >>> >>> >>> https://docs.google.com/document/d/1UnhldHhe3Grz8JBngwXPA6ZZord1xMedY5ukEhZYF-A/edit?usp=sharing >>> >>> Once the draft is finalized we can adopt the PR to reflect the consensus >>> from the google doc. >>> >>> Best wishes, >>> >>> Jan >>> On 07.05.24 19:11, Walaa Eldin Moustafa wrote: >>> >>> Thanks Steven. I feel it is needed so the MV spec is not scattered >>> across the table and view spec pages. We may add a reference in each >>> respective properties section. >>> >>> On Tue, May 7, 2024 at 10:04 AM Steven Wu <stevenz...@gmail.com> wrote: >>> >>>> Walaa, thanks for initiating the next step. >>>> >>>> With the agreed model of separate view and storage table, I am >>>> wondering if a separate materialized view spec page is needed. E.g., the >>>> new view metadata (view-materialized and view-storage-table) is probably >>>> good to be added to the view page directly to avoid information scattering. >>>> The same can be said about the storage table metadata. >>>> >>>> We may keep the separate materialized view page to document motivation, >>>> freshness semantics, etc.. >>>> >>>> On Mon, May 6, 2024 at 10:58 PM Walaa Eldin Moustafa < >>>> wa.moust...@gmail.com> wrote: >>>> >>>>> Hi Everyone, >>>>> >>>>> Thanks again for participating in the modeling discussion [1]. Since >>>>> the outcome of this discussion was to model materialized views as separate >>>>> objects, an Iceberg view and a table, I think the next step should be >>>>> discussing the metadata details for each object. I have created a PR >>>>> https://github.com/apache/iceberg/pull/10280 with an initial spec >>>>> improvement. Please feel free to review it and leave feedback there. >>>>> >>>>> [1] https://lists.apache.org/thread/rotmqzmwk5jrcsyxhzjhrvcjs5v3yjcc >>>>> >>>>> Thanks, >>>>> Walaa. >>>>> >>>>>