On Fri, 12 Apr 2024, Jason Merrill wrote:

> On 4/12/24 10:35, Patrick Palka wrote:
> > On Wed, 10 Apr 2024, Jason Merrill wrote:
> > 
> > > On 4/10/24 14:48, Patrick Palka wrote:
> > > > On Tue, 9 Apr 2024, Jason Merrill wrote:
> > > > 
> > > > > On 3/5/24 10:31, Patrick Palka wrote:
> > > > > > On Tue, 27 Feb 2024, Patrick Palka wrote:
> > > > > > 
> > > > > > Subject: [PATCH] c++/modules: local type merging [PR99426]
> > > > > > 
> > > > > > One known missing piece in the modules implementation is merging of
> > > > > > a
> > > > > > streamed-in local type (class or enum) with the corresponding in-TU
> > > > > > version of the local type.  This missing piece turns out to cause a
> > > > > > hard-to-reduce use-after-free GC issue due to the entity_ary not
> > > > > > being
> > > > > > marked as a GC root (deliberately), and manifests as a serialization
> > > > > > error on stream-in as in PR99426 (see comment #6 for a reduction).
> > > > > > It's
> > > > > > also reproducible on trunk when running the xtreme-header tests
> > > > > > without
> > > > > > -fno-module-lazy.
> > > > > > 
> > > > > > This patch makes us merge such local types according to their
> > > > > > position
> > > > > > within the containing function's definition, analogous to how we
> > > > > > merge
> > > > > > FIELD_DECLs of a class according to their index in the TYPE_FIELDS
> > > > > > list.
> > > > > > 
> > > > > >     PR c++/99426
> > > > > > 
> > > > > > gcc/cp/ChangeLog:
> > > > > > 
> > > > > >     * module.cc (merge_kind::MK_local_type): New enumerator.
> > > > > >     (merge_kind_name): Update.
> > > > > >     (trees_out::chained_decls): Move BLOCK-specific handling
> > > > > >     of DECL_LOCAL_DECL_P decls to ...
> > > > > >     (trees_out::core_vals) <case BLOCK>: ... here.  Stream
> > > > > >     BLOCK_VARS manually.
> > > > > >     (trees_in::core_vals) <case BLOCK>: Stream BLOCK_VARS
> > > > > >     manually.  Handle deduplicated local types..
> > > > > >     (trees_out::key_local_type): Define.
> > > > > >     (trees_in::key_local_type): Define.
> > > > > >     (trees_out::get_merge_kind) <case FUNCTION_DECL>: Return
> > > > > >     MK_local_type for a local type.
> > > > > >     (trees_out::key_mergeable) <case FUNCTION_DECL>: Use
> > > > > >     key_local_type.
> > > > > >     (trees_in::key_mergeable) <case FUNCTION_DECL>: Likewise.
> > > > > >     (trees_in::is_matching_decl): Be flexible with type mismatches
> > > > > >     for local entities.
> > > > > > 
> > > > > > diff --git a/gcc/cp/module.cc b/gcc/cp/module.cc
> > > > > > index 80b63a70a62..d9e34e9a4b9 100644
> > > > > > --- a/gcc/cp/module.cc
> > > > > > +++ b/gcc/cp/module.cc
> > > > > > @@ -6714,7 +6720,37 @@ trees_in::core_vals (tree t)
> > > > > >         case BLOCK:
> > > > > >           t->block.locus = state->read_location (*this);
> > > > > >           t->block.end_locus = state->read_location (*this);
> > > > > > -      t->block.vars = chained_decls ();
> > > > > > +
> > > > > > +      for (tree *chain = &t->block.vars;;)
> > > > > > +   if (tree decl = tree_node ())
> > > > > > +     {
> > > > > > +       /* For a deduplicated local type or enumerator, chain the
> > > > > > +          duplicate decl instead of the canonical in-TU decl.
> > > > > > Seeing
> > > > > > +          a duplicate here means the containing function whose
> > > > > > body
> > > > > > +          we're streaming in is a duplicate too, so we'll end up
> > > > > > +          discarding this BLOCK (and the rest of the duplicate
> > > > > > function
> > > > > > +          body) anyway.  */
> > > > > > +       if (is_duplicate (decl))
> > > > > > +         decl = maybe_duplicate (decl);
> > > > > > +       else if (DECL_IMPLICIT_TYPEDEF_P (decl)
> > > > > > +                && TYPE_TEMPLATE_INFO (TREE_TYPE (decl)))
> > > > > > +         {
> > > > > > +           tree tmpl = TYPE_TI_TEMPLATE (TREE_TYPE (decl));
> > > > > > +           if (DECL_TEMPLATE_RESULT (tmpl) == decl &&
> > > > > > is_duplicate
> > > > > > (tmpl))
> > > > > > +             decl = DECL_TEMPLATE_RESULT (maybe_duplicate
> > > > > > (tmpl));
> > > > > > +         }
> > > > > 
> > > > > This seems like a lot of generally-applicable code for finding the
> > > > > duplicate,
> > > > > which other calls to maybe_duplicate/odr_duplicate don't use.  If the
> > > > > template
> > > > > is a duplicate, why isn't its result?  If there's a good reason for
> > > > > that,
> > > > > should this template handling go into maybe_duplicate?
> > > > 
> > > > Ah yeah, that makes sense.
> > > > 
> > > > Some context: IIUC modules treats the TEMPLATE_DECL instead of the
> > > > DECL_TEMPLATE_RESULT as the canonical decl, which in turn means we'll
> > > > register_duplicate only the TEMPLATE_DECL.  But BLOCK_VARS never
> > > > contains
> > > > a TEMPLATE_DECL, always the DECL_TEMPLATE_RESULT (i.e. a TYPE_DECL),
> > > > hence the extra handling.
> > > > 
> > > > Given that it's relatively more difficult to get at the TEMPLATE_DECL
> > > > from the DECL_TEMPLATE_RESULT rather than vice versa, maybe we should
> > > > just register both as duplicates from register_duplicate?  That way
> > > > callers can just simply pass the DECL_TEMPLATE_RESULT to maybe_duplicate
> > > > and it'll do the right thing.
> > > 
> > > Sounds good.
> > > 
> > > > > > @@ -10337,6 +10373,83 @@ trees_in::fn_parms_fini (int tag, tree fn,
> > > > > > tree
> > > > > > existing, bool is_defn)
> > > > > >         }
> > > > > >     }
> > > > > >     +/* Encode into KEY the position of the local type (class or
> > > > > > enum)
> > > > > > +   declaration DECL within FN.  The position is encoded as the
> > > > > > +   index of the innermost BLOCK (numbered in BFS order) along with
> > > > > > +   the index within its BLOCK_VARS list.  */
> > > > > 
> > > > > Since we already set DECL_DISCRIMINATOR for mangling, could we use
> > > > > it+name
> > > > > for
> > > > > the key as well?
> > > > 
> > > > We could (and IIUc that'd be more robust to ODR violations), but
> > > > wouldn't it mean we'd have to do a linear walk over all BLOCK_VARs of
> > > > all BLOCKS in order to find the one with the matching
> > > > name+discriminator?  That'd be slower than the current approach which
> > > > lets us skip to the correct BLOCK and walk only its BLOCK_VARS.
> > > 
> > > Ah, good point.  How about block number + name instead of the index?
> > 
> > It seems DECL_DISCRIMINATOR is only set at instantiation time and so for
> > local types from a function template pattern the field is empty, which
> > means we can't use it as the key in general :/
> 
> I meant just block number and name, without DECL_DISCRIMINATOR.  Just using
> the name instead of an index in BLOCK_VARS.

Ah, I think that'd be enough for named local types, but what about
anonymous local types?  IIUC without DECL_DISCRIMINATOR we wouldn't be
able to reliably distinguisth between multiple anonymous local types
defined in the same block, since their identifiers aren't stable given
that they're based off of a global counter (and so sensitive to #include
order) :(

> 
> Jason
> 
> 

Reply via email to