Re: I'm sorry, but this is unacceptable (union members and ctors)

Brooks Moses Fri, 15 Jun 2007 21:51:23 -0700

michael.a wrote:

It would be interesting for someone to try to make a practical argument that
is anything but a nest of technicalities, as to why ctors and unions
shouldn't be mixable.

The Fortran language specification allows essentially this, although theterms are initializers and equivalences rather than ctors and unions.Just this week, I reviewed the patch to add this functionality to theGCC Fortran front end, and I wrote a bit of the infrastructure it uses,so I can speak somewhat to the problems of implementing it.

(This was PR29786; you can see how long it took before it was fixed,even though it was a regression against the old Fortran front end, andwas also for quite a while only one of two elements of the Fortranstandard that we hadn't implemented yet.)

In Fortran, the rule is that any element in an equivalence (i.e., union)can be initialized, so long as no two initializers attempt to initializethe same piece of memory to different values.

The implementation for this creates an unsigned-char buffer arrayrepresenting target memory, and goes through every initializer (i.e.,ctor) in the equivalence, converting their values into theirtarget-memory representations, checking to see what bits of memory theytouch and whether those have already been initialized to somethingdifferent, and then writing them into the buffer array. Then, anentirely new initializer is created from that buffer array.

That all had to be built on a fair pile of front-end code to convertvalues into their target-memory representations, and then rather morecode that was essentially a special-purpose initializer constructor todeal with the buffer array.

A lot of the trickiness is in exactly how you specify what's allowed.The Fortran rule requires explicitly simulating the target memorystorage and checking byte-value versions of the initializers againsteach other, which is a rather messy thing to be doing in the front end,but it's at least simple to specify.

An alternate version would be to specify that overlapping ctors are notallowed even if they do result in the same byte-values. Aside frombeing a somewhat arbitrary restriction, this doesn't simplify thingsvery much, since the front end still needs to look pretty deeply intothe target memory representation to see if things overlap.

The version we used to have in the Fortran front end was simply to onlyallow one item in each equivalence to have an initializer. That seemedto work without doing anything particular to the initializers, but I'mnot sure whether things are tracked in the other front ends in ways thatwould make enforcing such a rule easy -- and, very likely, it wouldn'twork for the example you describe (with a four-number rectangle beingunioned with two two-point vectors) because you have two vectors in theunion and they both have initializers. It's also a rather arbitraryrule that's not the sort of thing one would really want in a languagestandard.

Now, as for "shouldn't"? I can't speak to that, given that the Fortrancommittee thought it a valuable feature to include, and that we didimplement it and it works. Well, mostly works, at least -- I wouldn'tat all swear that we've got all the bugs out of it yet. But it was apain, and it (along with one other feature that required simulating thewriting of things to target memory) required an amount of effort toimplement that was dramatically out of proportion to the importance ofthe feature.


- Brooks

Re: I'm sorry, but this is unacceptable (union members and ctors)

Reply via email to