On 14/09/2021 20:48, Rafał Pietrak wrote:
W dniu 13.09.2021 o 13:41, David Brown pisze:
On 13/09/2021 11:51, Rafał Pietrak via Gcc wrote:
Hi,
Thenk you for very prompt reply.
(I'm not sure how much this is a "gcc development list" matter, rather
than perhaps a "gcc help list" or perhaps comp.arch.embedded Usenet
group discussion, but I'll continue here until Jonathan or other gcc
developers say stop.)
Thank you. I appreciate it.
And yet, I do think, that it's about the "core" gcc - it's about a
programmer is able to "talk to gcc" without much "missunderstanding".
But I'm not going to push it much more. I accept, that "talking to gcc"
with d->cr1 &= ~ { ... } syntax is not what majority of programmers
would like to be able to do.
The gcc developers are always interested in new ideas. But they are not
keen on new syntaxes or extensions - there has to be a /very/ good
reason for them. The days of gcc being a maverick that makes up its own
improved C are long, long gone - they'd much rather work with the C and
C++ committees towards improving the languages in general, instead of
having more of their own non-standard additions. So an extension like
this should ideally be a proposal for adding to C23, though gcc could
always implement it earlier. And while I appreciate what you are trying
to do here, it is simply not general enough or important enough to
justify such changes. To get a new feature implemented, it has to do
something you could not do before, or do it /far/ more simply, clearly,
safely or efficiently.
W dniu 13.09.2021 o 10:44, Jonathan Wakely pisze:
On Mon, 13 Sept 2021 at 07:55, Rafał Pietrak via Gcc <gcc@gcc.gnu.org> wrote:
[-----------------]
#elif VARIANT_WORKING
struct cr_s a = (struct cr_s) {
.re = 1,
.te = 1,
.rxneie = 1,
.txeie = 1,
.ue = 1 };
int *b = (int *) &a;
d->cr1 &= ~(*b);
This is a strict aliasing violation. You should either use a union or
memcpy to get the value as an int.
Yes, I know. I know, this is a "trick" I use (I had to use to missleed
gcc).
Don't think of it as a "trick" - think of it as a mistake. A completely
unnecessary mistake, that will likely give you unexpected results at times :
union {
struct cr_s s;
uint32_t raw;
} a = {(struct cr_s) {
.re = 1,
.te = 1,
.rxneie = 1,
.txeie = 1,
.ue = 1 }
};
d->cr1 &= ~(a.raw);
Ha! This is very nice.
But pls note, that if contrary to my VARIANT_WORKING this actually is
kosher (and not an error, like you've said about the my "WORKING"), and
it actually looks very similar to the VARIANT_THE_BEST ... may be there
is a way to implement VARIANT_THE_BEST as "syntactic trick" leading
compiler into this semantics you've outlined above?
The issue I noticed with your "WORKING" is the bit order - that is
orthogonal to the code structure, and easy to fix by re-arranging the
fields in the initial bit-field. It is independent from the structure
of the code.
I'm raising this questions, since CR1 as int (or better as uint32_t) is
already declared in my code. Compiler shouldn't have too hard time
weeding out struct cr_s from union embedding it.
The code to convert between a bit-field of size 32-bit and a uint32_t is
nothing at all, and the compiler can handle that fine :-) But you have
to write the source code in a way that the conversion is well defined
behaviour. When you write something that is not defined behaviour, the
compiler can generate code in ways you don't expect because technically
you haven't told it what you actually want.
I hope you also realise that the "VARIANT_TRADITIONAL" and
"VARIANT_WORKING" versions of your code do different things. The ARM
Cortex-M devices (like the STM-32) are little-endian, and the bitfields
are ordered from the LSB. So either your list of #define's is wrong, or
your struct cr_s bitfiled is wrong. (I haven't used that particular
device myself, so I don't know which is wrong.)
I'm sory. this is my mistake. I've taken a shortcut and quickly written
an at hoc '#defines' for the email. In my code I have:
enum usart_cr1_e {
USART_SBK, USART_RWU, USART_RE, USART_TE, USART_IDLEIE,
USART_RXNEIE, USART_TCIE, USART_TXEIE, USART_PEIE, USART_PS,
USART_PCE, USART_WAKE, USART_M, USART_UE,
};
And gcc produces exactly the same code in both variants:
8000c56: 68d3 ldr r3, [r2, #12]
8000c58: f423 5302 bic.w r3, r3, #8320 ; 0x2080
8000c5c: f023 032c bic.w r3, r3, #44 ; 0x2c
8000c60: 60d3 str r3, [r2, #12]
Great.
Also - perhaps beside the point, but good advice anyway - for this kind
of work you should always use the fixed-size <stdint.h> types such as
"uint32_t", not home-made types like "uint". And you always want
unsigned types when doing bit operations such as bitwise complement.
Yes. That's true. No problem here.
Using a union of the bitfield struct with a "raw" combined field is a
common idiom, and gives you exactly what you need here. It is
guaranteed to work in C, unlike your code (which has undefined
behaviour). If it is important to you, you should note that it is not
defined behaviour in C++ (though maybe gcc guarantees it will work - the
documentation of "-fstrict-aliasing" is not clear on the matter).
Somehow I've missed it until now, always falling back to things like
(int *) or even (void *). That's a good lesson for me.
I always view any use of "void *" with suspicion - it is telling the
compiler you don't know your real type. It has its uses, of course, but
I prefer to be more explicit when I can. I would never use a cast to
"int *" for accessing anything other than an int, and even with
"uint32_t *" I'd normally have a "volatile" in there, and probably have
an access function or wrapper too. (With "volatile", the compiler will
follow your instructions strictly - it lets you do messy casts safely,
but can lead to inefficiencies.)
As Jonathan says, a small inline function or macro (using gcc's
extension of declarations and statements inside an expression) can be
used for a wrapper that will work simply and efficiently while being
safe for both C and C++ :
#define raw32(x) \
({ uint32_t raw; memcpy(&raw, &x, sizeof(uint32_t)); raw;})
struct cr_s a = (struct cr_s) {
.re = 1,
.te = 1,
.rxneie = 1,
.txeie = 1,
.ue = 1 }
};
d->cr1 &= ~raw32(a);
May be I'll test it in some future. For the time being I'm very
satisfied with the assignment to a temporary union variable.
but having said that, I strongly believe (particularly if union variable
is kosher), that gcc recognising VARIANT_THE_BEST as a syntax shortcut
to such union temporary variable could significantly improve code
readibility.
Type-punning via unions is guaranteed to work in C, but undefined
behaviour in C++. (gcc may choose to define it, however.)
For the safest and most portable way of accessing raw memory, memcpy()
always works. gcc (and many other compilers) fully optimise memcpy()
calls that have fixed sizes like this - the code generated here is as
optimal as any other solution. Some compilers, however, will give you a
library call which is obviously not efficient.
gcc has another extension that can also be useful for this kind of thing:
typedef uint32_t __attribute__((may_alias)) uint32_a;
Now you can safely use a cast to "uint32_a *", rather than "uint32_t *",
and access data - gcc knows it may alias the struct :
struct cr_s a = ....
d->cr &= ~ *((uint32_a*) &a);
So thank you for extended explanation, and I'm currently staying off the
list hoping that my perspective will eventually catch some traction :)
This is still copied to the list. If you want to look a bit at the
list, without following everything, I recommend using the gmane.org
Usenet gateway that provides a lot of mailing lists in convenient Usenet
format. There is plenty of interesting discussions on the gcc mailing
lists. I personally haven't done any gcc development - I'd love to do
so, but haven't had the time to dig into it - but I sometimes get to
help others and save the real developers some time.
mvh.,
David
Thank you again,
Rafał