backslash whitespace newline

Howard Hinnant Mon, 24 Oct 2005 15:45:38 -0700

I've been reviewing the age-old issue of interpreting<whitespace>*<newline> as the end-of-line indicator as is the currentpractice with gcc. For those not familiar with this issue, gcc takesadvantage of C99's 5.1.1.2p1b1 "implementation-defined manner" toconvert multibyte end-of-line indicators to newline characters. gccconsiders zero or more whitespace characters preceding a moretraditional CR and/or LF as the end-of-line indicator. This behaviorcan cause differences in some code compared to compilers which do notstrip trailing whitespace off of lines. For example:


// comment \
int x;
int y;

Pretend there's one or more spaces or tabs after the '\'. gcc willinterpret this as:


A:

// comment int x;
int y;

while other compilers (Microsoft, EDG-based, CodeWarrior to name afew) interpret it as:


B:

// comment
int x;
int y;

And depending on what you're trying to do, either A or B is the"correct" answer. I've seen code broken either way (by assuming Aand having the compiler do B and vice-versa).

This issue has recently been discussed on the C standards reflector,and though I was not privy to that discussion, my understanding isthat the likely resolution from this standards body will be that acompiler implementing either A or B is conforming.

That being said, gcc to the best of knowledge, is the only moderncompiler to implement end-of-line whitespace stripping (yes I'm awareof older compilers and dealing with punch cards). So on the basis ofconforming to a de-facto standard alone, I propose that gcc abandonend-of-line whitespace stripping, or at least strip 2 or morewhitespace characters down to 1 space instead of to 0 spaces duringtranslation phase 1.

I realize that this change could break some existing code. But I amalso aware of existing code wishing to port to gcc which is broken bygcc's current behavior. If we want gcc to "gain market share", doesit not make sense to "welcome" new comers when possible by adoptingwhat is otherwise industry-wide practice?


Thanks,
Howard

backslash whitespace newline

Reply via email to