Re: Is this a bug? +goto

Michelle Long via Digitalmars-d-learn Wed, 07 Nov 2018 12:05:40 -0800

On Tuesday, 6 November 2018 at 13:53:41 UTC, MatheusBN wrote:

On Tuesday, 6 November 2018 at 05:46:40 UTC, Jonathan M Davis wrote:
On Monday, November 5, 2018 7:55:46 PM MST MatheusBN via Digitalmars-d-learn wrote:
On Tuesday, 6 November 2018 at 01:55:04 UTC, Jonathan M Davis wrote:
>> And I found a bit strange that in such code, since "x" is never used, why it isn't skipped.
>
> It's skipped right over. The goto jumps out of the scope, and the line with
>
> int x;
>
> is never run. In fact, if you compile with -w or -wi, the compiler will give you a warning about unreachable code.
That is exactly my point.
Since "x" it's skipped and never used, it shouldn't just be a warning (unreachable code) instead of an error?
I'm trying to understand why/when such code could give any problem.
On the other hand if the code were:

{
    goto Q;
    int x;

    Q:
    x = 10; // <- Now you are accessing an uninitialized variable.
}

Then I think an error would be ok.
D tries to do _very_ little with code flow analysis, because once you start having to do much with it, odds are that the compiler implementation is going to get it wrong. As such, any feature that involves code flow analysis in D tends to be _very_ simple. So, D avoids the issue here by saying that you cannot skip the initialization of a variable with goto. The compiler is not going to do the complicated logic of keeping track of where you access the variable in relation to the goto. That's exactly the sort of thing that might be obvious in the simple case but is highly likely to be buggy in more complex code. Code such as
{
    goto Q;
    int x;
}
Q:

or

{
    if(foo)
        goto Q;
    int x;
}
Q:
is fine, because the compiler can trivially see that it is impossible for x to be used after it's been skipped, whereas with something like
goto Q;
int x;
Q:
the compiler has to do much more complicated analysis of what the code is doing in order to determine that, and when the code isn't trivial, that can get _really_ complicated.
You could argue that it would be nicer if the language required that the compiler be smarter about it, but by having the compiler be stupid, it reduces the risk of compiler bugs, and most people would consider code doing much with gotos like this to be poor code anyway. Most of the cases where goto is reasonable tend to be using goto from inside braces already, because it tends to be used as a way to more efficiently exit deeply nested code. And with D's labeled break and continue, the need for using goto outside of switch statements also tends to be lower than it is in C/C++.
- Jonathan M Davis
It's clear now about this decision and by the way thanks for replying all my doubts.
MatheusBN.

Don't let their psychobabble fool you. They are wrong and you were right from the start.

There is no initialization of the variable, or, if there is (because it's "on the stack", which is "initialized" at the start of the function), the variable is still never used, and that is the whole problem.

What you will find with some of these guys is that they start with the assumption that everything D does is correct, then try to disprove anything that goes against it by coming up with reasons that explain why D does it the way it does. It is circular reasoning and invalid. At each step they come up with some new explanation when you pick holes in their previous ones.

Eventually it's either an "It's because D is not designed to do that" or a "write an enhancement yourself" type of answer.

The fact is simple: Whoever implemented the goto statement did not create code to handle this case and chose the easiest route, which is to error out. This was either oversight or "laziness".

It's really as simple as that. Not once has anyone proven that the semantics are illogical, which is what it would require for the compiler to be absolutely correct in its error.

In this case, they are simply wrong, because it requires no flow analysis or any complex logic to determine. It's not because C is stupid and is unsafe, it's unreachable, etc...

The compiler simply knows what line and scope a variable is initialized on (since it can determine if a variable is used before initialization, which is a logic error), and it simply has to determine if the goto escapes the scope before any such variable is used.


It can do this easily but the logic was not added.

Case A:
{
   if (true) goto X;
   int x;
}
X:


Case B:
{
   if (true) goto X;
   {
      int x;
   }
}
X:

These two cases are EXACTLY the same semantically. It's like writing A + B and (A + B).

What the extra scope does, though, is create a new scope in the compiler AST, and this separates the goto logic, which is properly implemented to handle that case.

The fact that one produces an error and the other is valid proves that the compiler is incomplete. Adding scopes does not change semantics, no differently than adding parentheses (which are just scope). ((((((3)))))) is the same as 3. (Obviously not all scopes can be eliminated in all cases, but this isn't one of those cases.)
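
For anyone who wants to check the two cases against their own compiler version, here is a rough sketch that asks the compiler at compile time whether it accepts each one. The names caseA, caseB and skip are made up for this example, and skip stands in for the literal `true` only so that -w does not complain about unreachable code. It assumes the goto check is reported like any other compile error inside `__traits(compiles)`; if in doubt, paste the bodies into ordinary functions instead.

```d
// Sketch only: caseA, caseB and skip are hypothetical names, not from the thread.

// Case A: the declaration sits directly in the block that the goto leaves.
enum caseA = __traits(compiles, (bool skip) {
    {
        if (skip) goto X;
        int x;
    }
    X:
    return;
});

// Case B: the same code, with the declaration wrapped in one extra scope.
enum caseB = __traits(compiles, (bool skip) {
    {
        if (skip) goto X;
        {
            int x;
        }
    }
    X:
    return;
});

// Printed while compiling; the result depends on the compiler being used.
pragma(msg, "Case A accepted: ", caseA);
pragma(msg, "Case B accepted: ", caseB);

void main() {}
```

If the two messages differ, the extra scope alone changed the outcome.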

And so the real answer is simply that the compiler does not test this case. My point with the previous post was to point it out... but as you see, a lot of the fanboys come in and simply defend what D does as if it is the most valid way from the get-go. This is their mindset. They reason from their conclusions. I've seen them do it quite often. I'm not sure what the motivations are: whether they don't understand the problem (sometimes simple is very confusing for some), or they want to obfuscate, or what.

The idea for any sane person would be to check and see if the code has a semantically logical meaning first. In this case it does. Goto is a common control flow feature and sometimes necessary to greatly simplify certain problems (since D does not have the ability to escape nested scopes with something like return3, which would return from 3 nested scopes in).
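
To make the nested-scope point concrete, here is a minimal sketch of goto being used to leave two loops at once. The function findProduct and its parameters are invented for this example; the only claim is that a forward goto to a label in an enclosing scope is ordinary D.

```d
import std.stdio : writeln;

// Hypothetical helper: finds the first (row, col) in 1..9 whose product is target.
bool findProduct(int target, out int row, out int col)
{
    bool found = false;
    foreach (r; 1 .. 10)
    {
        foreach (c; 1 .. 10)
        {
            if (r * c == target)
            {
                row = r;
                col = c;
                found = true;
                goto done; // leaves both loops in one step
            }
        }
    }
done:
    return found;
}

void main()
{
    int row, col;
    if (findProduct(12, row, col))
        writeln(row, " x ", col); // prints "2 x 6"
}
```

Every variable here is declared either before the goto or inside the loops being left, so nothing is skipped on the way to the label.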

If one can logically transform the "offending" code into a semantically equivalent piece of code (this is known as mathematical transformation, such as rewriting a mathematical expression using logically valid rules) that involves no real changes (such as adding scopes), and one fails and the other doesn't, it means the compiler has a bug.


It's like when people drop parentheses: (3 + 4)*2 =?= 3 + 4*2.

It's illogical. If the compiler did this transformation it would produce invalid results and it would be impossible to reason about code.

If the compiler gives errors for one of two identical mathematical trees (remember, programs are just mathematical formulas, just really complex, but ASTs are abstractly the same), then the compiler has a problem.


It's like saying that (3 + 4)*2 is invalid but 3*2 + 4*2 is valid.

It means the compiler did not implement the distributive property.

People that don't know what they are talking about will then try to justify why one works and the other doesn't using some circular or invalid logic rather than actually understanding what is going on. It is damn near impossible to reason with these people because they always start with their conclusion and try to make all the pieces fit that conclusion. Sometimes they eventually come around to a logical conclusion, but only after they've created a rat's nest of reasons and cannot proceed any further but to say, basically, "it is what it is".

The problem is they still never understand what the actual problem is... (because of the rat's nest, they have just made themselves even more confused)

The problem with the goto is clearly stated, and to counter it as being illogical one must simply prove one example where it would result in invalid logic (not crapping out the compiler... the compiler is not perfect and so will have bugs and errors in it. The goal is not to justify those bugs and errors but to fix them so the compiler does a better job and is more logically expressive).

e.g., two cases (the `Case` term is not part of a switch in D, just used to denote the two possible scenarios)



Case A:

{
   if (true) goto X;
   int x;
}
X:


Case B:

{
   if (true) goto X;
   {
      int x;
   }
}
X:

Why is case A any different than case B (in general; the above is an example, and the compiler might optimize things, which we don't want to consider since optimizations are secondary effects that are not as important as logical consistency)? We are simply talking about the pure semantics of programming. It doesn't really matter what language we use to express it. This is not a problem in D but a problem in programming languages. The question is simply: Are the two cases semantically equivalent? (e.g., does (3) = 3? (5) = 5, (x) = x, (((((x+y*3))))) = x+y*3, etc.)

Since we are not thinking of any specific compiler (although we have to use the syntax and language grammar of D since ultimately it has to do with D and it has to be expressed in some language, so D is the obvious choice), we can't use circular reasoning (e.g., D does it this way and D is right so...).

Now, the fact is, these are identical statements semantically... trivially so. It really can't get any simpler. Doesn't matter what D does. If D can't see that then D is incomplete.

Now, since we ultimately have to translate into D and compilers do strange things, it is possible that *in D* they are not identical. E.g., if D inserted initialization of locals at the start of scope and de-initializers at the end of scope, they would not be the same.



which one could express as:

Case A:

int x;
{
   if (true) goto X;
   //int x;
}
~x; // pseudocode: de-initialize x
X:


Case B:

{
   if (true) goto X;
   int x;
   {
      //int x;
   }
   ~x; // pseudocode: de-initialize x
}
X:

From which it is clear that x is initialized before the goto in case A and after in case B. This could cause problems (chances are if D did something like this then it would result in invalid programs and compiler bugs at some point).

Sometimes, though, because compilers are very complex, it is necessary to prevent certain cases from occurring so certain other semantics can be used. Sometimes compilers simply crap out precisely because that is the easiest thing to do. Of course, if this is done, someone should know about it and be able to explain why the compiler chose to do this rather than the most logical thing.

Don't let people bludgeon you into submission. Truth and logic are not dictatorial but absolute.
