vsavchenko added a comment.

That's awesome, just a few stylistic tweaks and tests and we are to land!



================
Comment at: clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp:1561-1562
+
+    if (!Constraint.getConcreteValue())
+      return State;
+
----------------
I think we need a comment why we care about this early exit.


================
Comment at: clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp:1564
+
+    // Simplify existing constraints.
+    SValBuilder &SVB = getSValBuilder();
----------------
Here I also think that we need to give more context to the readers, so they 
understand what simplification you are talking about here.


================
Comment at: clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp:1566-1569
+    ConstraintRangeTy Constraints = State->get<ConstraintRange>();
+    for (std::pair<EquivalenceClass, RangeSet> ClassConstraint : Constraints) {
+      EquivalenceClass Class = ClassConstraint.first;
+      SymbolSet ClassMembers = Class.getClassMembers(State);
----------------
You don't actually use constraints here, so (let me write it in python) instead 
of:
```
[update(classMap[class]) for class, constraint in constraints.items()]
```
you can use
```
[update(members) for class, members in classMap.items()]
```


================
Comment at: clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp:1575
+        if (SimplifiedMemberSym && MemberSym != SimplifiedMemberSym) {
+          // Add the newly simplified symbol to the equivalence class.
+          State =
----------------
It would be great if we provide some justification why we do merge here.


================
Comment at: clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp:1562-1578
+    ConstraintMap CM = getConstraintMap(State);
+    for (auto const &C : CM) {
+      const SymbolRef &ParentSym = C.first;
+      SValBuilder &SVB = getSValBuilder();
+
+      SVal SimplifiedParentVal =
+          SVB.simplifySVal(State, SVB.makeSymbolVal(ParentSym));
----------------
martong wrote:
> vsavchenko wrote:
> > I tried to cover it in the comment to another patch.  This solution 
> > includes a lot of extra work and it will lose equality/disequality 
> > information for simplified expressions, and I think it's safe to say that 
> > if `a == b` then `simplify(a) == b`.
> > 
> > Let's start with `getConstraintMap`.  It is a completely artificial data 
> > structure (and function) that exists for Z3 refutation.  It's not what we 
> > keep in the state and it has a lot of duplicated constraints.  If we have 
> > an equivalence class `{a, b, c, d, e, f}`, we store only one constraint for 
> > all of them (thus when we update the class, or one of the members receives 
> > a new constraint, we can update all of them).  `getConstraintMap` returns a 
> > map where `a`, `b`, `c`, `d`, `e`, and `f` are mapped to the same 
> > constraint.  It's not **super** bad, but it's extra work constructing this 
> > map and then processing it.
> > 
> > Another, and more important aspect is that when you `setConstraint`, you 
> > lose information that this symbol is equal/disequal to other symbols.  One 
> > example here would be a situation where `x + y == z`, and we find out that 
> > `y == 0`, we should update equivalence class `{x + y, z}` to be a class 
> > `{x, z}`.  In order to do this, you need to update two maps: `ClassMap` 
> > (it's mapping `x + y` to `{x + y, z}`) and `ClassMembers` (it's mapping `{x 
> > + y, z}` to `x + y` and `z`).
> > 
> > Similar example can be made with `x + y != z`, but updating `ClassMap` and 
> > `ClassMembers` will fix it.  And you don't even need to touch the actual 
> > mapping with the actual constraints.
> Absolutely, great findings!
> 
> I think the most straightforward and consistent implementation of updating 
> `ClassMap` and `ClassMembers` is to directly use the `merge` method. I.e. we 
> can merge the simplified symbol (as a trivial eq class) to the existing 
> equivalence class. Using `merge`, however, would not remove the 
> non-simplified original symbol. But this might not be a problem; rather it is 
> a necessity (as the child patch demonstrates) it might be very useful if we 
> can find the symbol (without simplification, i.e. as written) in the 
> `ConstraintRange` map. Do you see any drawbacks of reusing `merge` here?
Oh, that's actually even better!

If we consider the following example.
Let `a + b == c` and `a == d` be known and `b == 0` to be a new constraint. 
Then your approach will help us to figure out that `c == d`.

So, you found a great way!

I think that we should still add the test cases I briefly described in my 
previous comment and that one from above.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D103314/new/

https://reviews.llvm.org/D103314

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to