Disjoint analysis and analysis filters #208

N1ark · 2025-12-21T17:16:04Z

Based on #207
Fixes #173

Long PR description because changes are a bit obscure maybe...

Add a "disjoint" analysis, that tracks inequalities. Anytime it learns either !(a == b) or distinct(a, b, ...), it stores the inequalities. It can then reduce equalities to true/false when possible!
To avoid its state being too large, it only stores inequalities for: values of type location, or simple variables, or concrete bitvectors; nothing else.

Furthermore (and this should be discussed), it does a trick where if it absorbs information regarding a variable of type location, it does nos dirty those variables in the PC. In other words, it will learn e.g. |1| != |2|, and |1| and |2| won't be marked as dirty (meaning they become relevant for the next SAT check). This avoids sending a sat check everytime we create a new location to verify that all locations can in fact be distinct (since we "know" they can be).
Add a filter : t -> Var.t -> Svalue.ty -> Svalue.t Iter.t -> Svalue.t Iter.t function to Analysis.S. It allows an analysis to filter the possible values for a variable of a given type, using its knowledge.

This is used in the trivial model check in Bv_solver. We generate an infinite iterator of values for each type, and then filter that with the analyses. We then pick the first N possible values (currently 3), and do model checks with these values. This works a lot better than the previous method (using the variable's ID) and even just doing 2 attempts is usually a much better improvement.

The filter of each analysis does:
- Disjoint: if we know the variable is distinct to the iterated value, we filter it out.
- Equality: if we know some equality for this variable, we substitute the entire iterator by a singleton iterator with that value. E.g. if we know V|1| == V|2| (and decide V|1| is cheaper than V|2|), for a filter on variable |2| we return singleton |1|. Bv_solver is then clever enough to resolve that to the same value as V|1| in the model check!
- Interval: if we know some interval for the variable, we again substitute the entire iterator by a (non-shuffled) iterator over all valid values. The reason we override the iterator rather than filtering is because in cases where the interval is very narrow (e.g. [0; 10]), filtering means we will randomly generate numbers until we get N values that randomly fall in that range, which is insanely slow. I need to see if shuffling this iterator (requires some work) is a better heuristic!
Re-add the AddOvf binary operator to check for overflows. We used to translate these into some comparison of the operand signs and it would just create an unnecessarily large expression, since analyses could usually not learn anything from it.
Remove Declared_vars from Bv_solver; instead we learn the types of variables by using iter_var. Much nicer, in my opinion.
The minimum value of a pointer in Rust is now its alignment; this helps the trivial model check a lot, since instead of starting at 1 for e.g. alignment 8, it starts at 8 and instantly finds a model :)
Added some reductions

giltho · 2025-12-27T10:03:50Z

I think that one I can't just review in the car like the previous ones. I need to go a bit deeper and understand the implications of not dirtying variables.

I'm not sure that's valid because it is possible for two locations to be equal.

N1ark · 2025-12-29T09:55:45Z

I'm not sure that's valid because it is possible for two locations to be equal.

That's fine though? It will query whether L1 and L2 are equal, and the disjoint analysis will just add to the constraints all inequalities relating to L1 and L2.

Feel free to add a test if you want to check but I think it's sound (modulo when you reach 2^N locations and Z3 would tell u they can't all be distinct)

N1ark requested a review from giltho as a code owner December 21, 2025 17:16

N1ark added soteria-core Issues related to the Soteria core library performance Issues relating to performance improvements solver Solver related issues and PRs: new solvers, changes to encoding, etc. labels Dec 21, 2025

N1ark changed the title ~~Patricia trees of values~~ Disjoint Analysis and Analysis Filters Dec 21, 2025

N1ark changed the title ~~Disjoint Analysis and Analysis Filters~~ Disjoint analysis and analysis filters Dec 21, 2025

giltho mentioned this pull request Dec 26, 2025

Use Patricia trees #207

Merged

Base automatically changed from patricia-optims to main December 29, 2025 18:13

N1ark added 8 commits December 29, 2025 19:21

Add a disjoint analysis

7ee200a

Better reduce Svalue.distinct

b4f2d26

Handle not (distinct _) in eq analysis

31680ba

Re-introduce AddOvf

b51431d

Reduce n +sovf x

7eff7a3

Remove Declared_vars from non-incr Bv_solver

6d30d85

Add Analyses.filter

e7a27f8

Fix interval analysis filter

449005f

N1ark force-pushed the disjoint-analysis branch from 726bb59 to 2d4b883 Compare December 29, 2025 18:23

N1ark added 5 commits December 29, 2025 19:23

Tests

e3b4b5c

Lower bound of pointers is their alignment!

a8a2c47

hacky: ineqs for locations are never dirty

b934e84

Fix concrete reduction of ashr

626610a

More reductions

565ae94

N1ark force-pushed the disjoint-analysis branch from 2d4b883 to 565ae94 Compare December 29, 2025 18:23

Make equality analysis store equalities

9044f87

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Disjoint analysis and analysis filters #208

Disjoint analysis and analysis filters #208

Uh oh!

N1ark commented Dec 21, 2025

Uh oh!

giltho commented Dec 27, 2025

Uh oh!

N1ark commented Dec 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Disjoint analysis and analysis filters #208

Are you sure you want to change the base?

Disjoint analysis and analysis filters #208

Uh oh!

Conversation

N1ark commented Dec 21, 2025

Uh oh!

giltho commented Dec 27, 2025

Uh oh!

N1ark commented Dec 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants