C++: Speed up the `cpp/unbounded-write` query for an upcoming change #18485

MathiasVP · 2025-01-13T16:10:14Z

This fixes a performance problem in #18017 on some projects compiled as C code (where we infer many new guards on #18017 which causes the barrier to explode in size).

This PR adds a very cheap pruning phase to the cpp/unbounded-write query. The "cheapness" comes from the fact that we only evaluate the first stage of dataflow. This provides enough pruning to make the performance impact of #18017 negligible (compare this run with this run).

Copilot

Copilot wasn't able to review any files in this pull request.

Files not reviewed (2)

cpp/ql/lib/semmle/code/cpp/controlflow/IRGuards.qll: Language not supported
cpp/ql/src/Security/CWE/CWE-120/UnboundedWrite.ql: Language not supported

Tip: Copilot only keeps its highest confidence comments to reduce noise and keep you focused. Learn more

paldepind

Looks good to me!

paldepind · 2025-01-13T20:33:38Z

cpp/ql/lib/semmle/code/cpp/controlflow/IRGuards.qll

+ * To find the specific guard that performs the comparison
+ * use `IRGuards.comparesLt`.
+ */
+predicate comparesLt(Operand left, Operand right, int k, boolean isLt, AbstractValue value) {


Why do we want this predicate and the one below, given that they just forward arguments to compares_eq and compares_lt?

The idea is that the fanout from g to g.comparesLt(...) is quite large on certain repositories, so we want to create a way to compute a version of comparesLt that can be used to prune the set of guards we want to look at early (in the interestingLessThanOrEqual predicate below).

So the important part is that there's no GuardCondition as a column to this predicate (as that is what's giving the large fan-out). We can include the ValueNumber (i.e., the first column from compares_lt) that is currently omitted from this predicate, but since it wasn't needed for the pruning in UnboundedWrite I decided to leave it out to not expose that implementation detail.

paldepind · 2025-01-14T06:50:06Z

cpp/ql/src/Security/CWE/CWE-120/UnboundedWrite.ql

+predicate interestingLessThanOrEqual(Operand left) {
+  exists(DataFlowImplCommon::NodeEx node |
+    node.asNode().asOperand() = left and
+    BarrierFlow::Stages::Stage1::sinkNode(node, _)


So my understanding/guessing is that stage one does some cheap flow starting at sources to find sink that could be interesting, and we are using that capability to prune down barriers as well.

Is this something that the data flow library could actually do itself? Prune down barriers in this way. Or could it be bad to do more generally?

The problem is that the lessThanOrEqual predicate itself blows up due to the fanout from g in g.comparesLt(left, _, _, true, branch) (or g.comparesEq(left, _, _, true, branch)) on some projects containing a function with many guards that guard equivalent expressions (so that there are many guards for the same left).

As an alternative we could've inlined lessThanOrEqual and isBarrier, and then we'd most likely get to a point where the resulting predicate was restricted by an internal call inside the dataflow library that ensures that the node being barrier'd is part of the path from a source to a sink, but this more robust 🙂

C++: Speed up the 'cpp/unbounded-write' query.

2d44b33

Copilot AI review requested due to automatic review settings January 13, 2025 16:10

MathiasVP requested a review from a team as a code owner January 13, 2025 16:10

Copilot AI reviewed Jan 13, 2025

View reviewed changes

MathiasVP added the no-change-note-required This PR does not need a change note label Jan 13, 2025

github-actions bot added the C++ label Jan 13, 2025

paldepind approved these changes Jan 14, 2025

View reviewed changes

MathiasVP merged commit aa55b8e into github:main Jan 14, 2025
14 of 15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

C++: Speed up the `cpp/unbounded-write` query for an upcoming change #18485

C++: Speed up the `cpp/unbounded-write` query for an upcoming change #18485

Uh oh!

MathiasVP commented Jan 13, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

paldepind left a comment

Uh oh!

paldepind Jan 13, 2025

Uh oh!

MathiasVP Jan 14, 2025

Uh oh!

paldepind Jan 14, 2025

Uh oh!

MathiasVP Jan 14, 2025

Uh oh!

Uh oh!

Uh oh!

C++: Speed up the cpp/unbounded-write query for an upcoming change #18485

C++: Speed up the cpp/unbounded-write query for an upcoming change #18485

Uh oh!

Conversation

MathiasVP commented Jan 13, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

paldepind left a comment

Choose a reason for hiding this comment

Uh oh!

paldepind Jan 13, 2025

Choose a reason for hiding this comment

Uh oh!

MathiasVP Jan 14, 2025

Choose a reason for hiding this comment

Uh oh!

paldepind Jan 14, 2025

Choose a reason for hiding this comment

Uh oh!

MathiasVP Jan 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

C++: Speed up the `cpp/unbounded-write` query for an upcoming change #18485

C++: Speed up the `cpp/unbounded-write` query for an upcoming change #18485