[Cranelift] fold `(or ...) + (neg ...)` to `(and ...)` #11639

bongjunj · 2025-09-08T13:11:25Z

This adds (rule (simplify (iadd ty (bor ty x y) (ineg ty y))) (band ty x (bnot ty y)))

bongjunj · 2025-09-08T13:21:10Z

Seems like iadd ... (ineg ty y) -> isub ... y wins over this rule.

alexcrichton · 2025-09-08T14:39:25Z

That might be fixable by adjusting the per-opcode costs perhaps? We could make arithmetic operations like iadd/isub slightly more costly than bitwise operations like band/bnot perhaps

cfallin · 2025-09-08T15:42:59Z

I'll note that the mid-end does keep both around, rather than destructively rewriting (because egraphs!), so in the future if we have a more sophisticated cost function extractor there may be cases for both -- e.g. if the slightly more expensive form uses partial results that are already computed somewhere else. Perhaps different ISAs will have different cost functions too (they should all be 1-cycle ALU ops on any reasonable machine, but maybe some combinations of instructions fold together or compressed instruction forms are available or ...).

All that said, I'm curious @bongjunj -- are you driving your exploration with some sort of overall goodness metric? In other words, are you finding any and all equivalences, or is your goal to find those that seem to simplify somehow? And for this particular one, did you see instances where it leads to useful simplifications?

(I'm not opposed at all to building up a nice database of simplifications in general; as we've sometimes said, "rules are cheap" with ISLE's DSL compiler combining their matching. Just curious where all this is going.)

github-actions · 2025-09-08T15:45:55Z

Subscribe to Label Action

cc @cfallin, @fitzgen

This issue or pull request has been labeled: "cranelift", "isle"

Thus the following users have been cc'd because of the following labels:

cfallin: isle
fitzgen: isle

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.

Learn more.

bongjunj · 2025-09-09T00:34:29Z

Just realized that this is another version of the simplification of #10979

In addition, to @cfallin's comment, all my simplification rules are inspired by LLVM InstCombine rules.
For example, this particular rule resembles the LLVM optimization of the following:

define i32 @src(i32 %A) {
  %B = or i32 %A, 123
  %C = add i32 %B, -123
  ret i32 %C
}

define i32 @tgt(i32 %A) {
  %C = and i32 %A, -124
  ret i32 %C
}

(https://alive2.llvm.org/ce/z/QY4j7V)

So basically, what I'm doing now is observe the discrepancy between the LLVM InstCombine pass and Cranelift's mid-end optimizer and then add rules to Cranelift for such missed optimization opportunities. In other words, the good metric we are looking for here is kind of "LLVM-ness". But I'm not sure how we can measure the usefulness of (with a well-established metric), or find an instance of this particular rule.

cfallin · 2025-09-09T15:38:29Z

So basically, what I'm doing now is observe the discrepancy between the LLVM InstCombine pass and Cranelift's mid-end optimizer and then add rules to Cranelift for such missed optimization opportunities. In other words, the good metric we are looking for here is kind of "LLVM-ness".

That sounds great, then! I wanted to make sure we had some ground truth indicating these rewrites could be useful, and "LLVM does it" is a very strong argument for that. Thanks for putting in this effort!

cfallin · 2025-09-09T15:42:51Z

To the immediate question of making this rule actually fire: since we already rewrite (iadd _ x (ineg _ y)) to (isub _ x y), could you rewrite the left-hand side to cascade on that, and match on (isub _ (bor _ x y) y)?

fitzgen · 2025-09-09T17:53:32Z

Small clarification on the following, because I think it is pretty important when we are talking about rewrites that aren't necessarily beneficial on their own:

I'm not opposed at all to building up a nice database of simplifications in general; as we've sometimes said, "rules are cheap" with ISLE's DSL compiler combining their matching.

Rules are cheap, but e-nodes are expensive. (At least, expensive relative to rules, and there is always the risk of accidentally expanding to exponential numbers of e-nodes, which can be subtly easy to do.)

So adding all the commutative versions of a beneficial simplification is cheap, but adding basic commutation rules for every commutative operation (e.g. a+b --> b+a and a|b --> b|a) is expensive.

Similarly, if we have an input A that matches rule r to produce B which then matches rule s to produce the final output C (A --r--> B --s--> C) then creating a "macro rule" that composes r and s (A --rs--> C) is a win in our system (from a cost perspective) since we are trading away an intermediate e-node (expensive) for an additional rule (cheap).¹

So adding rules that create new e-nodes because maybe they will be useful for some other beneficial rewrite, but aren't beneficial on their own, is something that should ultimately be approached with care. That doesn't mean we shouldn't ever do it, but we should at least put in the effort to check that there actually exists another beneficial rewrite that could fire afterwards, and make sure it isn't too general such that it will result in tons of intermediate e-nodes that might not actually lead to some other beneficial rule firing.

It would be super cool if, given an input set of basic rules, the ISLE compiler automatically derived a set of macro rules based on them. Seems like a potentially fun/fruitful research project. ↩

bongjunj added 2 commits September 8, 2025 13:09

[Cranelift] add opts and tests

56a4c81

[ISLE] add runtests

620d0ef

bongjunj requested a review from a team as a code owner September 8, 2025 13:11

bongjunj requested review from alexcrichton and removed request for a team September 8, 2025 13:11

github-actions bot added cranelift Issues related to the Cranelift code generator isle Related to the ISLE domain-specific language labels Sep 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Cranelift] fold `(or ...) + (neg ...)` to `(and ...)` #11639

[Cranelift] fold `(or ...) + (neg ...)` to `(and ...)` #11639

bongjunj commented Sep 8, 2025

Uh oh!

bongjunj commented Sep 8, 2025

Uh oh!

alexcrichton commented Sep 8, 2025

Uh oh!

cfallin commented Sep 8, 2025

Uh oh!

github-actions bot commented Sep 8, 2025

Uh oh!

bongjunj commented Sep 9, 2025

Uh oh!

cfallin commented Sep 9, 2025

Uh oh!

cfallin commented Sep 9, 2025

Uh oh!

fitzgen commented Sep 9, 2025

Uh oh!

Uh oh!

[Cranelift] fold (or ...) + (neg ...) to (and ...) #11639

Are you sure you want to change the base?

[Cranelift] fold (or ...) + (neg ...) to (and ...) #11639

Conversation

bongjunj commented Sep 8, 2025

Uh oh!

bongjunj commented Sep 8, 2025

Uh oh!

alexcrichton commented Sep 8, 2025

Uh oh!

cfallin commented Sep 8, 2025

Uh oh!

github-actions bot commented Sep 8, 2025

Subscribe to Label Action

Uh oh!

bongjunj commented Sep 9, 2025

Uh oh!

cfallin commented Sep 9, 2025

Uh oh!

cfallin commented Sep 9, 2025

Uh oh!

fitzgen commented Sep 9, 2025

Footnotes

Uh oh!

Uh oh!

[Cranelift] fold `(or ...) + (neg ...)` to `(and ...)` #11639

[Cranelift] fold `(or ...) + (neg ...)` to `(and ...)` #11639