-
Notifications
You must be signed in to change notification settings - Fork 13.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[InstSimplify] Handle trunc to i1 in Select with bit test folds. #122944
Conversation
@llvm/pr-subscribers-llvm-transforms @llvm/pr-subscribers-llvm-analysis Author: Andreas Jonson (andjo403) ChangesPreparation to handle results in no changes in local run of llvm-opt-benchmark Full diff: https://github.com/llvm/llvm-project/pull/122944.diff 1 Files Affected:
diff --git a/llvm/lib/Analysis/InstructionSimplify.cpp b/llvm/lib/Analysis/InstructionSimplify.cpp
index d69747e30f884d..05dedd219c2667 100644
--- a/llvm/lib/Analysis/InstructionSimplify.cpp
+++ b/llvm/lib/Analysis/InstructionSimplify.cpp
@@ -4612,14 +4612,16 @@ static Value *simplifyCmpSelOfMaxMin(Value *CmpLHS, Value *CmpRHS,
return nullptr;
}
-/// An alternative way to test if a bit is set or not uses sgt/slt instead of
-/// eq/ne.
-static Value *simplifySelectWithFakeICmpEq(Value *CmpLHS, Value *CmpRHS,
- CmpPredicate Pred, Value *TrueVal,
- Value *FalseVal) {
- if (auto Res = decomposeBitTestICmp(CmpLHS, CmpRHS, Pred))
- return simplifySelectBitTest(TrueVal, FalseVal, Res->X, &Res->Mask,
- Res->Pred == ICmpInst::ICMP_EQ);
+/// An alternative way to test if a bit is set or not uses e.g. sgt/slt instead
+/// of eq/ne.
+static Value *simplifySelectWithBitTest(Value *CondVal, Value *TrueVal,
+ Value *FalseVal) {
+
+ if (auto *ICmp = dyn_cast<ICmpInst>(CondVal))
+ if (auto Res = decomposeBitTestICmp(
+ ICmp->getOperand(0), ICmp->getOperand(1), ICmp->getPredicate()))
+ return simplifySelectBitTest(TrueVal, FalseVal, Res->X, &Res->Mask,
+ Res->Pred == ICmpInst::ICMP_EQ);
return nullptr;
}
@@ -4728,11 +4730,6 @@ static Value *simplifySelectWithICmpCond(Value *CondVal, Value *TrueVal,
return FalseVal;
}
- // Check for other compares that behave like bit test.
- if (Value *V =
- simplifySelectWithFakeICmpEq(CmpLHS, CmpRHS, Pred, TrueVal, FalseVal))
- return V;
-
// If we have a scalar equality comparison, then we know the value in one of
// the arms of the select. See if substituting this value into the arm and
// simplifying the result yields the same value as the other arm.
@@ -4984,6 +4981,9 @@ static Value *simplifySelectInst(Value *Cond, Value *TrueVal, Value *FalseVal,
simplifySelectWithICmpCond(Cond, TrueVal, FalseVal, Q, MaxRecurse))
return V;
+ if (Value *V = simplifySelectWithBitTest(Cond, TrueVal, FalseVal))
+ return V;
+
if (Value *V = simplifySelectWithFCmp(Cond, TrueVal, FalseVal, Q, MaxRecurse))
return V;
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LG
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In terms of general approach, could you please land #122179, add trunc support to it, and then update folds together with added test coverage for the change?
fdf24cc
to
39fea1f
Compare
As there is a lot of regressions when adding handling for trunk in foldLogOpOfMaskedICmps I instead added the update of decomposeBitTest in this PR. Still results in no changes in local run of llvm-opt-benchmark, this can be due to trunc to i1 is not that common yet, but also at least the test bittest_trunc_or is folded to the same with instcombine as this InstSimplify fold. |
Result.X = X; | ||
Result.Mask = Result.Mask.zext(X->getType()->getScalarSizeInBits()); | ||
Result.C = Result.C.zext(X->getType()->getScalarSizeInBits()); | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Think the LookThruTrunc
logic is untested.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good catch test added.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What is the motivation for LookThruTrunc
? Doesn't (trunc (trunc X))
always get folded to (trunc X)
in InstCombine?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There seems to be no motivation to handle the LookThruTrunc for the trunc case as you say it is folded to one trunc, I did not think all the way and thought that an extra use of the first trunc stopped that fold but it did not.
Hmm have tested this with a local run of llvm-opt-benchmark after folding fold icmp ne (and X, 1), 0 --> trunc X to N x i1 and there is still no changes with this fold so shall I abandon this commit? |
when I did the analysis in #119196 (comment) I only searched in the code after folds with the bit test pattern I did not test if any of the patterns happen in llvm-opt-benchmark, but maybe possible that some of the other folds will result in this pattern. |
2c3973b
to
694932a
Compare
LGTM |
results in no changes in local run of llvm-opt-benchmark
Proof: https://alive2.llvm.org/ce/z/Jncqb2