Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[NFC] Update test for PR #118055 #122696

Merged
merged 1 commit into from
Jan 22, 2025

Conversation

sushgokh
Copy link
Contributor

This patch updates the motivating test for the above PR so that it does not conflict with urem PR #122236

@llvmbot
Copy link
Member

llvmbot commented Jan 13, 2025

@llvm/pr-subscribers-llvm-transforms

Author: Sushant Gokhale (sushgokh)

Changes

This patch updates the motivating test for the above PR so that it does not conflict with urem PR #122236


Full diff: https://github.com/llvm/llvm-project/pull/122696.diff

1 Files Affected:

  • (modified) llvm/test/Transforms/SLPVectorizer/AArch64/div.ll (+6-5)
diff --git a/llvm/test/Transforms/SLPVectorizer/AArch64/div.ll b/llvm/test/Transforms/SLPVectorizer/AArch64/div.ll
index 57c877a58d5c0e..aa702d95172490 100644
--- a/llvm/test/Transforms/SLPVectorizer/AArch64/div.ll
+++ b/llvm/test/Transforms/SLPVectorizer/AArch64/div.ll
@@ -558,8 +558,8 @@ define <2 x i32> @vectorize_sdiv_v2i32(<2 x i32> %a, <2 x i32> %x, <2 x i32> %y,
 ; NO-SVE-SAME: <2 x i32> [[A:%.*]], <2 x i32> [[X:%.*]], <2 x i32> [[Y:%.*]], <2 x i32> [[Z:%.*]]) #[[ATTR0]] {
 ; NO-SVE-NEXT:    [[A0:%.*]] = extractelement <2 x i32> [[A]], i64 0
 ; NO-SVE-NEXT:    [[A1:%.*]] = extractelement <2 x i32> [[A]], i64 1
-; NO-SVE-NEXT:    [[TMP1:%.*]] = sdiv i32 [[A0]], 2
-; NO-SVE-NEXT:    [[TMP2:%.*]] = sdiv i32 [[A1]], 4
+; NO-SVE-NEXT:    [[TMP1:%.*]] = sdiv i32 [[A0]], [[A1]]
+; NO-SVE-NEXT:    [[TMP2:%.*]] = sdiv i32 [[A1]], [[A0]]
 ; NO-SVE-NEXT:    [[X0:%.*]] = extractelement <2 x i32> [[X]], i64 0
 ; NO-SVE-NEXT:    [[X1:%.*]] = extractelement <2 x i32> [[X]], i64 1
 ; NO-SVE-NEXT:    [[TMP3:%.*]] = add i32 [[TMP1]], [[X0]]
@@ -578,7 +578,8 @@ define <2 x i32> @vectorize_sdiv_v2i32(<2 x i32> %a, <2 x i32> %x, <2 x i32> %y,
 ;
 ; SVE-LABEL: define <2 x i32> @vectorize_sdiv_v2i32(
 ; SVE-SAME: <2 x i32> [[A:%.*]], <2 x i32> [[X:%.*]], <2 x i32> [[Y:%.*]], <2 x i32> [[Z:%.*]]) #[[ATTR0]] {
-; SVE-NEXT:    [[TMP1:%.*]] = sdiv <2 x i32> [[A]], <i32 2, i32 4>
+; SVE-NEXT:    [[TMP5:%.*]] = shufflevector <2 x i32> [[A]], <2 x i32> poison, <2 x i32> <i32 1, i32 0>
+; SVE-NEXT:    [[TMP1:%.*]] = sdiv <2 x i32> [[A]], [[TMP5]]
 ; SVE-NEXT:    [[TMP2:%.*]] = add <2 x i32> [[TMP1]], [[X]]
 ; SVE-NEXT:    [[TMP3:%.*]] = sub <2 x i32> [[TMP2]], [[Y]]
 ; SVE-NEXT:    [[TMP4:%.*]] = mul <2 x i32> [[TMP3]], [[Z]]
@@ -587,8 +588,8 @@ define <2 x i32> @vectorize_sdiv_v2i32(<2 x i32> %a, <2 x i32> %x, <2 x i32> %y,
 {
   %a0 = extractelement <2 x i32> %a, i64 0
   %a1 = extractelement <2 x i32> %a, i64 1
-  %1 = sdiv i32 %a0, 2
-  %2 = sdiv i32 %a1, 4
+  %1 = sdiv i32 %a0, %a1
+  %2 = sdiv i32 %a1, %a0
   %x0 = extractelement <2 x i32> %x, i64 0
   %x1 = extractelement <2 x i32> %x, i64 1
   %3 = add i32 %1, %x0

Comment on lines 591 to 592
%1 = sdiv i32 %a0, %a1
%2 = sdiv i32 %a1, %a0
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we keep the the test with constants and add a new one with extracts?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sure, will do that. Thanks

@sushgokh sushgokh force-pushed the update_test_alter_div_cost branch from 894d877 to af5a289 Compare January 13, 2025 12:02
@@ -553,8 +553,62 @@ define <4 x i32> @slp_v4i32_Op1_unknown_Op2_const_pow2(<4 x i32> %a)
}

; computes (a/const + x - y) * z
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This needs to move to the test with constants and there we need an update.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sorry, will do so.

%1 = sdiv i32 %a0, %a1
%2 = sdiv i32 %a1, %a0
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does this intentionally use the extracts from the opposite lane? This will have extra overhead than just having extracted divisors that won't require a shuffle

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will have extra overhead

I agree. The idea is just to show that currently, the vector sdiv cost is being overestimated and that is being improved.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cost of shuffle is extraneous to actual sdiv <2 x i32> cost and will be added to the vector cost by SLP seperately. I hope this answers your question

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

right but it makes the test more complicated than it needs to be and it would probably be goo to have one without the need for shuffles (by passing another argument perhaps or 2 for the divisor)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done. Updated test does not require shuffles.

Copy link
Contributor

@fhahn fhahn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Didn't mean to approve, marking as changes requested as per comments

@sushgokh sushgokh force-pushed the update_test_alter_div_cost branch 2 times, most recently from eb43bdf to 9effba7 Compare January 15, 2025 07:13
@sushgokh
Copy link
Contributor Author

ping

@sushgokh sushgokh force-pushed the update_test_alter_div_cost branch from 9effba7 to 653c51c Compare January 22, 2025 11:16
Copy link
Contributor

@fhahn fhahn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks.

Might be good so also add [SLP] to the title of the PR

@sushgokh sushgokh merged commit c6c6475 into llvm:main Jan 22, 2025
5 of 7 checks passed
@sushgokh sushgokh deleted the update_test_alter_div_cost branch January 23, 2025 05:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants