Conversation

mdazz commented Sep 5, 2025

This commit teaches AtenToDtypeOp::fold to constant-fold dtype conversions when the operand is a splat DenseElementsAttr.

Folding follows torch's rounding behavior:

  • Bool: 0 and -0.0 → false; nonzero/NaN/±Inf → true.
  • Float → Int: round toward zero.
  • Int → Float: sign-aware, rmNearestTiesToEven.
  • Float ↔ Float: use builtin mlir::FloatType::getFloatSemantics().
  • Int ↔ Int: use zextOrTrunc / sextOrTrunc based on source signedness.

Folding is only performed when non_blocking == false, copy == false, and memory_format is None.
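
Roughly, the per-element logic looks like this (a minimal sketch with illustrative helper names, not the exact code in this PR), built on llvm::APFloat / llvm::APInt:

#include "llvm/ADT/APFloat.h"
#include "llvm/ADT/APSInt.h"

using namespace llvm;

// Float -> Bool: 0.0 and -0.0 map to false; nonzero, NaN, and ±Inf map to true.
static APInt floatToBool(const APFloat &v) {
  return APInt(/*numBits=*/1, v.isZero() ? 0 : 1);
}

// Float -> Int: round toward zero, matching torch's truncating cast.
static APInt floatToInt(const APFloat &v, unsigned dstWidth, bool dstIsSigned) {
  APSInt result(dstWidth, /*isUnsigned=*/!dstIsSigned);
  bool isExact;
  v.convertToInteger(result, APFloat::rmTowardZero, &isExact);
  return result;
}

// Int -> Float: sign-aware, round to nearest, ties to even.
static APFloat intToFloat(const APInt &v, const fltSemantics &dstSem,
                          bool srcIsSigned) {
  APFloat result(dstSem);
  result.convertFromAPInt(v, srcIsSigned, APFloat::rmNearestTiesToEven);
  return result;
}

// Float -> Float: reconvert under the destination type's semantics
// (obtained from mlir::FloatType::getFloatSemantics()).
static APFloat floatToFloat(APFloat v, const fltSemantics &dstSem) {
  bool losesInfo;
  v.convert(dstSem, APFloat::rmNearestTiesToEven, &losesInfo);
  return v;
}

// Int -> Int: extend or truncate according to the *source* signedness.
static APInt intToInt(const APInt &v, unsigned dstWidth, bool srcIsSigned) {
  return srcIsSigned ? v.sextOrTrunc(dstWidth) : v.zextOrTrunc(dstWidth);
}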

mdazz (Author) commented Sep 5, 2025

Not sure who can review; maybe you would know, @vivekkhandelwal1 @zjgarvey?

// int32 splat → float32
%int_splat = torch.vtensor.literal(dense<42> : tensor<2x3xsi32>) : !torch.vtensor<[2,3],si32>
%int6 = torch.constant.int 6 // torch.float32
// CHECK: %[[R1:.*]] = torch.vtensor.literal({{.*}} : tensor<2x3xf32>) : !torch.vtensor<[2,3],f32>
Member commented:

Can you put the actual value here? I think it will be 42.0.
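
For example (a sketch: MLIR prints f32 splats in scientific notation, so 42.0 should render as 4.200000e+01):

// CHECK: %[[R1:.*]] = torch.vtensor.literal(dense<4.200000e+01> : tensor<2x3xf32>) : !torch.vtensor<[2,3],f32>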


// int64 splat (max int32) → int32 (trunc)
%int64_splat = torch.vtensor.literal(dense<2147483647> : tensor<10xsi64>) : !torch.vtensor<[10],si64>
Member commented:

Can this value be int32max + 1 to ensure that truncation actually happens in the IR being locked down?
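
A sketch of the suggested tweak, assuming torch's wrap-around truncation semantics: 2147483648 (int32max + 1) truncates to -2147483648 in si32:

%int64_splat = torch.vtensor.literal(dense<2147483648> : tensor<10xsi64>) : !torch.vtensor<[10],si64>
// CHECK: torch.vtensor.literal(dense<-2147483648> : tensor<10xsi32>) : !torch.vtensor<[10],si32>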

// float32 splat → float64
%float32_splat = torch.vtensor.literal(dense<2.71828> : tensor<5x5xf32>) : !torch.vtensor<[5,5],f32>
%int7 = torch.constant.int 7 // torch.float64
// CHECK: %[[R4:.*]] = torch.vtensor.literal({{.*}} : tensor<5x5xf64>) : !torch.vtensor<[5,5],f64>
Member commented:

Let's capture the actual value here too, and in other such places.
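
One way to sketch that without depending on MLIR's full decimal expansion of 2.71828 in f32: use a value that is exact in f32, e.g. 2.75, so the widened f64 splat prints losslessly (the exact printed form depends on MLIR's float printer):

%float32_splat = torch.vtensor.literal(dense<2.75> : tensor<5x5xf32>) : !torch.vtensor<[5,5],f32>
// CHECK: torch.vtensor.literal(dense<2.750000e+00> : tensor<5x5xf64>) : !torch.vtensor<[5,5],f64>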

// int32 splat → float32
%int_splat = torch.vtensor.literal(dense<42> : tensor<2x3xsi32>) : !torch.vtensor<[2,3],si32>
%int6 = torch.constant.int 6 // torch.float32
// CHECK: %[[R1:.*]] = torch.vtensor.literal({{.*}} : tensor<2x3xf32>) : !torch.vtensor<[2,3],f32>
Member commented:

Since we are not locking down the values returned in the output IR, I think we should also add CHECK-NOT: torch.aten.to.dtype to ensure that the op is actually folded.
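
For example (sketch), immediately after matching the folded literal:

// CHECK: %[[R1:.*]] = torch.vtensor.literal({{.*}} : tensor<2x3xf32>) : !torch.vtensor<[2,3],f32>
// CHECK-NOT: torch.aten.to.dtype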

@@ -1762,6 +1762,78 @@ func.func @torch.aten.to.dtype$no_fold$unk_dtype(%arg0: !torch.tensor) -> !torch.tensor
return %0 : !torch.tensor
}

// CHECK-LABEL: @torch.aten.to.dtype$fold_splat(
Member commented:

Can you add some e2e tests to ensure that torch's rounding logic is accurately captured in this implementation?
Also, please fix the CI failures; we cannot merge until the CI pipelines are green.
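
For reference, an e2e case could look roughly like this (a sketch following the existing torch-mlir e2e test-suite pattern; the module name is a placeholder). A negative fractional splat pins down the round-toward-zero behavior (-2.7 -> -2):

import torch

from torch_mlir_e2e_test.framework import TestUtils
from torch_mlir_e2e_test.registry import register_test_case
from torch_mlir_e2e_test.annotations import annotate_args, export

class ToDtypeConstFoldModule(torch.nn.Module):
    def __init__(self):
        super().__init__()

    @export
    @annotate_args([None])
    def forward(self):
        # Splat literal; the negative fraction exercises round-toward-zero.
        return torch.full((2, 3), -2.7).to(torch.int32)

@register_test_case(module_factory=lambda: ToDtypeConstFoldModule())
def ToDtypeConstFoldModule_basic(module, tu: TestUtils):
    module.forward()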
