[ET-VK] Introduce custom op correctness + speed testing suite & add vulkan operator testing to CI #13835

pytorchbot · 2025-08-30T13:03:08Z

This PR was created by the merge bot to help merge the original PR into the main branch.
ghstack PR number: #13815 by @SS-JIA
^ Please use this as the source of truth for the PR details, comments, and reviews
ghstack PR base: https://github.com/pytorch/executorch/tree/gh/SS-JIA/315/base
ghstack PR head: https://github.com/pytorch/executorch/tree/gh/SS-JIA/315/head
Merge bot PR base: https://github.com/pytorch/executorch/tree/gh/SS-JIA/314/orig
Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/SS-JIA/315/orig
@diff-train-skip-merge

cc @SS-JIA @manuelcandales @cbilgin

…upgrade glslc

Pull Request resolved: #13814

## Motivation

Prepare for shaders that will use accelerated int8 dot product functions from GLSL extensions, e.g. `dotPacked4x8AccSatEXT`.

## Changes

* Query for support of the shader integer dot product extension when creating the VkPhysicalDevice.
* Request the shader integer dot product extension when creating the VkDevice.
* Provide APIs to check whether the extension is available in the current runtime.

ghstack-source-id: 306632732
@exported-using-ghexport

Differential Revision: [D81323427](https://our.internmc.facebook.com/intern/diff/D81323427/)
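For readers unfamiliar with the operation named in the motivation, the following is a minimal CPU-side sketch of what a signed packed 4x8 dot product with saturating accumulate is generally understood to compute. It is not code from this PR: the function name `dot_packed_4x8_acc_sat` is hypothetical, and the semantics (four signed 8-bit lanes packed into 32 bits, products summed at full precision, then added to the accumulator with saturation) are an assumption based on the GLSL integer dot product extension and should be checked against the specification before being used as a reference.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical CPU emulation, assuming signed 8-bit lanes packed
// little-endian into each 32-bit operand. Not part of the PR.
int32_t dot_packed_4x8_acc_sat(uint32_t a, uint32_t b, int32_t acc) {
  // Accumulate in 64 bits so the intermediate sum cannot overflow.
  int64_t sum = acc;
  for (int i = 0; i < 4; ++i) {
    // Extract lane i of each operand and reinterpret it as signed.
    const int8_t ai = static_cast<int8_t>((a >> (8 * i)) & 0xFFu);
    const int8_t bi = static_cast<int8_t>((b >> (8 * i)) & 0xFFu);
    sum += static_cast<int64_t>(ai) * static_cast<int64_t>(bi);
  }
  // Saturate the final result to the int32 range.
  const int64_t lo = std::numeric_limits<int32_t>::min();
  const int64_t hi = std::numeric_limits<int32_t>::max();
  if (sum < lo) {
    sum = lo;
  }
  if (sum > hi) {
    sum = hi;
  }
  return static_cast<int32_t>(sum);
}
```

Under these assumptions, lanes (1, 2, 3, 4) and (5, 6, 7, 8) with an accumulator of 10 give 5 + 12 + 21 + 32 + 10 = 80, i.e. `dot_packed_4x8_acc_sat(0x04030201u, 0x08070605u, 10)` returns 80.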

…ulkan operator testing to CI

Pull Request resolved: #13815

## Motivation

Provide an easy way to test and benchmark custom operators when developing them.

## Changes

Introduces a custom op test suite under `backends/vulkan/test/custom_ops`. Each operator will have its own test file, as seen in the next diff. `utils.[h|cpp]` define common utilities that can be used across test files.

To facilitate prototyping, prototype shaders and C++ host code can be placed under the `impl/` and `glsl` folders.

Output of the test binary looks like:

```
=== Compute Shader Performance Benchmark ===
Add Operation Prototyping Framework
----------------------------------------------------------------------
Executing 32 test cases for Add
----------------------------------------------------------------------
Add_1x64x64_Texture3D_Float    [1x64x64]     3.094 μs  1.324 GFLOP/s  PASSED
Add_1x64x64_Texture3D_Half     [1x64x64]     2.574 μs  1.591 GFLOP/s  SKIPPED
Add_1x64x64_Buffer_Float       [1x64x64]     3.084 μs  1.328 GFLOP/s  PASSED
Add_1x64x64_Buffer_Half        [1x64x64]     2.668 μs  1.535 GFLOP/s  SKIPPED
Add_1x128x128_Texture3D_Float  [1x128x128]   6.001 μs  2.730 GFLOP/s  PASSED
Add_1x128x128_Texture3D_Half   [1x128x128]   4.004 μs  4.092 GFLOP/s  SKIPPED
Add_1x128x128_Buffer_Float     [1x128x128]   6.074 μs  2.698 GFLOP/s  PASSED
Add_1x128x128_Buffer_Half      [1x128x128]   5.112 μs  3.205 GFLOP/s  SKIPPED
Add_1x256x256_Texture3D_Float  [1x256x256]  17.852 μs  3.671 GFLOP/s  PASSED
Add_1x256x256_Texture3D_Half   [1x256x256]  10.057 μs  6.517 GFLOP/s  SKIPPED
Add_1x256x256_Buffer_Float     [1x256x256]  19.027 μs  3.444 GFLOP/s  PASSED
Add_1x256x256_Buffer_Half      [1x256x256]  15.330 μs  4.275 GFLOP/s  SKIPPED
Add_1x512x512_Texture3D_Float  [1x512x512]  48.292 μs  5.428 GFLOP/s  PASSED
Add_1x512x512_Texture3D_Half   [1x512x512]  26.832 μs  9.770 GFLOP/s  SKIPPED
Add_1x512x512_Buffer_Float     [1x512x512]  48.828 μs  5.369 GFLOP/s  PASSED
Add_1x512x512_Buffer_Half      [1x512x512]  48.308 μs  5.427 GFLOP/s  SKIPPED
Add_1x1x1024_Texture3D_Float   [1x1x1024]    2.376 μs  0.431 GFLOP/s  PASSED
Add_1x1x1024_Texture3D_Half    [1x1x1024]    2.215 μs  0.462 GFLOP/s  SKIPPED
Add_1x1x1024_Buffer_Float      [1x1x1024]    2.402 μs  0.426 GFLOP/s  PASSED
Add_1x1x1024_Buffer_Half       [1x1x1024]    2.304 μs  0.445 GFLOP/s  SKIPPED
Add_1x1024x1_Texture3D_Float   [1x1024x1]    6.120 μs  0.167 GFLOP/s  PASSED
Add_1x1024x1_Texture3D_Half    [1x1024x1]    6.245 μs  0.164 GFLOP/s  SKIPPED
Add_1x1024x1_Buffer_Float      [1x1024x1]    2.392 μs  0.428 GFLOP/s  PASSED
Add_1x1024x1_Buffer_Half       [1x1024x1]    2.304 μs  0.445 GFLOP/s  SKIPPED
Add_32x32x32_Texture3D_Float   [32x32x32]   10.249 μs  3.197 GFLOP/s  PASSED
Add_32x32x32_Texture3D_Half    [32x32x32]    6.583 μs  4.978 GFLOP/s  SKIPPED
Add_32x32x32_Buffer_Float      [32x32x32]   10.468 μs  3.130 GFLOP/s  PASSED
Add_32x32x32_Buffer_Half       [32x32x32]    8.481 μs  3.864 GFLOP/s  SKIPPED
Add_16x128x64_Texture3D_Float  [16x128x64]  26.000 μs  5.041 GFLOP/s  PASSED
Add_16x128x64_Texture3D_Half   [16x128x64]  17.841 μs  7.347 GFLOP/s  SKIPPED
Add_16x128x64_Buffer_Float     [16x128x64]  28.917 μs  4.533 GFLOP/s  PASSED
Add_16x128x64_Buffer_Half      [16x128x64]  28.792 μs  4.552 GFLOP/s  SKIPPED
```

`SKIPPED` means that correctness checking is not performed on that test case. This usually happens in one of the following cases:

* Input/output dtype is fp16. There is no fp16 dtype support in the reference calculation functions.
* Input sizes are too big. Since the reference calculation functions are implemented in a naive manner, calculating reference data may take too long for large inputs. Larger test cases are usually meant to test performance, not correctness.

ghstack-source-id: 306632731
@exported-using-ghexport

Differential Revision: [D81323426](https://our.internmc.facebook.com/intern/diff/D81323426/)

pytorch-bot · 2025-08-30T13:03:12Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13835

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

ROCm MI2xx CI/CD workflows failing due to: download from https://api.github.com/repos/pytorch/pytorch timed out.

❌ 6 New Failures, 19 Pending

As of commit 760d3a5 with merge base e2098f8:

NEW FAILURES - The following jobs have failed:

pull / unittest / linux / linux-job (gh)
RuntimeError: Command docker exec -t b718eb9cbd076531b2b0b91c51c63eaf1142b766f18bfabd97124a8fa23bd292 /exec failed with exit code 3
pull / unittest / macos / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 3
pull / unittest-arm-backend-with-no-fvp (test_pytest_models) / linux-job (gh)
RuntimeError: Command docker exec -t 89f4f4e03dda01113e2e54359a0c0b6de48a4a84f4daa1352ea79ae3305df264 /exec failed with exit code 3
pull / unittest-arm-backend-with-no-fvp (test_pytest_ops) / linux-job (gh)
RuntimeError: Command docker exec -t 39319715926af2fd604fe8394b63706e93708f4824e17168c270e5109f932a07 /exec failed with exit code 3
pull / unittest-editable / linux / linux-job (gh)
RuntimeError: Command docker exec -t 8c76e0595a4866c510d3ce2483360a5aeb92a27cabcbc23c9e1c0870dd0b3167 /exec failed with exit code 3
pull / unittest-editable / macos / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 3

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ssjia added 2 commits August 29, 2025 17:33

pytorchbot requested review from larryliu0820, kirklandsign and SS-JIA as code owners August 30, 2025 13:03

pytorch-bot bot added the "module: vulkan" label (Issues related to the Vulkan delegate and code under backends/vulkan/) Aug 30, 2025

meta-cla bot added the "CLA Signed" label (This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.) Aug 30, 2025

Base automatically changed from gh/SS-JIA/314/orig to main August 30, 2025 13:18

SS-JIA approved these changes Aug 30, 2025

SS-JIA merged commit 1520f9f into main Aug 30, 2025
107 of 120 checks passed

SS-JIA deleted the gh/SS-JIA/315/orig branch August 30, 2025 13:19
