[fp8] Only assert when CUDA is available. #2590

Stonepia · 2025-07-24T02:12:52Z

This commit performs the capability checks only when CUDA is available, so that we have better support for third-party devices like Intel GPU.

pytorch-bot · 2025-07-24T02:12:55Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2590

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

liangan1 · 2025-07-24T02:25:39Z

torchao/float8/inference.py

-
-        assert is_sm_at_least_89() or is_MI300(), (
-            "Float8 dynamic quantization requires CUDA compute capability ≥8.9 or MI300+."
-        )


This change will impact other device, such as cpu or any unknow device. Suggest to add common utlis function to judge the fp8 capability and apply it to all fp8 related changes. This function should only ensure the CUDA compute capability ≥8.9 or MI300+ and XPU device is available now.

This commit performs the capability checks only when CUDA is available, so that we have better support for third-party devices like Intel GPU.

liangan1 · 2025-07-24T05:16:09Z

Can you show the accuracy result for dq-fp8 for both CUDA and XPU based on the LLama-3.1-8B?

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 24, 2025

Stonepia changed the title ~~Only assert when CUDA is available.~~ [fp8] Only assert when CUDA is available. Jul 24, 2025

Stonepia marked this pull request as draft July 24, 2025 02:21

liangan1 suggested changes Jul 24, 2025

View reviewed changes

Stonepia added 2 commits July 24, 2025 02:31

Only assert when CUDA is available.

351bf1c

This commit performs the capability checks only when CUDA is available, so that we have better support for third-party devices like Intel GPU.

Add check for float8_semi_sparse_weight_transform

e51c650

Unify the _check_hardware_support() API

c76cf7e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[fp8] Only assert when CUDA is available. #2590

[fp8] Only assert when CUDA is available. #2590

Stonepia commented Jul 24, 2025

Uh oh!

pytorch-bot bot commented Jul 24, 2025

Uh oh!

liangan1 Jul 24, 2025

Uh oh!

liangan1 commented Jul 24, 2025 •

edited

Loading

Uh oh!

Uh oh!

[fp8] Only assert when CUDA is available. #2590

Are you sure you want to change the base?

[fp8] Only assert when CUDA is available. #2590

Conversation

Stonepia commented Jul 24, 2025

Uh oh!

pytorch-bot bot commented Jul 24, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2590

Uh oh!

liangan1 Jul 24, 2025

Choose a reason for hiding this comment

Uh oh!

liangan1 commented Jul 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

liangan1 commented Jul 24, 2025 •

edited

Loading