Skip to content

Conversation

DiweiSun
Copy link

Enabling CI testing for the torchao project on the Intel XPU (GPU) platform to ensure functional correctness, performance consistency, and long-term compatibility as both torchao and XPU support evolve.

Copy link

pytorch-bot bot commented Aug 20, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2814

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit e8936cb with merge base 2db4c76 (image):

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 20, 2025
Comment on lines 38 to 66
- name: Clean all stopped docker containers
if: always()
shell: bash
run: |
# Prune all stopped containers.
# If other runner is pruning on this node, will skip.
nprune=$(ps -ef | grep -c "docker container prune")
if [[ $nprune -eq 1 ]]; then
docker container prune -f
fi
- name: Runner health check GPU count
if: always()
shell: bash
run: |
ngpu=$(timeout 30 clinfo -l | grep -c -E 'Device' || true)
msg="Please file an issue on pytorch/ao reporting the faulty runner. Include a link to the runner logs so the runner can be identified"
if [[ $ngpu -eq 0 ]]; then
echo "Error: Failed to detect any GPUs on the runner"
echo "$msg"
exit 1
fi
- name: Use following to pull public copy of the image
id: print-ghcr-mirror
shell: bash
run: |
echo "docker pull ${DOCKER_IMAGE}"
docker pull ${DOCKER_IMAGE}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ported done. Please kindly help review.

if-no-files-found: ignore
path: ./**/core.[1-9]*

- name: Teardown XPU
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can reuse the action in pytorch directly

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes, this is literally ported from pytorch

@liangan1 liangan1 mentioned this pull request Sep 2, 2025
8 tasks
@liangan1 liangan1 added topic: for developers Use this tag if this PR is mainly developer facing ci labels Sep 4, 2025
@DiweiSun DiweiSun changed the title Molly/enable xpu ci [CI][XPU] enable unit test for XPU device Sep 8, 2025
- ciflow/xpu/*
pull_request:
branches:
- main
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remove pull-request after review

@chuanqi129
Copy link
Contributor

@pytorchbot label "ciflow/xpu"

Copy link

pytorch-bot bot commented Sep 10, 2025

Didn't find following labels among repository labels: ciflow/xpu

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ci CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: for developers Use this tag if this PR is mainly developer facing
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants