Skip to content

Conversation

DiweiSun
Copy link

Enabling CI testing for the torchao project on the Intel XPU (GPU) platform to ensure functional correctness, performance consistency, and long-term compatibility as both torchao and XPU support evolve.

Copy link

pytorch-bot bot commented Aug 20, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2814

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 50e56ec with merge base 2db4c76 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 20, 2025
Comment on lines 38 to 66
- name: Clean all stopped docker containers
if: always()
shell: bash
run: |
# Prune all stopped containers.
# If other runner is pruning on this node, will skip.
nprune=$(ps -ef | grep -c "docker container prune")
if [[ $nprune -eq 1 ]]; then
docker container prune -f
fi

- name: Runner health check GPU count
if: always()
shell: bash
run: |
ngpu=$(timeout 30 clinfo -l | grep -c -E 'Device' || true)
msg="Please file an issue on pytorch/ao reporting the faulty runner. Include a link to the runner logs so the runner can be identified"
if [[ $ngpu -eq 0 ]]; then
echo "Error: Failed to detect any GPUs on the runner"
echo "$msg"
exit 1
fi

- name: Use following to pull public copy of the image
id: print-ghcr-mirror
shell: bash
run: |
echo "docker pull ${DOCKER_IMAGE}"
docker pull ${DOCKER_IMAGE}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ported done. Please kindly help review.

if-no-files-found: ignore
path: ./**/core.[1-9]*

- name: Teardown XPU
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can reuse the action in pytorch directly

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes, this is literally ported from pytorch

@liangan1 liangan1 mentioned this pull request Sep 2, 2025
8 tasks
@liangan1 liangan1 added topic: for developers Use this tag if this PR is mainly developer facing ci labels Sep 4, 2025
@DiweiSun DiweiSun changed the title Molly/enable xpu ci [CI][XPU] enable unit test for XPU device Sep 8, 2025
- ciflow/xpu/*
pull_request:
branches:
- main
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remove pull-request after review

@chuanqi129
Copy link
Contributor

@pytorchbot label "ciflow/xpu"

Copy link

pytorch-bot bot commented Sep 10, 2025

Didn't find following labels among repository labels: ciflow/xpu

@liangan1
Copy link
Collaborator

@pytorchbot label "ciflow/xpu"

@pytorch-bot pytorch-bot bot added the ciflow/xpu label used to trigger xpu CI jobs label Sep 15, 2025
Copy link

pytorch-bot bot commented Sep 15, 2025

Unknown label ciflow/xpu.
Currently recognized labels are

  • ciflow/benchmark
  • ciflow/tutorials
  • ciflow/rocm
  • ciflow/4xh100

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ci ciflow/xpu label used to trigger xpu CI jobs CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: for developers Use this tag if this PR is mainly developer facing
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants