Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions .ci/dockerfiles/Dockerfile.gpu_test
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@
# docker run --gpus all --privileged -it nixl-gpu-test
#
# Build arguments:
# BASE_IMAGE: Base NVIDIA PyTorch image (default: nvcr.io/nvidia/pytorch:25.02-py3)
# BASE_IMAGE: Base NVIDIA PyTorch image (default: nvcr.io/nvidia/pytorch:25.06-py3)
# _UID: User ID for the non-root user (default: 148069)
# _GID: Group ID for the user (default: 30)
# _LOGIN: Username (default: svc-nixl)
Expand All @@ -22,7 +22,7 @@
# WORKSPACE: Workspace directory path
#

ARG BASE_IMAGE=nvcr.io/nvidia/pytorch:25.02-py3
ARG BASE_IMAGE=nvcr.io/nvidia/pytorch:25.06-py3

FROM ${BASE_IMAGE}

Expand Down
4 changes: 2 additions & 2 deletions .ci/jenkins/lib/build-matrix.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@
# Key Components:
# - Job Configuration: Defines timeout, failure behavior, and Kubernetes resources
# - Docker Images: Specifies the container images used for different build stages
# - PyTorch images (24.10 and 25.02) for building and testing
# - PyTorch images (24.10 and 25.06) for building and testing
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

24.10 is based on CUDA 12.6 which we officially do not support anymore, minimum is 12.8. It would be great if we could find another base image for ubuntu 22.04. nvcr.io/nvidia/cuda:12.8.0-devel-ubuntu22.04 might work

# - Podman image for container builds
# - Matrix Axes: Defines build variations (currently x86_64 architecture)
# - Build Steps: Sequential steps for building, testing, and container creation
Expand Down Expand Up @@ -34,7 +34,7 @@ kubernetes:
requests: "{memory: 8Gi, cpu: 8000m}"

runs_on_dockers:
- { name: "ubuntu24.04-pytorch", url: "nvcr.io/nvidia/pytorch:25.02-py3" }
- { name: "ubuntu24.04-pytorch", url: "nvcr.io/nvidia/pytorch:25.06-py3" }
- { name: "ubuntu22.04-pytorch", url: "nvcr.io/nvidia/pytorch:24.10-py3" }
- { name: "podman-v5.0.2", url: "quay.io/podman/stable:v5.0.2", category: 'tool', privileged: true }

Expand Down
2 changes: 1 addition & 1 deletion .ci/jenkins/lib/test-matrix.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ runs_on_agents:
matrix:
axes:
image:
- nvcr.io/nvidia/pytorch:25.02-py3
- nvcr.io/nvidia/pytorch:25.06-py3
arch:
- x86_64

Expand Down
4 changes: 2 additions & 2 deletions contrib/aws-efa/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -65,9 +65,9 @@ The AWS test script:

## Container Image

The script uses the container image: `nvcr.io/nvidia/pytorch:25.02-py3`
The script uses the container image: `nvcr.io/nvidia/pytorch:25.06-py3`
You can override this by setting the `CONTAINER_IMAGE` environment variable:

```bash
export CONTAINER_IMAGE="your-custom-image:tag"
```
```
2 changes: 1 addition & 1 deletion contrib/aws-efa/aws_job_def.json
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@
"imagePullSecrets": [],
"containers": [
{
"image": "nvcr.io/nvidia/pytorch:25.02-py3",
"image": "nvcr.io/nvidia/pytorch:25.06-py3",
"command": [
"/bin/bash",
"-c",
Expand Down
4 changes: 2 additions & 2 deletions contrib/aws-efa/aws_test.sh
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ usage() {
echo " GITHUB_REPOSITORY - GitHub repository (e.g., \"ai-dynamo/nixl\")"
echo ""
echo "Optional environment variables:"
echo " CONTAINER_IMAGE - Container image to use (default: nvcr.io/nvidia/pytorch:25.02-py3)"
echo " CONTAINER_IMAGE - Container image to use (default: nvcr.io/nvidia/pytorch:25.06-py3)"
echo " TEST_TIMEOUT - Timeout for test execution in minutes"
exit 1
}
Expand All @@ -47,7 +47,7 @@ if [ -z "$GITHUB_REF" ] || [ -z "$GITHUB_SERVER_URL" ] || [ -z "$GITHUB_REPOSITO
fi

test_cmd="$1"
export CONTAINER_IMAGE=${CONTAINER_IMAGE:-"nvcr.io/nvidia/pytorch:25.02-py3"}
export CONTAINER_IMAGE=${CONTAINER_IMAGE:-"nvcr.io/nvidia/pytorch:25.06-py3"}

# Set Git checkout command based on GITHUB_REF
case "$GITHUB_REF" in
Expand Down