Skip to content

Conversation

@sduvvuri1603
Copy link
Contributor

@sduvvuri1603 sduvvuri1603 commented Dec 11, 2025

Description of your changes:

  • add a shared cleanup step to integration-tests-v1, kfp-webhooks, and e2e-test-frontend so we free disk and skip log collection when the kubeflow namespace never comes up; this unblocks the disk-exhaustion regressions seen in feat: Add pipeline run parallelism config #12442

  • harden .github/actions/check-artifact-exists so GitHub’s 403 pagination limit is treated as “artifact not found” rather than crashing the job

  • fix our Docker builds (backend/Dockerfile, test/release/Dockerfile.release) to fetch Argo CLI and git-cliff with Accept: application/octet-stream and then extract, avoiding HTML responses that were breaking the unpack step

@google-oss-prow
Copy link

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@google-oss-prow
Copy link

@sduvvuri1603: No presubmit jobs available for kubeflow/pipelines@master

In response to this:

/test all

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@sduvvuri1603
Copy link
Contributor Author

/retest

@google-oss-prow google-oss-prow bot added size/XS and removed size/M labels Dec 11, 2025
@sduvvuri1603 sduvvuri1603 force-pushed the fix/kind-workflow-cleanup branch 2 times, most recently from a1d11c7 to e64b119 Compare December 11, 2025 16:40
@google-oss-prow google-oss-prow bot added size/S and removed size/XS labels Dec 11, 2025
@sduvvuri1603 sduvvuri1603 force-pushed the fix/kind-workflow-cleanup branch from e64b119 to 32b002e Compare December 11, 2025 16:50
@google-oss-prow google-oss-prow bot added size/M and removed size/S labels Dec 11, 2025
@sduvvuri1603 sduvvuri1603 force-pushed the fix/kind-workflow-cleanup branch 2 times, most recently from 494471f to 6433f30 Compare December 11, 2025 17:06
@google-oss-prow google-oss-prow bot added size/S and removed size/M labels Dec 11, 2025
@sduvvuri1603 sduvvuri1603 force-pushed the fix/kind-workflow-cleanup branch from 6433f30 to de4fabd Compare December 11, 2025 17:12
@google-oss-prow
Copy link

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign chensun for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@google-oss-prow google-oss-prow bot added size/M and removed size/S labels Dec 11, 2025
@sduvvuri1603 sduvvuri1603 force-pushed the fix/kind-workflow-cleanup branch from de4fabd to f505767 Compare December 11, 2025 17:21
@sduvvuri1603 sduvvuri1603 force-pushed the fix/kind-workflow-cleanup branch from f505767 to 3a8389e Compare December 11, 2025 17:34
@sduvvuri1603
Copy link
Contributor Author

/retest

@sduvvuri1603 sduvvuri1603 marked this pull request as ready for review December 11, 2025 19:09
params['page'] = page
response = requests.get(url, headers=headers, params=params)

if response.status_code == requests.codes.forbidden:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice catch! thanks for fixing it
I am assuming 403 is when we hit some type of api rate limit?

if: ${{ steps.deploy.outcome != 'success' || steps.forward-frontend-port.outcome != 'success' || steps.tests.outcome != 'success' }}
run: |
./.github/resources/scripts/collect-logs.sh --ns kubeflow --output /tmp/tmp_pod_log.txt
if kubectl get namespace kubeflow >/dev/null 2>&1; then
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

wondering, if we should move this check to the collect-logs.sh script itself, create a function that checks if the namespace exist and if it does not then skips collection of the logs, just like here. Because this step is used at multiple places, so will be good to avoid code duplicacy

- name: Checkout code
uses: actions/checkout@v5

- name: Free up disk space
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think all workflows that create/deploy cluster needs this step, so may be the better solution is to move this step to create_cluster action and remove this step from all the workflows that explicitly define it. (FYI: AI Assistant can perform this step for you very easily)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants