Skip to content

fix(scheduler): Add a retry for pod patching (labels or status) by the scheduler#1437

Open
davidLif wants to merge 5 commits into
mainfrom
DavidL/fix-pod-status-update-retry
Open

fix(scheduler): Add a retry for pod patching (labels or status) by the scheduler#1437
davidLif wants to merge 5 commits into
mainfrom
DavidL/fix-pod-status-update-retry

Conversation

@davidLif
Copy link
Copy Markdown
Collaborator

@davidLif davidLif commented Apr 16, 2026

Description

Add a retry for pod patching (labels or status) by the scheduler. This follows a similar pattern to the current podgroup patching retries on failure.
Also, Do not retry podgroup status updates if the update failure was "podgroup not found".

Related Issues

Fixes #

Checklist

Note: Ensure your PR title follows the Conventional Commits format (e.g., feat(scheduler): add new feature)

  • Self-reviewed
  • Added/updated tests (if needed)
  • Updated documentation (if needed)

Breaking Changes

Additional Notes

@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai Bot commented Apr 16, 2026

Important

Review skipped

Auto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 39e24eae-8cf9-4e5c-ae6e-3cd5f96561d7

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

  • 🔍 Trigger review
✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch DavidL/fix-pod-status-update-retry

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@davidLif davidLif enabled auto-merge April 16, 2026 21:49
@github-actions
Copy link
Copy Markdown

Merging this branch will decrease overall coverage

Impacted Packages Coverage Δ 🤖
github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater 78.59% (-0.62%) 👎

Coverage by file

Changed files (no unit tests)

Changed File Coverage Δ Total Covered Missed 🤖
github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater/concurrency.go 89.47% (-4.56%) 76 (+9) 68 (+5) 8 (+4) 👎

Please note that the "Total", "Covered", and "Missed" counts above refer to code statements instead of lines of code. The value in brackets refers to the test coverage of that file in the old version of the code.

Changed unit test files

  • github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater/concurrency_test.go
  • github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater/default_status_updater_test.go

@github-actions
Copy link
Copy Markdown

github-actions Bot commented Apr 16, 2026

📊 Performance Benchmark Results

Comparing PR (DavidL/fix-pod-status-update-retry) vs main branch — click to expand
goos: linux
goarch: amd64
pkg: github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/actions
cpu: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
                                    │ main-bench.txt │           pr-bench.txt            │
                                    │     sec/op     │   sec/op     vs base              │
AllocateAction_SmallCluster-4            107.6m ± 5%   107.5m ± 0%       ~ (p=0.180 n=6)
AllocateAction_MediumCluster-4           133.3m ± 0%   133.2m ± 1%       ~ (p=0.699 n=6)
AllocateAction_LargeCluster-4            207.1m ± 8%   207.4m ± 7%       ~ (p=1.000 n=6)
ReclaimAction_SmallCluster-4             103.0m ± 0%   102.8m ± 0%       ~ (p=0.132 n=6)
ReclaimAction_MediumCluster-4            105.8m ± 1%   105.4m ± 1%  -0.39% (p=0.026 n=6)
PreemptAction_SmallCluster-4             103.6m ± 0%   103.5m ± 0%  -0.16% (p=0.041 n=6)
PreemptAction_MediumCluster-4            109.5m ± 0%   109.4m ± 1%       ~ (p=0.818 n=6)
ConsolidationAction_SmallCluster-4       123.4m ± 1%   123.1m ± 1%  -0.27% (p=0.009 n=6)
ConsolidationAction_MediumCluster-4      290.7m ± 1%   288.7m ± 1%       ~ (p=0.093 n=6)
FullSchedulingCycle_SmallCluster-4       104.9m ± 1%   105.1m ± 0%       ~ (p=0.180 n=6)
FullSchedulingCycle_MediumCluster-4      119.9m ± 1%   118.6m ± 1%  -1.13% (p=0.009 n=6)
FullSchedulingCycle_LargeCluster-4       158.5m ± 1%   160.1m ± 2%  +1.02% (p=0.041 n=6)
ManyQueues_MediumCluster-4               136.0m ± 2%   135.8m ± 2%       ~ (p=0.485 n=6)
GangScheduling_MediumCluster-4           154.4m ± 1%   154.4m ± 1%       ~ (p=0.937 n=6)
geomean                                  133.2m        133.0m       -0.14%

                                    │ main-bench.txt │            pr-bench.txt            │
                                    │      B/op      │     B/op      vs base              │
AllocateAction_SmallCluster-4           2.177Mi ± 1%   2.177Mi ± 0%       ~ (p=1.000 n=6)
AllocateAction_MediumCluster-4          11.89Mi ± 0%   11.89Mi ± 0%       ~ (p=0.937 n=6)
AllocateAction_LargeCluster-4           41.28Mi ± 0%   41.28Mi ± 0%       ~ (p=0.240 n=6)
ReclaimAction_SmallCluster-4            906.2Ki ± 1%   906.4Ki ± 1%       ~ (p=0.937 n=6)
ReclaimAction_MediumCluster-4           2.963Mi ± 0%   2.965Mi ± 0%       ~ (p=1.000 n=6)
PreemptAction_SmallCluster-4            1.037Mi ± 1%   1.039Mi ± 0%       ~ (p=0.180 n=6)
PreemptAction_MediumCluster-4           4.216Mi ± 0%   4.216Mi ± 0%       ~ (p=0.699 n=6)
ConsolidationAction_SmallCluster-4      9.908Mi ± 0%   9.907Mi ± 0%       ~ (p=0.937 n=6)
ConsolidationAction_MediumCluster-4     89.24Mi ± 0%   89.24Mi ± 0%       ~ (p=0.589 n=6)
FullSchedulingCycle_SmallCluster-4      1.392Mi ± 1%   1.393Mi ± 0%       ~ (p=0.818 n=6)
FullSchedulingCycle_MediumCluster-4     6.929Mi ± 0%   6.928Mi ± 0%       ~ (p=0.589 n=6)
FullSchedulingCycle_LargeCluster-4      22.87Mi ± 0%   22.86Mi ± 0%       ~ (p=0.937 n=6)
ManyQueues_MediumCluster-4              16.35Mi ± 0%   16.35Mi ± 0%       ~ (p=0.394 n=6)
GangScheduling_MediumCluster-4          17.17Mi ± 0%   17.17Mi ± 0%       ~ (p=0.699 n=6)
geomean                                 6.994Mi        6.996Mi       +0.02%

                                    │ main-bench.txt │           pr-bench.txt            │
                                    │   allocs/op    │  allocs/op   vs base              │
AllocateAction_SmallCluster-4            35.08k ± 0%   35.08k ± 0%       ~ (p=0.617 n=6)
AllocateAction_MediumCluster-4           313.0k ± 0%   313.0k ± 0%       ~ (p=0.660 n=6)
AllocateAction_LargeCluster-4            1.339M ± 0%   1.339M ± 0%       ~ (p=0.058 n=6)
ReclaimAction_SmallCluster-4             8.221k ± 0%   8.220k ± 0%       ~ (p=0.861 n=6)
ReclaimAction_MediumCluster-4            26.16k ± 0%   26.16k ± 0%       ~ (p=0.318 n=6)
PreemptAction_SmallCluster-4             11.14k ± 0%   11.14k ± 0%       ~ (p=0.400 n=6)
PreemptAction_MediumCluster-4            38.89k ± 0%   38.89k ± 0%       ~ (p=0.251 n=6)
ConsolidationAction_SmallCluster-4       130.0k ± 0%   130.0k ± 0%       ~ (p=0.818 n=6)
ConsolidationAction_MediumCluster-4      1.307M ± 0%   1.307M ± 0%       ~ (p=0.223 n=6)
FullSchedulingCycle_SmallCluster-4       20.71k ± 0%   20.71k ± 0%       ~ (p=0.621 n=6)
FullSchedulingCycle_MediumCluster-4      168.4k ± 0%   168.4k ± 0%       ~ (p=0.738 n=6)
FullSchedulingCycle_LargeCluster-4       698.9k ± 0%   698.9k ± 0%       ~ (p=0.784 n=6)
ManyQueues_MediumCluster-4               350.9k ± 0%   350.9k ± 0%       ~ (p=0.394 n=6)
GangScheduling_MediumCluster-4           571.9k ± 0%   571.9k ± 0%       ~ (p=0.617 n=6)
geomean                                  119.0k        119.0k       -0.00%

pkg: github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/actions/integration_tests/reclaim
                            │ main-bench.txt │           pr-bench.txt            │
                            │     sec/op     │   sec/op     vs base              │
ReclaimLargeJobs_10Node-4        105.5m ± 1%   105.4m ± 1%       ~ (p=0.818 n=6)
ReclaimLargeJobs_50Node-4        224.1m ± 1%   224.6m ± 1%       ~ (p=0.240 n=6)
ReclaimLargeJobs_100Node-4       367.7m ± 5%   373.5m ± 7%       ~ (p=0.394 n=6)
ReclaimLargeJobs_200Node-4       719.6m ± 1%   732.7m ± 1%  +1.83% (p=0.002 n=6)
ReclaimLargeJobs_500Node-4        2.209 ± 1%    2.231 ± 1%  +0.98% (p=0.015 n=6)
ReclaimLargeJobs_1000Node-4       6.332 ± 1%    6.306 ± 1%       ~ (p=0.589 n=6)
geomean                          666.3m        670.9m       +0.69%

                            │ main-bench.txt │            pr-bench.txt            │
                            │      B/op      │     B/op      vs base              │
ReclaimLargeJobs_10Node-4       1.973Mi ± 3%   1.979Mi ± 3%       ~ (p=0.240 n=6)
ReclaimLargeJobs_50Node-4       60.18Mi ± 0%   60.19Mi ± 0%       ~ (p=0.818 n=6)
ReclaimLargeJobs_100Node-4      120.4Mi ± 0%   120.4Mi ± 0%       ~ (p=1.000 n=6)
ReclaimLargeJobs_200Node-4      243.1Mi ± 0%   243.1Mi ± 0%       ~ (p=1.000 n=6)
ReclaimLargeJobs_500Node-4      623.5Mi ± 0%   623.5Mi ± 0%       ~ (p=0.589 n=6)
ReclaimLargeJobs_1000Node-4     1.272Gi ± 0%   1.272Gi ± 0%       ~ (p=0.818 n=6)
geomean                         118.9Mi        119.0Mi       +0.05%

                            │ main-bench.txt │           pr-bench.txt            │
                            │   allocs/op    │  allocs/op   vs base              │
ReclaimLargeJobs_10Node-4        22.26k ± 2%   22.26k ± 2%       ~ (p=0.574 n=6)
ReclaimLargeJobs_50Node-4        814.5k ± 0%   814.5k ± 0%       ~ (p=0.589 n=6)
ReclaimLargeJobs_100Node-4       1.620M ± 0%   1.620M ± 0%       ~ (p=0.288 n=6)
ReclaimLargeJobs_200Node-4       3.230M ± 0%   3.230M ± 0%       ~ (p=0.937 n=6)
ReclaimLargeJobs_500Node-4       8.082M ± 0%   8.082M ± 0%       ~ (p=0.699 n=6)
ReclaimLargeJobs_1000Node-4      16.25M ± 0%   16.25M ± 0%       ~ (p=1.000 n=6)
geomean                          1.523M        1.523M       +0.00%

Legend

  • 📉 Negative delta = Performance improvement (faster)
  • 📈 Positive delta = Performance regression (slower)
  • p-value < 0.05 indicates statistically significant change
Raw benchmark data

PR branch:

goos: linux
goarch: amd64
pkg: github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/actions
cpu: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
BenchmarkAllocateAction_SmallCluster-4         	      10	 107527876 ns/op	 2283081 B/op	   35078 allocs/op
BenchmarkAllocateAction_SmallCluster-4         	      10	 107664815 ns/op	 2284879 B/op	   35079 allocs/op
BenchmarkAllocateAction_SmallCluster-4         	      10	 107531624 ns/op	 2282092 B/op	   35077 allocs/op
BenchmarkAllocateAction_SmallCluster-4         	      10	 107463537 ns/op	 2283551 B/op	   35078 allocs/op
BenchmarkAllocateAction_SmallCluster-4         	      10	 107419263 ns/op	 2282140 B/op	   35072 allocs/op
BenchmarkAllocateAction_SmallCluster-4         	      10	 107487092 ns/op	 2282461 B/op	   35074 allocs/op
BenchmarkAllocateAction_MediumCluster-4        	       8	 133297672 ns/op	12472039 B/op	  313012 allocs/op
BenchmarkAllocateAction_MediumCluster-4        	       8	 132218541 ns/op	12472095 B/op	  312998 allocs/op
BenchmarkAllocateAction_MediumCluster-4        	       8	 133068445 ns/op	12469367 B/op	  312997 allocs/op
BenchmarkAllocateAction_MediumCluster-4        	       8	 133472319 ns/op	12473933 B/op	  312998 allocs/op
BenchmarkAllocateAction_MediumCluster-4        	       8	 134209798 ns/op	12469097 B/op	  312996 allocs/op
BenchmarkAllocateAction_MediumCluster-4        	       8	 132888767 ns/op	12470012 B/op	  312999 allocs/op
BenchmarkAllocateAction_LargeCluster-4         	       5	 209727387 ns/op	43291753 B/op	 1338700 allocs/op
BenchmarkAllocateAction_LargeCluster-4         	       5	 206448365 ns/op	43285265 B/op	 1338693 allocs/op
BenchmarkAllocateAction_LargeCluster-4         	       5	 208065578 ns/op	43285569 B/op	 1338692 allocs/op
BenchmarkAllocateAction_LargeCluster-4         	       5	 206355184 ns/op	43316131 B/op	 1338702 allocs/op
BenchmarkAllocateAction_LargeCluster-4         	       5	 222901372 ns/op	43283590 B/op	 1338674 allocs/op
BenchmarkAllocateAction_LargeCluster-4         	       5	 206809520 ns/op	43296982 B/op	 1338683 allocs/op
BenchmarkReclaimAction_SmallCluster-4          	      10	 102914962 ns/op	  921468 B/op	    8191 allocs/op
BenchmarkReclaimAction_SmallCluster-4          	      10	 102773411 ns/op	  925919 B/op	    8209 allocs/op
BenchmarkReclaimAction_SmallCluster-4          	      10	 102714998 ns/op	  930319 B/op	    8222 allocs/op
BenchmarkReclaimAction_SmallCluster-4          	      10	 103210124 ns/op	  930506 B/op	    8223 allocs/op
BenchmarkReclaimAction_SmallCluster-4          	      10	 102804446 ns/op	  922741 B/op	    8219 allocs/op
BenchmarkReclaimAction_SmallCluster-4          	      10	 102781587 ns/op	  930412 B/op	    8221 allocs/op
BenchmarkReclaimAction_MediumCluster-4         	      10	 105956330 ns/op	 3103572 B/op	   26163 allocs/op
BenchmarkReclaimAction_MediumCluster-4         	      10	 105316832 ns/op	 3115978 B/op	   26164 allocs/op
BenchmarkReclaimAction_MediumCluster-4         	      10	 105399352 ns/op	 3103493 B/op	   26162 allocs/op
BenchmarkReclaimAction_MediumCluster-4         	      10	 105433553 ns/op	 3110872 B/op	   26163 allocs/op
BenchmarkReclaimAction_MediumCluster-4         	      10	 105388111 ns/op	 3110984 B/op	   26164 allocs/op
BenchmarkReclaimAction_MediumCluster-4         	      10	 105341620 ns/op	 3107397 B/op	   26163 allocs/op
BenchmarkPreemptAction_SmallCluster-4          	      10	 103449093 ns/op	 1089250 B/op	   11139 allocs/op
BenchmarkPreemptAction_SmallCluster-4          	      10	 103583695 ns/op	 1089135 B/op	   11138 allocs/op
BenchmarkPreemptAction_SmallCluster-4          	      10	 103699123 ns/op	 1085559 B/op	   11136 allocs/op
BenchmarkPreemptAction_SmallCluster-4          	      10	 103490819 ns/op	 1089138 B/op	   11137 allocs/op
BenchmarkPreemptAction_SmallCluster-4          	      10	 103395601 ns/op	 1085191 B/op	   11136 allocs/op
BenchmarkPreemptAction_SmallCluster-4          	      10	 103133473 ns/op	 1088959 B/op	   11136 allocs/op
BenchmarkPreemptAction_MediumCluster-4         	      10	 109437193 ns/op	 4418980 B/op	   38892 allocs/op
BenchmarkPreemptAction_MediumCluster-4         	      10	 110964356 ns/op	 4429931 B/op	   38896 allocs/op
BenchmarkPreemptAction_MediumCluster-4         	      10	 110301080 ns/op	 4423262 B/op	   38895 allocs/op
BenchmarkPreemptAction_MediumCluster-4         	      10	 109431034 ns/op	 4423060 B/op	   38895 allocs/op
BenchmarkPreemptAction_MediumCluster-4         	      10	 109394949 ns/op	 4419253 B/op	   38893 allocs/op
BenchmarkPreemptAction_MediumCluster-4         	      10	 109397963 ns/op	 4419245 B/op	   38894 allocs/op
BenchmarkConsolidationAction_SmallCluster-4    	       9	 122414333 ns/op	10388108 B/op	  129977 allocs/op
BenchmarkConsolidationAction_SmallCluster-4    	       9	 123017382 ns/op	10390738 B/op	  130017 allocs/op
BenchmarkConsolidationAction_SmallCluster-4    	       9	 123286901 ns/op	10383359 B/op	  129956 allocs/op
BenchmarkConsolidationAction_SmallCluster-4    	       9	 123335176 ns/op	10387190 B/op	  129984 allocs/op

Main branch:

goos: linux
goarch: amd64
pkg: github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/actions
cpu: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
BenchmarkAllocateAction_SmallCluster-4         	       9	 113147798 ns/op	 2315533 B/op	   35091 allocs/op
BenchmarkAllocateAction_SmallCluster-4         	      10	 107556309 ns/op	 2282992 B/op	   35078 allocs/op
BenchmarkAllocateAction_SmallCluster-4         	      10	 107772449 ns/op	 2284802 B/op	   35080 allocs/op
BenchmarkAllocateAction_SmallCluster-4         	      10	 107574029 ns/op	 2282034 B/op	   35075 allocs/op
BenchmarkAllocateAction_SmallCluster-4         	      10	 107662463 ns/op	 2282005 B/op	   35074 allocs/op
BenchmarkAllocateAction_SmallCluster-4         	      10	 107238720 ns/op	 2283323 B/op	   35075 allocs/op
BenchmarkAllocateAction_MediumCluster-4        	       8	 132736551 ns/op	12471626 B/op	  313005 allocs/op
BenchmarkAllocateAction_MediumCluster-4        	       8	 133264139 ns/op	12471814 B/op	  313008 allocs/op
BenchmarkAllocateAction_MediumCluster-4        	       8	 133335963 ns/op	12473545 B/op	  313007 allocs/op
BenchmarkAllocateAction_MediumCluster-4        	       8	 133320051 ns/op	12469519 B/op	  312996 allocs/op
BenchmarkAllocateAction_MediumCluster-4        	       8	 133376830 ns/op	12471245 B/op	  313007 allocs/op
BenchmarkAllocateAction_MediumCluster-4        	       8	 133772172 ns/op	12469223 B/op	  312996 allocs/op
BenchmarkAllocateAction_LargeCluster-4         	       5	 206730894 ns/op	43283936 B/op	 1338676 allocs/op
BenchmarkAllocateAction_LargeCluster-4         	       5	 205075474 ns/op	43289233 B/op	 1338680 allocs/op
BenchmarkAllocateAction_LargeCluster-4         	       5	 207322925 ns/op	43282971 B/op	 1338671 allocs/op
BenchmarkAllocateAction_LargeCluster-4         	       5	 223046058 ns/op	43284267 B/op	 1338672 allocs/op
BenchmarkAllocateAction_LargeCluster-4         	       5	 208818491 ns/op	43282604 B/op	 1338683 allocs/op
BenchmarkAllocateAction_LargeCluster-4         	       5	 206918246 ns/op	43306921 B/op	 1338692 allocs/op
BenchmarkReclaimAction_SmallCluster-4          	      10	 103216141 ns/op	  921053 B/op	    8188 allocs/op
BenchmarkReclaimAction_SmallCluster-4          	      10	 102766884 ns/op	  925887 B/op	    8209 allocs/op
BenchmarkReclaimAction_SmallCluster-4          	      10	 102952386 ns/op	  929983 B/op	    8222 allocs/op
BenchmarkReclaimAction_SmallCluster-4          	      10	 103010297 ns/op	  930416 B/op	    8221 allocs/op
BenchmarkReclaimAction_SmallCluster-4          	      10	 103067695 ns/op	  930453 B/op	    8221 allocs/op
BenchmarkReclaimAction_SmallCluster-4          	      10	 103217088 ns/op	  922967 B/op	    8221 allocs/op
BenchmarkReclaimAction_MediumCluster-4         	      10	 106531886 ns/op	 3107062 B/op	   26161 allocs/op
BenchmarkReclaimAction_MediumCluster-4         	      10	 105797620 ns/op	 3107332 B/op	   26163 allocs/op
BenchmarkReclaimAction_MediumCluster-4         	      10	 107183742 ns/op	 3112008 B/op	   26162 allocs/op
BenchmarkReclaimAction_MediumCluster-4         	      10	 105781401 ns/op	 3107256 B/op	   26163 allocs/op
BenchmarkReclaimAction_MediumCluster-4         	      10	 105737099 ns/op	 3111052 B/op	   26163 allocs/op
BenchmarkReclaimAction_MediumCluster-4         	      10	 105806465 ns/op	 3107295 B/op	   26163 allocs/op
BenchmarkPreemptAction_SmallCluster-4          	      10	 103643112 ns/op	 1085368 B/op	   11136 allocs/op
BenchmarkPreemptAction_SmallCluster-4          	      10	 103626932 ns/op	 1088812 B/op	   11137 allocs/op
BenchmarkPreemptAction_SmallCluster-4          	      10	 103604723 ns/op	 1088988 B/op	   11137 allocs/op
BenchmarkPreemptAction_SmallCluster-4          	      10	 103806989 ns/op	 1089040 B/op	   11137 allocs/op
BenchmarkPreemptAction_SmallCluster-4          	      10	 103722008 ns/op	 1081276 B/op	   11134 allocs/op
BenchmarkPreemptAction_SmallCluster-4          	      10	 103521716 ns/op	 1081421 B/op	   11135 allocs/op
BenchmarkPreemptAction_MediumCluster-4         	      10	 109812328 ns/op	 4422748 B/op	   38892 allocs/op
BenchmarkPreemptAction_MediumCluster-4         	      10	 109391148 ns/op	 4419456 B/op	   38894 allocs/op
BenchmarkPreemptAction_MediumCluster-4         	      10	 109497725 ns/op	 4429336 B/op	   38894 allocs/op
BenchmarkPreemptAction_MediumCluster-4         	      10	 109602543 ns/op	 4414940 B/op	   38890 allocs/op
BenchmarkPreemptAction_MediumCluster-4         	      10	 109514128 ns/op	 4418866 B/op	   38892 allocs/op
BenchmarkPreemptAction_MediumCluster-4         	      10	 109372093 ns/op	 4423136 B/op	   38895 allocs/op
BenchmarkConsolidationAction_SmallCluster-4    	       9	 123379168 ns/op	10390328 B/op	  130012 allocs/op
BenchmarkConsolidationAction_SmallCluster-4    	       9	 124416136 ns/op	10388664 B/op	  130004 allocs/op
BenchmarkConsolidationAction_SmallCluster-4    	       9	 123718439 ns/op	10393592 B/op	  130024 allocs/op
BenchmarkConsolidationAction_SmallCluster-4    	       9	 123299256 ns/op	10380569 B/op	  129945 allocs/op

@davidLif davidLif force-pushed the DavidL/fix-pod-status-update-retry branch from 54b0022 to 479123f Compare April 19, 2026 04:34
@github-actions
Copy link
Copy Markdown

Merging this branch will decrease overall coverage

Impacted Packages Coverage Δ 🤖
github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater 79.18% (-0.64%) 👎

Coverage by file

Changed files (no unit tests)

Changed File Coverage Δ Total Covered Missed 🤖
github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater/concurrency.go 90.79% (-4.73%) 76 (+9) 69 (+5) 7 (+4) 👎

Please note that the "Total", "Covered", and "Missed" counts above refer to code statements instead of lines of code. The value in brackets refers to the test coverage of that file in the old version of the code.

Changed unit test files

  • github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater/concurrency_test.go
  • github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater/default_status_updater_test.go

su.inFlightPods.Delete(key)
return
}
log.StatusUpdaterLogger.V(1).Errorf("Failed to patch pod %s/%s, retrying: %v",
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There should be a limit to the retries

Signed-off-by: davidLif <davidshani12@gmail.com>
…e podgroup

Signed-off-by: davidLif <davidshani12@gmail.com>
Signed-off-by: davidLif <davidshani12@gmail.com>
Signed-off-by: davidLif <davidshani12@gmail.com>
@davidLif davidLif force-pushed the DavidL/fix-pod-status-update-retry branch from 479123f to 410d721 Compare April 20, 2026 08:50
@github-actions
Copy link
Copy Markdown

Merging this branch will decrease overall coverage

Impacted Packages Coverage Δ 🤖
github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater 79.18% (-0.64%) 👎

Coverage by file

Changed files (no unit tests)

Changed File Coverage Δ Total Covered Missed 🤖
github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater/concurrency.go 90.79% (-4.73%) 76 (+9) 69 (+5) 7 (+4) 👎

Please note that the "Total", "Covered", and "Missed" counts above refer to code statements instead of lines of code. The value in brackets refers to the test coverage of that file in the old version of the code.

Changed unit test files

  • github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater/concurrency_test.go
  • github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater/default_status_updater_test.go

@github-actions
Copy link
Copy Markdown

Merging this branch will decrease overall coverage

Impacted Packages Coverage Δ 🤖
github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater 78.89% (-0.33%) 👎

Coverage by file

Changed files (no unit tests)

Changed File Coverage Δ Total Covered Missed 🤖
github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater/concurrency.go 89.47% (-4.56%) 76 (+9) 68 (+5) 8 (+4) 👎

Please note that the "Total", "Covered", and "Missed" counts above refer to code statements instead of lines of code. The value in brackets refers to the test coverage of that file in the old version of the code.

Changed unit test files

  • github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater/concurrency_test.go
  • github.com/kai-scheduler/KAI-scheduler/pkg/scheduler/cache/status_updater/default_status_updater_test.go

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants