62221 Parallelise the performance tests #8190

johnbillion · 2025-01-25T02:28:41Z

This change introduces a job matrix for the "current", "before", and "base" performance tests to replace the current behaviour of running them sequentially in a single job.

This speeds up the overall performance workflow run by 18-20 minutes 🕐 .

Reasoning

The order of the tests at the moment is "current" followed by "before" followed by "base". This means the database upgrade routine between the tests is actually being asked to perform a downgrade, which is not supported. In a PR, the "before" test is potentially being performed with a database schema from the proposed change. This problem is causing the tests in #21022 Switch to using bcrypt for hashing passwords #7333 to unnecessarily fail because of a change to the structure of data in the database which is not supported in the "before" or "base" version of the codebase.
This change removes the potential for to tests to interfere with one another. The current tests rely on running the database upgrade, flushing the cache, and deleting transients between test runs, which doesn't guarantee that the "before" and "base" tests are running in a clean environment compared to the "current" test. By separating the tests into separate jobs, every test run starts with a clean environment and uses the same setup steps.
The original intention of performing the tests sequentially in the same job was to reduce the chance that an environmental factor on GitHub Actions affects the comparison between the "current" and "before" tests that gets reported in a PR. I think in practice this is unlikely and uncommon, and the benefit of having tests that complete significantly faster outweighs this concern. This has no effect on the test reporting that's sent to codevitals.run.
Bonus: Nicely grouped jobs on the GitHub Actions workflow run screens.

Notes

The "v2" workflow files are needed so the tests on old branches continue to work as they do currently.
Published "base" test results can be seen at https://webhook.cool/at/yellow-ice-88/. I'm using webhook.cool to avoid spamming codevitals.run with results from this PR.
The e2e tests, unit tests, and build tests are only disabled in this PR to prevent them from unnecessarily running on this PR. See Add some more paths config to workflow files. #8147 to get that fixed.

Future enhancements

The "base" run result should be cached so it theoretically only ever needs to run once per release and then all subsequent performance runs pull its cached results. Needs more work that's outside of the scope of this change.
The "before" run result should be cached so multiple pushes to a PR don't unnecessarily re-run the same tests. Same as above, this needs more work so I will look into it at a later date.

Todo

Update inline docs in the workflow files
Reinstate the comparison
Reinstate the reporting
Add logic to only run the base test on a push to trunk

Trac ticket: https://core.trac.wordpress.org/ticket/62221

…other run.

…ance tests as they do currently.

johnbillion · 2025-01-28T16:17:21Z

@swissspidy @desrosj @sirreal @joemcgill What are your thoughts on this approach? The PR still needs some tweaks but the bulk of it is ready.

Latest results here: https://github.com/WordPress/wordpress-develop/actions/runs/13012122443

swissspidy · 2025-01-28T16:42:24Z

The original intention of performing the tests sequentially in the same job was to reduce the chance that an environmental factor on GitHub Actions affects the comparison between the "current" and "before" tests that gets reported in a PR. I think in practice this is unlikely and uncommon

IIRC @dmsnell looked into that before and there can actually be quite some fluctuation, even depending on the time of day.

johnbillion added 23 commits January 25, 2025 02:24

Split the performance tests into three jobs so they run in parallel.

2bdcaa9

Prevent unnecessary tests running in this branch.

e11aae7

Naming.

8b62844

This does need to use a script because it fetches an artifact from an…

a12a65f

…other run.

Start the environment in time.

5442eff

We always need a build so all the themes are available.

06072c5

Naming.

3ac9b9a

Compare the results.

8ee02d7

Naming.

f8f53e7

Todos.

6506842

Need a checkout here.

5979877

Reinstate the logging.

50e8501

Reuse the tag name via workflow output.

9090810

Reinstate the original version number handling.

bad5906

Skip the token check for now.

bd82165

Correct this prefix handling.

3739509

Split the comparison and publishing into a reusable workflow.

8430bc3

Prepare for running the base test only on pushes to trunk.

3398fa1

Hack the label.

a977c27

A v2 workflow is needed so older branches continue to run the perform…

9cfff3d

…ance tests as they do currently.

Keep shellcheck happy.

d1fad83

Merge branch 'trunk' into 62221-parallel-performance-tests

4a55471

Update the runners.

3d53664

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

62221 Parallelise the performance tests #8190

62221 Parallelise the performance tests #8190

johnbillion commented Jan 25, 2025 •

edited

Loading

johnbillion commented Jan 28, 2025

swissspidy commented Jan 28, 2025

62221 Parallelise the performance tests #8190

Are you sure you want to change the base?

62221 Parallelise the performance tests #8190

Conversation

johnbillion commented Jan 25, 2025 • edited Loading

Reasoning

Notes

Future enhancements

Todo

johnbillion commented Jan 28, 2025

swissspidy commented Jan 28, 2025

johnbillion commented Jan 25, 2025 •

edited

Loading