refactor(benchmark): repricing filter logic #1810

LouisTsai-Csie · 2025-11-26T08:43:42Z

🗒️ Description

Supported Command: fill &execute
Modes:
- -m benchmark (under compute/ folder)
- -m stateful (under stateful/ folder)
Flags
- --gas-benchmark-values (specify the block gas limit)
- --fixed-opcode-count (specify the opcode count)
Options
- -m repricing (run only the subset of benchmark tests for quick testing)

Issue Description
When running benchmark tests with the --fixed-opcode-count flag, the current behavior incorrectly restricts execution to only the benchmark cases marked with the repricing marker.

However, repricing is meant to be an independent option that limits execution to a subset of the benchmark suite.
Using --fixed-opcode-count should not imply repricing mode.

Fix Applied in This PR
This PR adjusts the logic so that:

Running with --fixed-opcode-count executes the full benchmark suite: Four passes are executed (including contract_balance with values 0 and 1):

fill -v tests/benchmark/compute/instruction/test_account_query.py::test_selfbalance --fixed-opcode-count 10 --clean -m benchmark

Running with -m repricing executes only the repricing subset: Two passes are executed (only contract_balance = 0)

fill -v tests/benchmark/compute/instruction/test_account_query.py::test_selfbalance --fixed-opcode-count 10 --clean -m repricing

Add a sanity check CI workflow for fixed opcode count feature.

🔗 Related Issues or PRs

None

✅ Checklist

All: Ran fast tox checks to avoid unnecessary CI fails, see also Code Standards and Enabling Pre-commit Checks:
```
uvx tox -e static
```
All: PR title adheres to the repo standard - it will be used as the squash commit message and should start type(scope):.
All: Considered adding an entry to CHANGELOG.md.
All: Considered updating the online docs in the ./docs/ directory.
All: Set appropriate labels for the changes (only maintainers can apply labels).
Tests: Ran mkdocs serve locally and verified the auto-generated docs for new tests in the Test Case Reference are correctly formatted.
Tests: For PRs implementing a missed test case, update the post-mortem document to add an entry the list.
Ported Tests: All converted JSON/YML tests from ethereum/tests or tests/static have been assigned @ported_from marker.

Cute Animal Picture

LouisTsai-Csie · 2025-11-27T05:12:55Z

packages/testing/src/execution_testing/specs/benchmark.py

+        if self.fixed_opcode_count is not None and self.code_generator is None:
+            pytest.skip(
+                "Cannot run fixed opcode count tests without a code generator"
+            )
+


Currently, the fixed opcode count feature is supported only by benchmark tests that use the code generator. If a test does not support this feature, it should be skipped.

There are two possible ways to filter such tests:

We could skip them during collection via pytest_collection_modifyitems, but this is more complex because it requires determining whether each test is a benchmark test and whether it uses the code generator.

Therefore, I chose to skip the unsupported tests directly in benchmark test wrapper instead.

codecov · 2025-11-28T12:25:55Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 86.08%. Comparing base (f23e4ab) to head (b54883f).
⚠️ Report is 2 commits behind head on forks/osaka.

Additional details and impacted files

@@             Coverage Diff              @@
##           forks/osaka    #1810   +/-   ##
============================================
  Coverage        86.08%   86.08%           
============================================
  Files              743      743           
  Lines            44076    44076           
  Branches          3891     3891           
============================================
  Hits             37941    37941           
  Misses            5657     5657           
  Partials           478      478

Flag	Coverage Δ
unittests	`86.08% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

LouisTsai-Csie self-assigned this Nov 26, 2025

LouisTsai-Csie added C-refactor Category: refactor A-test-benchmark Area: Tests Benchmarks—Performance measurement (eg. `tests/benchmark/*`, `p/t/s/e/benchmark/*`) labels Nov 26, 2025

LouisTsai-Csie marked this pull request as ready for review November 27, 2025 05:07

LouisTsai-Csie commented Nov 27, 2025

View reviewed changes

LouisTsai-Csie force-pushed the refactor-repricing-marker branch from 7945e85 to 79e7a7b Compare November 27, 2025 05:14

LouisTsai-Csie added 3 commits November 28, 2025 13:34

refactor: update filter logic

9a8d81c

feat(benchmark): add benchmark repricing ci workflow

3966f12

refactor: marker filter logic

b54883f

LouisTsai-Csie force-pushed the refactor-repricing-marker branch from 2110a2c to b54883f Compare November 28, 2025 06:26

LouisTsai-Csie marked this pull request as draft December 1, 2025 14:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor(benchmark): repricing filter logic #1810

refactor(benchmark): repricing filter logic #1810

Uh oh!

LouisTsai-Csie commented Nov 26, 2025 •

edited

Loading

Uh oh!

LouisTsai-Csie Nov 27, 2025

Uh oh!

codecov bot commented Nov 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

refactor(benchmark): repricing filter logic #1810

Are you sure you want to change the base?

refactor(benchmark): repricing filter logic #1810

Uh oh!

Conversation

LouisTsai-Csie commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🗒️ Description

🔗 Related Issues or PRs

✅ Checklist

Cute Animal Picture

Uh oh!

LouisTsai-Csie Nov 27, 2025

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Nov 28, 2025

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

LouisTsai-Csie commented Nov 26, 2025 •

edited

Loading