refactor(levm): rewrite of state EF tests runner first iteration #3642

sofiazcoaga · 2025-07-15T16:41:43Z

Motivation

Related issue: #3496.

The idea is to incrementally develop a new EF Test runner (for state tests) that can eventually replace the current one. The main goal of the new runner is to be easy to understand and as straightforward as possible, also making it possible to easily add any new requirement.

Considerations

This first iteration was developed using the following test files as reference:

vectors/state_tests/prague/eip2537_bls_12_381_precompiles/bls12_g1add/gas.json
vectors/GeneralStateTests/Cancun/stEIP4844-blobtransactions/blobhashListBounds9.json
every test in the vectors/LegacyTests/Cancun/GeneralStateTests/Cancun/stEIP1153-transientStorage/ directory.

The main changes are:

The new Test and TestCase structures in types.
The runner and parser simplified flows.

Files that should not be reviewed as they are full or partial copies of the original files:

runner_v2/deserialize.rs
runner_v2/utils.rs

This iteration excludes report-related code, option flags and other possible test case errors to be considered that will be included later.

github-actions · 2025-07-15T16:43:41Z

Lines of code report

Total lines added: 1090
Total lines removed: 0
Total lines changed: 1090

Detailed view

+-----------------------------------------------------+-------+------+
| File                                                | Lines | Diff |
+-----------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/lib.rs                    | 7     | +1   |
+-----------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/runner_v2/deserialize.rs  | 294   | +294 |
+-----------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/runner_v2/error.rs        | 23    | +23  |
+-----------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/runner_v2/mod.rs          | 7     | +7   |
+-----------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/runner_v2/parser.rs       | 34    | +34  |
+-----------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/runner_v2/result_check.rs | 168   | +168 |
+-----------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/runner_v2/run.rs          | 11    | +11  |
+-----------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/runner_v2/runner.rs       | 103   | +103 |
+-----------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/runner_v2/types.rs        | 406   | +406 |
+-----------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/runner_v2/utils.rs        | 43    | +43  |
+-----------------------------------------------------+-------+------+

cdiielsi

Really nice work! I like how you distributed the modules, having runner separated from result_check makes things feel less crowded and easier to navigate. I made a couple of comments, one you can consider moving on (it's about writing reports) the rest are rather subjective and totally skippable.

cdiielsi · 2025-07-15T21:49:26Z

cmd/ef_tests/state/runner_v2/error.rs

+    FailedToDeserializeField(String),
+    FailedToCreateReportFile(String),
+    FailedToGetIndexValue(String),
+}


Maybe the runner errors can be distributed at lest in two categories, each with its own enum, so you can distinguish between parsing errors and LEVM errors. I'm not sure about this though because I also think that it could escalate so you end up having too many nested enums and to me that also overcomplicates debugging... Maybe pin this though and consider it as you continue with the runner.

I think this might be a good suggestion, for now I think it could be considered as out of scope as the new runner is still in a quite rustic state, but I will keep in mind for later iterations to improve the Errors.

cdiielsi · 2025-07-15T22:21:04Z

cmd/ef_tests/state/runner_v2/result_check.rs

+        format!(
+            "Test checks failed for test: {:?}, with fork: {:?},  in path: {:?}.\n",
+            test.name, test_case.fork, test.path,
+        )


Maybe it could be useful to have the chance to write only the failed tests in the report. I think this could be regulated with a flag you set when running the tests. That being said, I like the idea of having both the succesful and failed tests on the report so you can have a better overview of what is working and what isn't. When you run all tests it's probably too much, but if you are running per directory or per tests it could be useful.

I think this is a good idea that can be done when I get to writing the execution flags parts. Thank you!

cmd/ef_tests/state/runner_v2/result_check.rs

JereSalo · 2025-07-16T21:06:50Z

Suggestion, leave in PR description the necessary steps for running the tests

JereSalo · 2025-07-16T21:09:28Z

Parsing test files...
Error: FailedToParseTestFile("./vectors/LegacyTests/Cancun/GeneralStateTests/Cancun/stEIP1153-transientStorage/21_tstoreCannotBeDosdOOO.json", "Failed to deserialize post field in test 21_tstoreCannotBeDosdOOO. Serde error: missing field state")

JereSalo · 2025-07-16T21:12:14Z

Now that each test is one struct we can filter out repeated tests, a problem in the other runner that we have was that we were running some tests twice because some files were duplicated

sofiazcoaga added 3 commits July 15, 2025 13:24

add first iteration for new ef state tests runner

5b56dd0

include new runner execution in Cargo.toml tests

0dc1cdf

add new runner module in lib.rs

be2b62e

sofiazcoaga requested a review from a team as a code owner July 15, 2025 16:41

github-actions bot added the levm Lambda EVM implementation label Jul 15, 2025

github-project-automation bot added this to ethrex_l1 Jul 15, 2025

github-actions bot assigned sofiazcoaga Jul 15, 2025

sofiazcoaga marked this pull request as draft July 15, 2025 16:41

apply clippy suggestions

646233a

sofiazcoaga changed the title ~~refactor(levm): refactor of state EF tests runner~~ refactor(levm): rewrite of state EF tests runner first iteration Jul 15, 2025

sofiazcoaga marked this pull request as ready for review July 15, 2025 17:24

Merge branch 'main' into ef_tests/refactor-runner

e96db06

cdiielsi approved these changes Jul 16, 2025

View reviewed changes

sofiazcoaga mentioned this pull request Jul 16, 2025

refactor(levm): rewrite of EF state tests runner second iteration #3666

Draft

sofiazcoaga added 2 commits July 16, 2025 17:00

remove unneeded unwrap() by using if let some() in result checks

2e5c7e5

rename exception_is_expected() to exception_matches_expected()

dd73b5c

mpaulucci moved this to In Progress in ethrex_l1 Jul 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor(levm): rewrite of state EF tests runner first iteration #3642

refactor(levm): rewrite of state EF tests runner first iteration #3642

Uh oh!

sofiazcoaga commented Jul 15, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Jul 15, 2025 •

edited

Loading

Uh oh!

cdiielsi left a comment

Uh oh!

cdiielsi Jul 15, 2025

Uh oh!

sofiazcoaga Jul 16, 2025

Uh oh!

cdiielsi Jul 15, 2025

Uh oh!

sofiazcoaga Jul 16, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

JereSalo commented Jul 16, 2025

Uh oh!

JereSalo commented Jul 16, 2025

Uh oh!

JereSalo commented Jul 16, 2025

Uh oh!

Uh oh!

refactor(levm): rewrite of state EF tests runner first iteration #3642

Are you sure you want to change the base?

refactor(levm): rewrite of state EF tests runner first iteration #3642

Uh oh!

Conversation

sofiazcoaga commented Jul 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jul 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Lines of code report

Uh oh!

cdiielsi left a comment

Choose a reason for hiding this comment

Uh oh!

cdiielsi Jul 15, 2025

Choose a reason for hiding this comment

Uh oh!

sofiazcoaga Jul 16, 2025

Choose a reason for hiding this comment

Uh oh!

cdiielsi Jul 15, 2025

Choose a reason for hiding this comment

Uh oh!

sofiazcoaga Jul 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

JereSalo commented Jul 16, 2025

Uh oh!

JereSalo commented Jul 16, 2025

Uh oh!

JereSalo commented Jul 16, 2025

Uh oh!

Uh oh!

sofiazcoaga commented Jul 15, 2025 •

edited

Loading

github-actions bot commented Jul 15, 2025 •

edited

Loading

sofiazcoaga Jul 16, 2025 •

edited

Loading