feat: aac add marf computation in test harness #6560

fdefelici · 2025-10-03T14:23:49Z

Description

This PR add block marf computation to the AAC test harness, so that we can avoid to pass it as an input.

I checked different ways to do it (even using ephemeral implementation), but in the end the one that worked best was to write on the test chainstate, retrieve the root hash and then rollback the marf transaction.

Applicable issues

fixes #

Additional info (benefits, drawbacks, caveats)

Checklist

Test coverage for new or modified code paths
Changelog is updated
Required documentation changes (e.g., docs/rpc/openapi.yaml and rpc-endpoints.md for v2 endpoints, event-dispatcher.md for new events)
New clarity functions have corresponding PR in clarity-benchmarking repo
New integration test(s) added to bitcoin-tests.yml

codecov · 2025-10-06T07:37:18Z

Codecov Report

❌ Patch coverage is 97.67442% with 5 lines in your changes missing coverage. Please review.
✅ Project coverage is 75.69%. Comparing base (483f1ad) to head (111595a).
⚠️ Report is 15 commits behind head on develop.

Files with missing lines	Patch %	Lines
stackslib/src/chainstate/nakamoto/tests/mod.rs	83.33%	4 Missing ⚠️
stackslib/src/chainstate/tests/consensus.rs	99.43%	1 Missing ⚠️

❌ Your project status has failed because the head coverage (75.69%) is below the target coverage (80.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #6560      +/-   ##
===========================================
+ Coverage    69.88%   75.69%   +5.80%     
===========================================
  Files          568      568              
  Lines       347547   347674     +127     
===========================================
+ Hits        242887   263171   +20284     
+ Misses      104660    84503   -20157

Files with missing lines	Coverage Δ
stacks-signer/src/client/mod.rs	`99.24% <100.00%> (+1.50%)`	⬆️
stackslib/src/chainstate/nakamoto/test_signers.rs	`77.93% <100.00%> (+0.95%)`	⬆️
stackslib/src/chainstate/tests/consensus.rs	`91.86% <99.43%> (-0.70%)`	⬇️
stackslib/src/chainstate/nakamoto/tests/mod.rs	`95.65% <83.33%> (+15.86%)`	⬆️

... and 368 files with indirect coverage changes

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 483f1ad...111595a. Read the comment docs.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Jiloc

Great work, automatically computing the marf will definitely make these tests easier to write! Regarding the approach used, LGTM, but can't gaurantee if there is an easier way that also works for other types of txs. Maybe @kantai can share his thoughs on that.

One thought (that we already discussed offline): since we've removed the MARF from the input parameters (which is great for making the first test execution faster and cleaner), we might now want to include it in the expected output. It's still an important part of consensus, and we'll want to guarantee that a newer version of stacks-node produces the same MARF root for a block as previous versions.

stackslib/src/chainstate/tests/consensus.rs

jferrant · 2025-10-06T12:58:14Z

I agree that the MARF should still be listed in expected outputs :D Great to see this though.

aaronb-stacks

I think that this PR should be updated so that marf_hash is an Option type.

My rationale is that the marf_hash is actually part of the consensus protocol. So, for creating test vectors that prevent consensus breaking changes, it's better if the marf_hash is included in the input vector. Otherwise, a change which altered that hash could pass the test vector (even though it would be a consensus breaking change).

So, for most kinds of tests that we would write here, we actually want the marf hash included explicitly in the test vector. However, there are plenty of cases where we'd want to be able to run this test harness without the marf hash. In particular, it would help during test writing and generation: when someone writes a test vector, they would create all the test blocks, execute the test with the marf hashes set to None, and then use the output to fill in the expected hashes. Then, subsequent changes to the codebase would need to continue to pass tests with those hashes. A similar pattern would be used when setting up fuzzing targets.

…c argument

federico-stacks · 2025-10-08T13:10:42Z

With this update I added the marf_hash to the ExpectedBlockOutput (so that it is registered in the snapshot), and also merged with insta implementation from develop

Caveats:

Now that with don't have no more expected failure in TestOutput, the marf hash is always computed for all test case and if it fails (for invalid block) it set the zeroed marf hash. (Note: eventually we could restore the old implementation in case we decide to add some flag to TestBlock to say if it should be success or failure)
As a conseguence I removed the test test_append_state_index_root_mismatches
Futhermore I add to remove insta::allow_duplicates! because of fact we have different marf hashes for each expected block result

stackslib/src/chainstate/tests/consensus.rs

fdefelici added 4 commits October 3, 2025 12:11

feat: add marf input computation for test-harness, stacks-network#6523

01ffedd

chore: add documentation for marf input computation, stacks-network#6523

da5ab83

chore: improve marf computation failure message, stacks-network#6523

b3bdc1d

chore: remove unused marf_hash field, stacks-network#6523

32f3533

fdefelici requested review from Jiloc, jferrant and kantai October 3, 2025 14:23

fdefelici self-assigned this Oct 3, 2025

fdefelici added the aac Avoiding Accidental Consensus label Oct 3, 2025

fdefelici added this to Stacks Core Eng Oct 3, 2025

fdefelici added this to the 3.2.0.0.2 milestone Oct 3, 2025

fdefelici linked an issue Oct 3, 2025 that may be closed by this pull request

AAC Testing: Develop Integration Test Harness for append_block in stackslib #6523

Open

fdefelici marked this pull request as ready for review October 6, 2025 07:35

fdefelici requested review from a team as code owners October 6, 2025 07:35

fdefelici added the aac-testing Avoiding Accidental Consensus Testing Specific Task label Oct 6, 2025

fdefelici moved this to Status: In Review in Stacks Core Eng Oct 6, 2025

Jiloc reviewed Oct 6, 2025

View reviewed changes

stackslib/src/chainstate/tests/consensus.rs Show resolved Hide resolved

stackslib/src/chainstate/tests/consensus.rs Show resolved Hide resolved

aaronb-stacks reviewed Oct 6, 2025

View reviewed changes

francesco-stacks mentioned this pull request Oct 7, 2025

feat: add macros for contract consensus tests #6562

Merged

5 tasks

fdefelici added 4 commits October 8, 2025 10:03

merge: develop with conflicts

6852a78

chore: address test compile issue on windows requiring specify generi…

ef3aa82

…c argument

refactor: aac update test marf computation

12521f0

feat: add marf_hash to ExpectedBlockOutput

111595a

fdefelici force-pushed the feat/aac-compute-marf branch from e47882b to 111595a Compare October 8, 2025 13:05

jferrant approved these changes Oct 8, 2025

View reviewed changes

Jiloc approved these changes Oct 8, 2025

View reviewed changes

stackslib/src/chainstate/tests/consensus.rs Show resolved Hide resolved

fdefelici added this pull request to the merge queue Oct 9, 2025

Merged via the queue into stacks-network:develop with commit a7f4240 Oct 9, 2025
299 of 303 checks passed

fdefelici deleted the feat/aac-compute-marf branch October 9, 2025 07:30

github-project-automation bot moved this from Status: In Review to Status: ✅ Done in Stacks Core Eng Oct 9, 2025

fdefelici removed this from the 3.2.0.0.2 milestone Oct 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: aac add marf computation in test harness #6560

feat: aac add marf computation in test harness #6560

Uh oh!

fdefelici commented Oct 3, 2025

Uh oh!

codecov bot commented Oct 6, 2025 •

edited

Loading

Uh oh!

Jiloc left a comment

Uh oh!

Uh oh!

Uh oh!

jferrant commented Oct 6, 2025

Uh oh!

aaronb-stacks left a comment

Uh oh!

federico-stacks commented Oct 8, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

feat: aac add marf computation in test harness #6560

feat: aac add marf computation in test harness #6560

Uh oh!

Conversation

fdefelici commented Oct 3, 2025

Description

Applicable issues

Additional info (benefits, drawbacks, caveats)

Checklist

Uh oh!

codecov bot commented Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Jiloc left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

jferrant commented Oct 6, 2025

Uh oh!

aaronb-stacks left a comment

Choose a reason for hiding this comment

Uh oh!

federico-stacks commented Oct 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

codecov bot commented Oct 6, 2025 •

edited

Loading

federico-stacks commented Oct 8, 2025 •

edited

Loading