feat: add dataset-level registration QC summary with error decomposition by ataha24 · Pull Request #102 · afids/autoafids

ataha24 · 2026-03-05T22:01:42Z

Add a new regqc_summary rule that aggregates per-subject registration QC data into a single interactive HTML report. The report includes:

Dataset summary statistics (mean/median/std ED, best/worst subject)
Sortable per-subject table with hyperlinks to individual reports
Registration error heatmap (subjects × AFIDs)
ED distribution boxplots per AFID
Mean ED bar chart per subject (colour-coded)
3D scatter of all subjects' AFIDs in MNI space
Error decomposition: systematic bias vs random scatter per AFID
3D cone plot showing bias direction/magnitude at each AFID

The error decomposition separates registration error into a systematic component (mean signed error vector, consistent across subjects and potentially correctable) and a random component (standard deviation, the precision floor of the registration). This enables targeted diagnosis of registration pipeline failures.

Scalability: tested on 50 synthetic subjects (r=0.994 bias recovery, 5.1MB file size). Uses single-trace Plotly rendering and conditional boxpoints for efficient rendering at large N.

Files:

NEW: autoafids/workflow/scripts/regqc_summary.py
MOD: autoafids/workflow/rules/regqc.smk (regqc_summary rule)
MOD: autoafids/workflow/Snakefile (wire into rule all)

Proposed changes

Describe the changes implemented in this pull request. If the changes fix a bug or resolves a feature request, be sure to link the issue. Please explain the issue and how the changes address the issue.

Types of changes

What types of changes does your code introduce? Put an x in the boxes that apply

Bugfix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionalitiy)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation (non-breaking change which adds content to documentation)
Other (if none of the other choices apply)

Checklist

Put an x in the boxes that apply. You can also fill these out after creating the PR. If you are unsure about any of the choices, don't hesitate to ask!

Changes have been tested to ensure that fix is effective or that a feature works.
Changes pass the unit tests
Changes pass the wet run tests
Code has been run through the poe quality task
I have included necessary documentation or comments (as necessary)
Any dependent changes have been merged and published

Notes

All PRs will undergo unit and wet run testing before being reviewed. You may be requested to explain or make additional changes before the PR is accepted.

Add a new regqc_summary rule that aggregates per-subject registration QC data into a single interactive HTML report. The report includes: - Dataset summary statistics (mean/median/std ED, best/worst subject) - Sortable per-subject table with hyperlinks to individual reports - Registration error heatmap (subjects × AFIDs) - ED distribution boxplots per AFID - Mean ED bar chart per subject (colour-coded) - 3D scatter of all subjects' AFIDs in MNI space - Error decomposition: systematic bias vs random scatter per AFID - 3D cone plot showing bias direction/magnitude at each AFID The error decomposition separates registration error into a systematic component (mean signed error vector, consistent across subjects and potentially correctable) and a random component (standard deviation, the precision floor of the registration). This enables targeted diagnosis of registration pipeline failures. Scalability: tested on 50 synthetic subjects (r=0.994 bias recovery, 5.1MB file size). Uses single-trace Plotly rendering and conditional boxpoints for efficient rendering at large N. Files: - NEW: autoafids/workflow/scripts/regqc_summary.py - MOD: autoafids/workflow/rules/regqc.smk (regqc_summary rule) - MOD: autoafids/workflow/Snakefile (wire into rule all)

Dhananjhay · 2026-03-17T15:17:42Z

@ataha24 Any reason why the html file is stored in the dataset dir outside of the subject output?

config  dataset  logs  sub-RUN  work

ataha24 · 2026-03-17T16:32:44Z

@ataha24 Any reason why the html file is stored in the dataset dir outside of the subject output?
config  dataset  logs  sub-RUN  work

Are you referring to the new html? That is dataset level summary (i.e., all subjects aggregated)! I am not sure how bids would organize that. Also we should think about how to name that based on dataset too. Would love your thoughts on this!

Dhananjhay · 2026-03-17T18:20:02Z

@ataha24 Any reason why the html file is stored in the dataset dir outside of the subject output?
config  dataset  logs  sub-RUN  work
Are you referring to the new html? That is dataset level summary (i.e., all subjects aggregated)! I am not sure how bids would organize that. Also we should think about how to name that based on dataset too. Would love your thoughts on this!

Yep! I was just wondering why isn't it inside sub-RUN/ses-preop/regqc dir?

ataha24 · 2026-03-17T18:29:07Z

@ataha24 Any reason why the html file is stored in the dataset dir outside of the subject output?
config  dataset  logs  sub-RUN  work
Are you referring to the new html? That is dataset level summary (i.e., all subjects aggregated)! I am not sure how bids would organize that. Also we should think about how to name that based on dataset too. Would love your thoughts on this!
Yep! I was just wondering why isn't it inside sub-RUN/ses-preop/regqc dir?

Ahh, thats because it is produced from pooling the data across ALL subjects in the bids dataset. So, I am not sure the new html file should sit within a specific subject directory.

github-actions bot requested a review from Dhananjhay March 5, 2026 22:01

github-actions bot assigned ataha24 Mar 5, 2026

Dhananjhay added 4 commits March 17, 2026 09:16

replace ambigious x with small letter x

2843884

fix ruff lint

25e0f26

fix long line errors

6e6165f

run snakefmt

3049158

upgrade regqc env to python 3.10

6e272d3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add dataset-level registration QC summary with error decomposition#102

feat: add dataset-level registration QC summary with error decomposition#102
ataha24 wants to merge 6 commits intomainfrom
feature/regqc-dataset-summary

ataha24 commented Mar 5, 2026

Uh oh!

Dhananjhay commented Mar 17, 2026

Uh oh!

ataha24 commented Mar 17, 2026

Uh oh!

Dhananjhay commented Mar 17, 2026 •

edited

Loading

Uh oh!

ataha24 commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ataha24 commented Mar 5, 2026

Proposed changes

Types of changes

Checklist

Notes

Uh oh!

Dhananjhay commented Mar 17, 2026

Uh oh!

ataha24 commented Mar 17, 2026

Uh oh!

Dhananjhay commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ataha24 commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Dhananjhay commented Mar 17, 2026 •

edited

Loading