refactor: model class hierarchy into Forecaster/StepPredictor layers by parth000007 · Pull Request #513 · mllam/neural-lam

parth000007 · 2026-03-24T13:28:18Z

Refactors the monolithic ARModel class into a composable hierarchy:

ForecasterModule (pl.LightningModule): Training loop, metrics, plotting
ARForecaster (nn.Module): Auto-regressive unrolling
StepPredictor (nn.Module): Single-step prediction interface
BaseGraphModel inherits StepPredictor instead of ARModel

This separation enables:

Non-autoregressive forecasters
New step predictor architectures (e.g. Vision Transformers)
Ensemble strategies without modifying training infrastructure

Also fixes two pre-existing bugs:

interior_mask_bool shape (1,) → (N,) for correct loss masking
all_gather_cat dimension collapse on single-device runs

Refs #49

Describe your changes

< Summary of the changes.>

< Please also include relevant motivation and context. >

< List any dependencies that are required for this change. >

Issue Link

Type of change

🐛 Bug fix (non-breaking change that fixes an issue)
✨ New feature (non-breaking change that adds functionality)
💥 Breaking change (fix or feature that would cause existing functionality to not work as expected)
📖 Documentation (Addition or improvements to documentation)

Checklist before requesting a review

My branch is up-to-date with the target branch - if not update your fork with the changes from the target branch (use pull with --rebase option if possible).
I have performed a self-review of my code
For any new/modified functions/classes I have added docstrings that clearly describe its purpose, expected inputs and returned values
I have placed in-line comments to clarify the intent of any hard-to-understand passages of my code
I have updated the README to cover introduced code changes
I have added tests that prove my fix is effective or that my feature works
I have given the PR a name that clearly describes the change, written in imperative form (context).
I have requested a reviewer and an assignee (assignee is responsible for merging). This applies only if you have write access to the repo, otherwise feel free to tag a maintainer to add a reviewer and assignee.

Checklist for reviewers

Each PR comes with its own improvements and flaws. The reviewer should check the following:

the code is readable
the code is well tested
the code is documented (including return types and parameters)
the code is easy to maintain

Author checklist after completed review

I have added a line to the CHANGELOG describing this change, in a section
reflecting type of change (add section where missing):
- added: when you have added new functionality
- changed: when default behaviour of the code has been changed
- fixes: when your contribution fixes a bug
- maintenance: when your contribution is relates to repo maintenance, e.g. CI/CD or documentation

Checklist for assignee

PR is up to date with the base branch
the tests pass
(if the PR is not just maintenance/bugfix) the PR is assigned to the next milestone. If it is not, propose it for a future milestone.
author has added an entry to the changelog (and designated the change as added, changed, fixed or maintenance)
Once the PR is ready to be merged, squash commits and merge the PR.

Refactors the monolithic ARModel class into a composable hierarchy: - ForecasterModule (pl.LightningModule): Training loop, metrics, plotting - ARForecaster (nn.Module): Auto-regressive unrolling - StepPredictor (nn.Module): Single-step prediction interface - BaseGraphModel inherits StepPredictor instead of ARModel This separation enables: - Non-autoregressive forecasters - New step predictor architectures (e.g. Vision Transformers) - Ensemble strategies without modifying training infrastructure Also fixes two pre-existing bugs: - interior_mask_bool shape (1,) → (N,) for correct loss masking - all_gather_cat dimension collapse on single-device runs Refs mllam#49 Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>

parth000007 · 2026-03-24T13:34:49Z

hi @j6k4m8 @leifdenby @kshirajahere can you please check this PR its benefits it attach here by:-

kshirajahere · 2026-03-24T13:56:28Z

Hey @parth000007 there is an issue with the PR description it has parts of the placeholder of template, please get rid of them..
Also I dont know if it is just issue with my internet but i am not able to see the screenshot.

kshirajahere · 2026-03-24T13:57:16Z

pyproject.toml

The PR is undoing some recent changes

EDIT : Meant to tag ./Agents.md

kshirajahere · 2026-03-24T14:02:53Z

This seems too large and overlapping to review as a separate parallel refactor PR. Since it targets the same #49 hierarchy lane as #444, I think consolidation there would be better than keeping two full refactor paths open.

Sir-Sloth-The-Lazy · 2026-03-24T15:22:36Z

Hi @parth000007, I noticed this PR overlaps significantly with #208 which covers the same ARModel → ForecasterModule/ARForecaster/StepPredictor refactor and is currently under review. Could you clarify how this differs or if it was intended to supersede #208? cc @joeloskarsson @sadamov @leifdenby @observingClouds

Sir-Sloth-The-Lazy · 2026-03-24T15:37:43Z

.github/workflows/ci-pypi-deploy.yml

-        uses: astral-sh/setup-uv@v7
-      - name: Build with uv
-        run: uv build
+      - uses: actions/setup-python@v5


I cannot understand why we have to do this ?

Sir-Sloth-The-Lazy · 2026-03-24T15:40:24Z

neural_lam/datastore/npyfilesmeps/compute_standardization_stats.py

-                f"SLURM_JOB_NODELIST is set to {repr(nodelist)}, but "
-                "'scontrol show hostnames' returned no hostnames. "
-                "Please check your SLURM job configuration."
+        master_node = (


i think the previous version provided more security against a command injection vulnerability

sadamov · 2026-03-25T02:11:29Z

@parth000007 such a massive PR certainly warrants some discussion in #49 before implementation. Especially why you think we need another approach to #208. I'd ask you to contribute to #208 instead and introduce your ideas through code reviews, comments or PRs into that branch instead. thanks!

kshirajahere reviewed Mar 24, 2026

View reviewed changes

Sir-Sloth-The-Lazy reviewed Mar 24, 2026

View reviewed changes

sadamov closed this Mar 25, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: model class hierarchy into Forecaster/StepPredictor layers#513

refactor: model class hierarchy into Forecaster/StepPredictor layers#513
parth000007 wants to merge 1 commit intomllam:mainfrom
parth000007:claude/refactor-model-class-hierarchy-again

parth000007 commented Mar 24, 2026 •

edited

Loading

Uh oh!

parth000007 commented Mar 24, 2026

Uh oh!

kshirajahere commented Mar 24, 2026

Uh oh!

kshirajahere Mar 24, 2026 •

edited

Loading

Uh oh!

kshirajahere Mar 24, 2026

Uh oh!

kshirajahere commented Mar 24, 2026

Uh oh!

Sir-Sloth-The-Lazy commented Mar 24, 2026 •

edited

Loading

Uh oh!

Sir-Sloth-The-Lazy Mar 24, 2026

Uh oh!

Sir-Sloth-The-Lazy Mar 24, 2026

Uh oh!

sadamov commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

parth000007 commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Describe your changes

Issue Link

Type of change

Checklist before requesting a review

Checklist for reviewers

Author checklist after completed review

Checklist for assignee

Uh oh!

parth000007 commented Mar 24, 2026

Uh oh!

kshirajahere commented Mar 24, 2026

Uh oh!

kshirajahere Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kshirajahere Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

kshirajahere commented Mar 24, 2026

Uh oh!

Sir-Sloth-The-Lazy commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Sir-Sloth-The-Lazy Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Sir-Sloth-The-Lazy Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

sadamov commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

parth000007 commented Mar 24, 2026 •

edited

Loading

kshirajahere Mar 24, 2026 •

edited

Loading

Sir-Sloth-The-Lazy commented Mar 24, 2026 •

edited

Loading