
Add Nemotron 3 to tests via tiny model #5278

Open
sergiopaniego wants to merge 21 commits into main from nemotron3-tiny-tests

Conversation

@sergiopaniego (Member) commented Mar 12, 2026

What does this PR do?

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Who can review?

@qgallouedec @albertvillanova


Note

Medium Risk
Primarily affects test coverage and internal tiny-model generation, but introduces a new architecture path (NemotronH) and tightens minimum transformers version expectations, which could cause CI/runtime mismatches if versions diverge.

Overview
Adds a new trl-internal-testing/tiny-NemotronHForCausalLM tiny model generator entry (hybrid Mamba+attention) so unit tests can exercise NemotronH/Nemotron 3–style models, including mirroring the model’s float32-only Mamba parameters.
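The float32-only mirroring mentioned above could be sketched roughly as follows. This is illustrative only; the parameter component names (`A_log`, `D`, `dt_bias`) are assumptions based on typical Mamba layers, not taken from this PR:

```python
# Hypothetical sketch: when building a tiny NemotronH checkpoint, keep the
# Mamba-specific parameters in float32 while the rest use a reduced dtype.
# The component names below are assumptions, not taken from this PR.
FLOAT32_ONLY_COMPONENTS = {"A_log", "D", "dt_bias"}

def dtype_for_param(name: str, default_dtype: str = "bfloat16") -> str:
    """Pick a dtype for a parameter based on its dotted name."""
    parts = name.split(".")
    if any(part in FLOAT32_ONLY_COMPONENTS for part in parts):
        return "float32"
    return default_dtype
```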

Wires this tiny model into existing tokenizer/data-utils and trainer test parametrizations (SFTTrainer/DPOTrainer), with skipif(transformers<5.7.0) guards due to NemotronH gradient-checkpointing requirements. Also updates the Nemotron 3 SFT example to require transformers>=5.7.0 and removes the forced disabling of gradient checkpointing.
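The skipif guards described here presumably compare the installed transformers version against 5.7.0. A minimal sketch of the version check behind such a guard (naive parsing that ignores pre-release suffixes; the helper names are hypothetical):

```python
# Minimum transformers version needed for NemotronH gradient checkpointing,
# per this PR. Helper names are hypothetical, not from the TRL test suite.
MIN_TRANSFORMERS = (5, 7, 0)

def parse_version(v: str) -> tuple:
    """Naively parse 'X.Y.Z' into an int tuple (pre-release tags ignored)."""
    return tuple(int(p) for p in v.split(".")[:3])

def needs_skip(installed: str) -> bool:
    """True when the installed transformers version predates 5.7.0, i.e.
    when a pytest.mark.skipif guard should skip the NemotronH tests."""
    return parse_version(installed) < MIN_TRANSFORMERS
```

In the real tests, `needs_skip(transformers.__version__)` would feed the condition of a `pytest.mark.skipif` applied to the NemotronH parametrizations.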

Reviewed by Cursor Bugbot for commit b12633b.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@albertvillanova (Member) left a comment


Thanks!

The CI is red:

  FAILED tests/test_dpo_trainer.py::TestDPOTrainer::test_train[trl-internal-testing/tiny-NemotronHForCausalLM] - RuntimeError: causal_conv1d with channel last layout requires strides (x.stride(0) and x.stride(2)) to be multiples of 8
  FAILED tests/test_sft_trainer.py::TestSFTTrainer::test_train[trl-internal-testing/tiny-NemotronHForCausalLM] - RuntimeError: causal_conv1d with channel last layout requires strides (x.stride(0) and x.stride(2)) to be multiples of 8
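One common workaround for this class of causal_conv1d error is padding the relevant channel dimension up to a multiple of 8 so the channel-last strides satisfy the constraint. A generic rounding helper for that (illustrative only, not the fix this PR ultimately adopted):

```python
def round_up(n: int, multiple: int = 8) -> int:
    """Round n up to the nearest multiple, e.g. to pad a channel dimension
    so channel-last strides land on multiples of 8 as causal_conv1d expects."""
    return ((n + multiple - 1) // multiple) * multiple
```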

Comment thread tests/test_sft_trainer.py Outdated
@qgallouedec (Member) left a comment


thanks!! just a few comments

Comment thread scripts/generate_tiny_models.py
Comment thread tests/test_dpo_trainer.py Outdated
  kwargs = {}
  if "NemotronH" in model_id:
      kwargs["gradient_checkpointing"] = False
      kwargs["use_cpu"] = True

really not sure about this. we don't train on cpu, so why test it + we wouldn't know if a gpu-specific issue is introduced

@qgallouedec (Member)

Thanks!

The CI is red:

  FAILED tests/test_dpo_trainer.py::TestDPOTrainer::test_train[trl-internal-testing/tiny-NemotronHForCausalLM] - RuntimeError: causal_conv1d with channel last layout requires strides (x.stride(0) and x.stride(2)) to be multiples of 8
  FAILED tests/test_sft_trainer.py::TestSFTTrainer::test_train[trl-internal-testing/tiny-NemotronHForCausalLM] - RuntimeError: causal_conv1d with channel last layout requires strides (x.stride(0) and x.stride(2)) to be multiples of 8

is it possible that this error originates from the params used to build the model?

Comment thread scripts/generate_tiny_models.py
Comment thread tests/test_sft_trainer.py Outdated
@sergiopaniego (Member, Author)

Fixes applied. There is an issue with some dependencies that needs to be addressed in transformers.
I've created the PR there: huggingface/transformers#44853.

@albertvillanova (Member)

Hi @sergiopaniego, is there any update on this or the corresponding upstream PR?

@albertvillanova (Member)

Upstream PR has just been merged!

@cursor (Bot) left a comment


Cursor Bugbot has reviewed your changes and found 1 potential issue.


Reviewed by Cursor Bugbot for commit 0423cdb.

Comment thread scripts/generate_tiny_models.py Outdated
@sergiopaniego (Member, Author)

@albertvillanova Upstream PR has just been merged!

yes! It could be approved and merged if tests are green 😄

@sergiopaniego (Member, Author)

The failed test is unrelated.

@sergiopaniego (Member, Author)

The gradient checkpointing PR in transformers (huggingface/transformers#45625) is now merged, so I've updated the version required here to the next transformers release (5.7.0). It should now finally be fixed once that version is released 😄 Tests are green.

cc @albertvillanova

@qgallouedec (Member)

thanks! I think I'll wait for #5637 to be merged or closed before merging this one :)

@sergiopaniego (Member, Author)

makes sense, ping me so I can update this PR once #5637 is merged


Labels: None yet
Projects: None yet
4 participants