
Conversation


@mthede mthede commented Nov 26, 2025

Description

With the new learning architecture on main, we can streamline how default parameters are handled. Previously, defaults were defined redundantly in multiple places, making it difficult to determine which values applied in practice.

The updated structure centralizes all defaults within the learning role or in the dedicated learning_config data class, improving clarity and maintainability.

It also improves the learning configuration logic, which is now based on four different simulation options (with and without DRL):

  1. No RL -> no learning_config -> no learning role
  2. Single run with loaded RL strategies -> learning_config with learning_mode: false and trained_policies_load_path provided -> learning role, but no RL algorithm etc. necessary
  3. Training run -> learning_mode: true and full set of learning functions
    3.1. Training episodes
    3.2. Evaluation episodes (config item evaluation_mode exists, but the learning loop overwrites it anyway)
  4. Continue learning -> continue_learning: true and trained_policies_load_path provided (internally sets learning_mode = True to enter the usual learning process)
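
The branching above can be sketched roughly like this. This is a minimal illustration only: the LearningConfig fields follow the PR description, but the function and its return values are assumptions for clarity, not the actual ASSUME implementation.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LearningConfig:
    # defaults live in one place instead of being defined redundantly
    learning_mode: bool = False
    continue_learning: bool = False
    trained_policies_load_path: Optional[str] = None


def setup_learning(config):
    if config is None:
        return None  # option 1: no RL -> no learning_config -> no learning role
    lc = LearningConfig(**config)
    if lc.continue_learning and lc.trained_policies_load_path:
        lc.learning_mode = True  # option 4: continue learning enters the usual loop
    if lc.learning_mode:
        return "training run"  # option 3: full set of learning functions
    if lc.trained_policies_load_path:
        return "load trained policies"  # option 2: learning role, no RL algorithm
    return None
```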

Checklist

  • Documentation updated (docstrings, READMEs, user guides, inline comments, doc folder updates etc.)
  • New unit/integration tests added (if applicable)
  • Changes noted in release notes (if any)
  • Consent to release this PR's code under the GNU Affero General Public License v3.0

Additional Notes (optional)

@mthede mthede requested review from kim-mskw and maurerle November 26, 2025 07:58
@mthede mthede marked this pull request as draft November 26, 2025 07:59
@mthede mthede marked this pull request as ready for review November 26, 2025 08:05

codecov bot commented Nov 26, 2025

Codecov Report

❌ Patch coverage is 91.44737% with 13 lines in your changes missing coverage. Please review.
✅ Project coverage is 45.61%. Comparing base (54370e9) to head (c372568).
⚠️ Report is 15 commits behind head on main.

Files with missing lines Patch % Lines
assume/reinforcement_learning/learning_role.py 80.00% 9 Missing ⚠️
assume/world.py 89.47% 2 Missing ⚠️
assume/common/base.py 97.82% 1 Missing ⚠️
assume/scenario/loader_csv.py 92.85% 1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #692      +/-   ##
==========================================
- Coverage   45.72%   45.61%   -0.11%     
==========================================
  Files          54       54              
  Lines        8031     8030       -1     
==========================================
- Hits         3672     3663       -9     
- Misses       4359     4367       +8     
Flag Coverage Δ
pytest 45.61% <91.44%> (-0.11%) ⬇️


@maurerle maurerle left a comment


Good decision to create a new PR for it.
I think that the learning_config should be renamed if it is only a dict.

Just an idea:
If we always have a learning_role anyway and this learning_role does always have the learning_config - we could also just set the properties in the learning_role.
Therefore, we can ditch LearningConfig and have world.learning_role.trained_policies_save_path instead of world.learning_role.learning_config.trained_policies_save_path?
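
The suggestion above could look roughly like this. LearningRole here is a stand-in for illustration, not the actual ASSUME class:

```python
class LearningRole:
    """Stand-in sketch: exposes config values directly as role attributes."""

    def __init__(self, learning_config: dict):
        # promote each user-settable config entry to a plain attribute,
        # removing one level of nesting for callers
        for key, value in learning_config.items():
            setattr(self, key, value)


role = LearningRole({"trained_policies_save_path": "policies/run1"})
# access becomes role.trained_policies_save_path
# instead of role.learning_config.trained_policies_save_path
```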


@jrasko jrasko left a comment


Training a simulation (02a) on my GPU does not work in this branch; however, it works fine on main.

RuntimeError: Expected all tensors to be on the same device, but got mat1 is on cpu, different from other tensors on cuda:0 (when checking argument in method wrapper_CUDA_addmm)


mthede commented Nov 27, 2025

@jrasko Will look into the cuda problem tomorrow! Could you please try removing the "and not self.learning_config.evaluation_mode" in the learning role?

@mthede mthede requested review from jrasko and maurerle November 27, 2025 17:31

mthede commented Nov 27, 2025

Good decision to create a new PR for it. I think that the learning_config should be renamed if it is only a dict.

Do you mean renaming it in the yaml files?

Just an idea: If we always have a learning_role anyway and this learning_role does always have the learning_config - we could also just set the properties in the learning_role. Therefore, we can ditch LearningConfig and have world.learning_role.trained_policies_save_path instead of world.learning_role.learning_config.trained_policies_save_path

Yeah, I don't like the very long nesting either. But I felt like the config will be extended in the future, and I wanted to keep everything that is user-settable via the config in one place. It's still/already unpacked in the learning strategies, so I at least kept it centralized in the learning role. Before, all the settings were passed to the LearningStrategy as kwargs.
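
A rough sketch of that trade-off (all names are illustrative, not the actual ASSUME API): defaults live on one config object kept by the learning role, and each strategy unpacks only what it needs instead of receiving everything as kwargs.

```python
from dataclasses import dataclass


@dataclass
class LearningConfig:
    # defaults defined once, instead of redundantly in every strategy
    learning_rate: float = 1e-3
    gamma: float = 0.99
    trained_policies_save_path: str = "policies"


class LearningStrategy:
    def __init__(self, config: LearningConfig):
        # unpack only the settings this strategy actually uses
        self.learning_rate = config.learning_rate
        self.gamma = config.gamma


cfg = LearningConfig(learning_rate=5e-4)  # user overrides one value
strategy = LearningStrategy(cfg)
```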


jrasko commented Nov 28, 2025

@jrasko Will look into the cuda problem tomorrow! Could you please try removing the "and not self.learning_config.evaluation_mode" in the learning role?

@mthede yep, that's why. Removing this condition fixes the error.

mthede and others added 8 commits November 28, 2025 13:47
…ation options

1. No RL -> no learning_config -> no learning role
2. Single run with loaded RL strategies -> learning_mode: false and trained_policies_load_path is provided -> learning role, but no rl algorithm etc. necessary
3. Training run -> learning_mode: true
3.1. Training episodes
3.2. Evaluation episodes (config item evaluation_mode exists, but learning loop overwrites it anyway)
4. Continue learning -> continue_learning: true and trained_policies_load_path is provided
@mthede mthede force-pushed the learning_config_pr branch from de192fb to f2eeab2 Compare November 28, 2025 12:47
- add absolute change to early stopping criterion; have had that on my personal branch for a while now
@kim-mskw kim-mskw self-requested a review December 1, 2025 13:44

@kim-mskw kim-mskw left a comment


All the changes I wanted were discussed bilaterally.

with patch.object(Storage, "calculate_marginal_cost", return_value=10.0):
# Calculate bids using the strategy
bids = strategy.calculate_bids(
bids = strategy.calculate_bids( # TODO

What did this TODO say?
At least add a comment here?

@maurerle maurerle merged commit 095f342 into main Dec 2, 2025
9 checks passed
@maurerle maurerle deleted the learning_config_pr branch December 2, 2025 11:42