Release 22/inconsistent device tensor action in trainers [Don't Merge] #6225

Jkho80 · 2025-07-20T06:27:12Z

Proposed change(s)

Hi,

While working with mlagents-learn and running my environment with --torch-device cuda, I encountered multiple runtime errors such as:

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!

After investigating the root cause, I found that several tensor operations in the codebase implicitly rely on default CPU-based tensors. These issues are often introduced when:

Creating tensors directly with torch.tensor() Using NumPy-based Parameters or Variable without specifying device (e.g., torch.tensor(numpy_value) instead of torch.from_numpy(numpy_value).to(device)).
Combining GPU-based tensors with CPU-based masks or constants Computation.

This is especially problematic in scenarios where ML-Agents interacts with compute shaders or when training with Unity in GPU mode, as values originating from RAM (via NumPy) default to CPU memory and cause device mismatches in PyTorch computations.

What I did:
To address this, I:

Identified common points where tensors were created without explicit device assignment.

Ensured that all relevant tensors (especially masks, constants, and externally created inputs) are moved to the correct device using .to(device) based on the context tensor.

This should make training on CUDA more stable and prevent errors due to cross-device tensor operations.

Thanks for the great work on ML-Agents!

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

None

Types of change(s)

Checklist

Added tests that prove my fix is effective or that my feature works
Updated the changelog (if applicable)
Updated the documentation (if applicable)
Updated the migration guide (if applicable)

Other comments

mlagents-learn RobotReacher.yaml --run-id robot01 --torch-device cuda --force --debug

Error and Success Run Log

This reverts commit d67dc94.

…gies#6145) * Update PerformancProject and DevProject. * Removed mac perf tests.

… bokken image.

…ts into release/3.0.0

CLAassistant · 2025-07-20T06:27:20Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
2 out of 4 committers have signed the CLA.

✅ Jkho80
✅ miguelalonsojr
❌ Aurimas Petrovas
❌ AlexRibard

Aurimas Petrovas seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

cycode-security · 2025-07-20T06:27:21Z

ml-agents/setup.py

❗Cycode: Security vulnerabilities found in newly introduced dependency.

Ecosystem PyPI

Dependency grpcio

Dependency Paths grpcio 1.48.2

Direct Dependency Yes

The following vulnerabilities were introduced:

GHSA CVE Severity Fixed Version

GHSA-496j-2rq6-j6cc CVE-2023-33953 HIGH 1.53.2

GHSA-6628-q6j9-w8vg CVE-2023-1428 HIGH 1.53.0

GHSA-cfgp-2977-2fmm CVE-2023-32731 HIGH 1.53.0

Highest fixed version: 1.53.2

Description

Detects when new vulnerabilities affect your dependencies.

Tell us how you wish to proceed using one of the following commands:

Tag Short Description

#cycode_vulnerable_package_fix_this_violation Fix this violation via a commit to this branch

#cycode_ignore_manifest_here <reason> Applies to this manifest in this request only

⚠️ When commenting on Github, you may need to refresh the page to see the latest updates.

⚠️ Due to API limitations, we can not comment on the exact line (58)

AlexRibard and others added 30 commits August 28, 2024 15:39

adding wrench

241b919

correct build path

c4668ac

release branch and 6.0 target

205396d

XmlDoc update

fc6ca44

adressing xml docs

66d1508

more docs

d8e30fa

updating the release

3ce9849

test xmldoc fixes

eb3b742

more xml doc fixes

f82f8cd

Uncompress the 3DBall sample

d67dc94

Fix API documentation

d93a5fc

more xml doc fixes

3008e79

Revert "Uncompress the 3DBall sample"

2ee26c1

This reverts commit d67dc94.

reformat MaxStep xml

401418f

more xml doc fixes

a1e3bc9

fix more xml doc issues

5a7535e

fix summary tag

430a1b2

Updated changelog for missing PRs.

8fd8460

Removed tabs from .tests.json.

799f8f0

Updated changelog.

47da469

Removed tabs from CHANGELOG.

b658a48

Fix failing ci post upgrade (Unity-Technologies#6141) (Unity-Technolo…

038e99b

…gies#6145) * Update PerformancProject and DevProject. * Removed mac perf tests.

Removing standalone tests dep from wrench packaging.

f74208d

Fixed package works issues. Updated com.unity.ml-agents.md.

2fa290c

Updated com.unity.ml-agents.md.

2ab6c24

Updated package version in Academy.cs

1612e55

Adding back in package pack deps.

0d3c10d

Updated package pack testing deps..

d374fa1

Regenerated wrench ymls.

1a0e031

License update.

9ecb3bd

miguelalonsojr and others added 18 commits September 24, 2024 08:34

Extensions License update.

67c7a27

Another license tweak.

900555d

Another license tweak.

d9ea8f2

Upgraded to sentis 2.1.0.

0874236

Merge branch 'develop' into release/3.0.0

54db12e

Updated standalone yamato build test to using new ml-agents ubuntu ci…

86814f4

… bokken image.

Merge branch 'release/3.0.0' of github.com:Unity-Technologies/ml-agen…

9ac55b2

…ts into release/3.0.0

Bumped python and extensions package versions.

e9e3601

Changed ci image for pytest gpu yamato test.

c683e77

Changed default cuda dtype to torch.float32.

54664fc

Updated version validation and extensions version.

e70d023

Fixed failing GPU test.

9d34978

Fixed failing GPU test.

24485ed

Updated readme table and make_readme_table.py

501ba88

Updated publish to pypi gha.

200fe54

Fixed Mixed CPU, GPU Computation Error

4c02a9c

Tiny change on dependencies.

3962bab

undo change on ml-agents/setup.py

9fba4eb

cycode-security bot reviewed Jul 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Release 22/inconsistent device tensor action in trainers [Don't Merge] #6225

Release 22/inconsistent device tensor action in trainers [Don't Merge] #6225

Uh oh!

Jkho80 commented Jul 20, 2025

Uh oh!

CLAassistant commented Jul 20, 2025 •

edited

Loading

Uh oh!

cycode-security bot Jul 20, 2025

Uh oh!

Uh oh!


Ecosystem	PyPI
Dependency	`grpcio`
Dependency Paths	`grpcio 1.48.2`
Direct Dependency	Yes

GHSA	CVE	Severity	Fixed Version
GHSA-496j-2rq6-j6cc	CVE-2023-33953	HIGH	1.53.2
GHSA-6628-q6j9-w8vg	CVE-2023-1428	HIGH	1.53.0
GHSA-cfgp-2977-2fmm	CVE-2023-32731	HIGH	1.53.0

Tag	Short Description
#cycode_vulnerable_package_fix_this_violation	Fix this violation via a commit to this branch
#cycode_ignore_manifest_here <reason>	Applies to this manifest in this request only

Release 22/inconsistent device tensor action in trainers [Don't Merge] #6225

Are you sure you want to change the base?

Release 22/inconsistent device tensor action in trainers [Don't Merge] #6225

Uh oh!

Conversation

Jkho80 commented Jul 20, 2025

Proposed change(s)

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

Types of change(s)

Checklist

Other comments

Uh oh!

CLAassistant commented Jul 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cycode-security bot Jul 20, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

CLAassistant commented Jul 20, 2025 •

edited

Loading