Skip to content

Conversation

@Tcc0403
Copy link
Collaborator

@Tcc0403 Tcc0403 commented Aug 12, 2025

Summary

This PR is a follow-up to #830, exposing accum_dtype option for monkey patch functions.
All bf16 convergence tests related to fused linear cross entropy are also enforced to run with accum_dtype=torch.float32 for numerical stability.

Related: #512, #742, #827, #850

Testing Done

  • Hardware Type:
  • run make test to ensure correctness
  • run make checkstyle to ensure code style
  • run make test-convergence to ensure convergence

@Tcc0403 Tcc0403 force-pushed the tcc/accum_dtype_monkey_patch branch from e0ca830 to 9e0456e Compare August 12, 2025 12:47
@Tcc0403 Tcc0403 force-pushed the tcc/accum_dtype_monkey_patch branch from 9e0456e to a0b849e Compare August 12, 2025 12:49
@shimizust shimizust merged commit 90d66ce into main Aug 12, 2025
3 of 7 checks passed
@shimizust shimizust deleted the tcc/accum_dtype_monkey_patch branch August 12, 2025 20:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants