feat(FLCE): expose `accum_dtype` for hf model monkey patch #851

Tcc0403 · 2025-08-12T12:46:33Z

Summary

This PR follows up on #830 by exposing the `accum_dtype` option in the Hugging Face model monkey-patch functions.
All bf16 convergence tests that involve fused linear cross entropy (FLCE) now run with `accum_dtype=torch.float32` for numerical stability.
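
To illustrate why a wider accumulation dtype can matter for bf16 runs, here is a minimal sketch that uses only the Python standard library. It is not Liger Kernel code: `to_bf16` and `accumulate` are hypothetical helpers, bfloat16 is emulated by rounding float32 bit patterns, and a Python float (float64) stands in for the wider accumulator.

```python
import struct


def to_bf16(x: float) -> float:
    """Emulate bfloat16 by rounding the float32 bit pattern to its top 16 bits
    (round-to-nearest-even). Hypothetical helper for illustration only."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    bits = (((bits + rounding_bias) >> 16) << 16) & 0xFFFFFFFF
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def accumulate(values, round_fn):
    """Sum values, applying round_fn to the running total after every add."""
    total = 0.0
    for v in values:
        total = round_fn(total + v)
    return total


# Many small addends, each already representable in bfloat16.
values = [to_bf16(0.001)] * 4096

# Accumulator kept in (emulated) bfloat16: once the running total is large
# enough, each addend falls below half a ULP and is rounded away.
bf16_sum = accumulate(values, to_bf16)

# Accumulator kept in a wider type: the small addends are preserved.
wide_sum = accumulate(values, lambda x: x)

print(f"bf16 accumulator: {bf16_sum}")
print(f"wide accumulator: {wide_sum}")
```

The low-precision total stalls well below the wide total, which is the kind of drift that accumulating in float32 is meant to avoid.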

Related: #512, #742, #827, #850

Testing Done

Hardware Type:
- run `make test` to ensure correctness
- run `make checkstyle` to ensure code style
- run `make test-convergence` to ensure convergence

Signed-off-by: Tcc0403 <[email protected]>

Tcc0403 force-pushed the tcc/accum_dtype_monkey_patch branch from e0ca830 to 9e0456e Compare August 12, 2025 12:47

feat(FLCE): expose accum_dtype for hf model monkey patch

a0b849e

Signed-off-by: Tcc0403 <[email protected]>

Tcc0403 force-pushed the tcc/accum_dtype_monkey_patch branch from 9e0456e to a0b849e Compare August 12, 2025 12:49

shimizust approved these changes Aug 12, 2025


shimizust merged commit 90d66ce into main Aug 12, 2025
3 of 7 checks passed

shimizust deleted the tcc/accum_dtype_monkey_patch branch August 12, 2025 20:57
