Skip to content

Conversation

@BoxiangW
Copy link
Contributor

Added MLA and MHA(GQA) clipping support

Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
@BoxiangW BoxiangW requested review from a team as code owners October 24, 2025 22:00
@copy-pr-bot
Copy link

copy-pr-bot bot commented Oct 24, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@BoxiangW BoxiangW self-assigned this Oct 24, 2025
@BoxiangW
Copy link
Contributor Author

TE's NVIDIA/TransformerEngine#2195 (2.9.0) is needed for this PR

Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
Signed-off-by: Boxiang Wang <[email protected]>
@skyw
Copy link
Contributor

skyw commented Oct 28, 2025

TE's NVIDIA/TransformerEngine#2195 (2.9.0) is needed for this PR

It has been merged.

@BoxiangW BoxiangW added Expert Review Apply this label to indicate that your PR is ready for expert review. Run tests labels Oct 31, 2025
@BoxiangW BoxiangW added this to the Core 0.15 milestone Oct 31, 2025
@BoxiangW
Copy link
Contributor Author

BoxiangW commented Nov 3, 2025

/ok to test 7917e68

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Expert Review Apply this label to indicate that your PR is ready for expert review. Run tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants