
Add skip layer normalization #3865

Draft · wants to merge 9 commits into add_attention_contrib_op from add_skip_layer_normalization

Conversation

TedThemistokleous (Collaborator) commented Mar 4, 2025

Needed to support customer models. Seen in optimized versions of BERT and in models optimized with the ONNX Runtime toolset.

Currently built off the add_attention_contrib_op branch, since that branch is needed to parse an optimized BERT model.

Official doc here: https://github.com/microsoft/onnxruntime/blob/main/docs/ContribOperators.md#com.microsoft.SkipLayerNormalization

Other useful descriptions (this op vs. the simplified case):

https://pytorch.org/docs/stable/generated/torch.nn.RMSNorm.html#torch.nn.RMSNorm (also known as simplified)

https://sh-tsang.medium.com/review-layer-normalization-ln-6c2ae88bae47
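
For reference, a minimal numpy sketch of what SkipLayerNormalization computes per the contrib-op doc above (function and argument names here are illustrative, not the MIGraphX API):

```python
import numpy as np

def skip_layer_norm(x, skip, gamma, beta=None, bias=None, epsilon=1e-12):
    # Fused residual add: s = x + skip (+ bias), then LayerNorm over the last axis.
    s = x + skip
    if bias is not None:
        s = s + bias
    mean = s.mean(axis=-1, keepdims=True)
    var = s.var(axis=-1, keepdims=True)   # biased variance, matching LayerNorm
    inv_std = 1.0 / np.sqrt(var + epsilon)
    out = (s - mean) * inv_std * gamma
    if beta is not None:
        out = out + beta
    # Optional extra outputs per the contrib-op doc: mean, inverse std, and the fused sum.
    return out, mean, inv_std, s
```

The "simplified" (RMSNorm) variant in the first link drops the mean subtraction and beta, normalizing by the root mean square alone.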

@TedThemistokleous TedThemistokleous self-assigned this Mar 4, 2025
@TedThemistokleous TedThemistokleous added the Onnx Operators label (Adding or modifying an Onnx Operator in the MIGraphX codebase) Mar 4, 2025
@TedThemistokleous TedThemistokleous force-pushed the add_skip_layer_normalization branch from 94f3908 to 9709f30 on March 7, 2025 00:41
Ted Themistokleous added 3 commits March 7, 2025 01:38
…t to reflect this

Required that both the mean and variance outputs are float type and not converted. This works since the epsilon attribute is always float as well. Removes a bunch of extra converts when this operator is run at reduced precision (float16/bf16, etc.).
beta and bias were flipped. Sorted this out and updated the parser; captured the beta path in a parser test.
Needs to have values checked. Currently working out the data. Just grabbed the skip_simplied_layernorm portion and reused it while modifying tests to add in beta and bias values.

We should be performing the following operation to verify the data via numpy:

https://pytorch.org/docs/stable/generated/torch.nn.LayerNorm.html
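
A sketch of that verification, assuming we check expected test values against torch.nn.functional.layer_norm applied to the fused sum (shapes, seed, and epsilon here are made up for illustration):

```python
import numpy as np
import torch

rng = np.random.default_rng(0)
x = rng.standard_normal((2, 4, 8), dtype=np.float32)
skip = rng.standard_normal((2, 4, 8), dtype=np.float32)
bias = rng.standard_normal(8, dtype=np.float32)
gamma = rng.standard_normal(8, dtype=np.float32)
beta = rng.standard_normal(8, dtype=np.float32)
eps = 1e-5

s = x + skip + bias
# Reference result from torch (the LayerNorm doc linked above)
expected = torch.nn.functional.layer_norm(
    torch.from_numpy(s), (8,),
    weight=torch.from_numpy(gamma),
    bias=torch.from_numpy(beta), eps=eps).numpy()

# Same computation in plain numpy
mean = s.mean(axis=-1, keepdims=True)
var = s.var(axis=-1, keepdims=True)
got = (s - mean) / np.sqrt(var + eps) * gamma + beta

assert np.allclose(got, expected, atol=1e-5)
```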
@turneram turneram deleted the branch add_attention_contrib_op March 7, 2025 15:48
@turneram turneram closed this Mar 7, 2025
@turneram turneram deleted the add_skip_layer_normalization branch March 7, 2025 15:48
@TedThemistokleous TedThemistokleous restored the add_skip_layer_normalization branch March 7, 2025 15:50
@TedThemistokleous TedThemistokleous added the roadmap label (Tasks to finish for a release) Mar 7, 2025