add MHSA module #7
Conversation
Why do we need this at all?
Again, as in the other PRs (#4, #6), I would keep this consistent with our existing implementation (see rwth-i6/returnn_common#58 for the initial code, but then also check the current code, also in the RETURNN frontend, and also check with @mmz33). We discussed many things about the naming of modules and also about their behavior.
In this specific case, you would use a standard generic RelPosSelfAttention / RelPositionMultiHeadedAttention, or the standard PyTorch MultiheadAttention.
Also, this module here would not add the layer-norm, the residual connection, or dropout. That would all be handled by the outer module (ConformerEncoderLayer).
So, rather than reimplementing the already existing MultiheadAttention, what is missing is something like RelPositionMultiHeadedAttention. We have that in RETURNN-common / RETURNN-frontend already, so I would suggest to just follow that implementation, unless there are good reasons to do something differently. But I would keep at least the naming, arguments, and variable names consistent.
Some tests should also be added, similar to #4.
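For concreteness, a minimal sketch of the split suggested here could look as follows. The class and argument names (ConformerMHSA, ConformerEncoderLayer, att_dropout, ...) are illustrative assumptions, not the actual RETURNN-common / RETURNN-frontend API, and the relative-positional variant is omitted:

```python
import torch
from torch import nn


class ConformerMHSA(nn.Module):
    """Hypothetical sketch of the attention sub-module only: no layer-norm,
    no residual connection, no output dropout. Names and arguments are
    illustrative, not the actual project API."""

    def __init__(self, embed_dim: int, num_heads: int, att_dropout: float = 0.1):
        super().__init__()
        # Standard PyTorch attention; a RelPositionMultiHeadedAttention would
        # take this place if relative positional encoding is wanted.
        self.attn = nn.MultiheadAttention(
            embed_dim, num_heads, dropout=att_dropout, batch_first=True
        )

    def forward(self, x: torch.Tensor, key_padding_mask: torch.Tensor = None) -> torch.Tensor:
        # x: [batch, time, embed_dim]; key_padding_mask: [batch, time], True = padded.
        out, _ = self.attn(x, x, x, key_padding_mask=key_padding_mask, need_weights=False)
        return out


class ConformerEncoderLayer(nn.Module):
    """Sketch of the outer module that owns layer-norm, dropout and the residual."""

    def __init__(self, embed_dim: int, num_heads: int, dropout: float = 0.1):
        super().__init__()
        self.mhsa_layer_norm = nn.LayerNorm(embed_dim)
        self.mhsa = ConformerMHSA(embed_dim, num_heads)
        self.dropout = nn.Dropout(dropout)
        # Feed-forward and convolution blocks of the Conformer layer are omitted here.

    def forward(self, x: torch.Tensor, key_padding_mask: torch.Tensor = None) -> torch.Tensor:
        # Pre-norm -> attention -> dropout -> residual, all handled outside the MHSA module.
        residual = x
        x = self.mhsa_layer_norm(x)
        x = self.mhsa(x, key_padding_mask=key_padding_mask)
        x = self.dropout(x)
        return residual + x
```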
Please see comments. Some minor things, but also some more fundamental ones.
Regarding the failing test: I think you need to add torch to the requirements.txt, since it's not done in main yet.
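For reference, a minimal requirements.txt entry could look like this (whether and how to pin a version is an assumption, not something decided in this thread):

```text
# requirements.txt
torch
```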
Only one comment left from my side
…umentation on key_padding_mask
pretty much the same as in torchaudio.models.conformer.
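As an illustration of the key_padding_mask convention used by torch.nn.MultiheadAttention (and, by assumption, the same convention a torchaudio-style Conformer follows): it is a boolean [batch, time] tensor where True marks padded positions to be ignored, typically derived from per-sequence lengths:

```python
import torch

# Hypothetical example: build key_padding_mask from per-sequence lengths.
# Shape [batch, time]; True marks padded frames that attention should ignore.
lengths = torch.tensor([5, 3, 4])
max_time = int(lengths.max())
key_padding_mask = torch.arange(max_time)[None, :] >= lengths[:, None]
# key_padding_mask[b, t] is True for t >= lengths[b]
```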