-
Notifications
You must be signed in to change notification settings - Fork 4
Open
Milestone
Description
On some internal data, and maybe also Switchboard and Librispeech.
Using nn.Conformer
. Making it somewhat more standard if possible, and then deviate from it when it makes sense.
Also compare it to earlier Conformer recipes, and earlier BLSTM recipes. Make sure the conditions are sane for comparison, e.g. same number of epochs.
When we have that, we should also change our the Conformer
defaults to sth reasonable.
I think our earlier Conformer recipes (there are several variants floating around in our group...) are somewhat non-standard:
- Check the frontend. Sometimes we use BLSTM, sometimes convolutions. Our conv-based frontends are also different to what is standard. Although the standard frontend probably uses too high dimensions, so that is maybe one thing to deviate from. Also see Conformer frontend should fix dimensions, be more standard #219.
- Our earlier Conformer recipes used the old-style relative pos encoding, while the standard Conformer uses the rel pos enc from Transformer-XL. This is already implemented and the default in
nn.Conformer
but we never really compared it systematically, and also thenn.Conformer
is not really well tested yet. See our wiki on relative positional encoding for further references.
References, and related:
- Example RETURNN config, using returnn-common models.
- Better naming of different relative positional encoding schemes #235
- GenericSelfAttention, biases are inconsistent to SelfAttentionLayer #234
- Conformer frontend should fix dimensions, be more standard #219
- Conformer misses relative pos encoding #132
- Implement standard Conformer encoder #54 / Conformer Modules #58
Metadata
Metadata
Assignees
Labels
No labels