Create good Conformer baselines

On some internal data, and maybe also on Switchboard and LibriSpeech.

Use `nn.Conformer`. Make it somewhat more standard where possible, and then deviate from the standard where it makes sense.

Also compare it to earlier Conformer recipes and earlier BLSTM recipes. Make sure the conditions are sane for comparison, e.g. the same number of epochs.

When we have that, we should also change the `Conformer` defaults to something reasonable.

I think our earlier Conformer recipes (there are several variants floating around in our group...) are somewhat non-standard:
- Check the frontend. Sometimes we use a BLSTM, sometimes convolutions. Our conv-based frontends also differ from the standard one. However, the standard frontend probably uses too high dimensions, so that is maybe one thing to deviate from. Also see #219.
- Our earlier Conformer recipes used the old-style relative positional encoding, while the standard Conformer uses the relative positional encoding from [Transformer-XL](https://arxiv.org/pdf/1901.02860.pdf). This is already implemented and is the default in `nn.Conformer`, but we never really compared it systematically, and `nn.Conformer` itself is not really well tested yet. See [our wiki on relative positional encoding](https://github.com/rwth-i6/returnn_common/wiki/Relative-positional-encoding) for further references.
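
For reference when comparing the two positional encoding variants, here is a minimal NumPy sketch of how Transformer-XL-style attention scores with relative positional encoding can be computed for a single head. This is not the `nn.Conformer` implementation. The function names, the argument layout, and the direct indexing by distance (instead of the shift trick used in practice) are assumptions made only for illustration.

```python
import numpy as np


def sinusoidal_embedding(positions, dim):
    """Sinusoidal embedding for integer positions (may be negative) -> [len, dim].

    Assumes `dim` is even.
    """
    inv_freq = 1.0 / (10000 ** (np.arange(0, dim, 2) / dim))
    angles = positions[:, None] * inv_freq[None, :]
    return np.concatenate([np.sin(angles), np.cos(angles)], axis=-1)


def rel_pos_attention_scores(q, k, w_r, u, v):
    """Unnormalized single-head attention scores with relative positions.

    q, k: [T, d] queries and keys (hypothetical layout, no batch dim)
    w_r:  [d, d] projection applied to the relative position embedding
    u, v: [d] learned biases (content bias and position bias)
    returns: [T, T] scores, before softmax
    """
    T, d = q.shape
    # Relative distance i - j for every query position i and key position j.
    rel = np.arange(T)[:, None] - np.arange(T)[None, :]  # [T, T]
    # Embed and project all distinct distances -(T-1) .. (T-1).
    distances = np.arange(-(T - 1), T)
    r = sinusoidal_embedding(distances, d) @ w_r  # [2T-1, d]
    r_ij = r[rel + (T - 1)]  # [T, T, d]
    # Content-based terms: (q_i + u) . k_j
    content = (q + u) @ k.T
    # Position-based terms: (q_i + v) . r_{i-j}
    position = np.einsum("id,ijd->ij", q + v, r_ij)
    return (content + position) / np.sqrt(d)
```

The position term depends only on the distance `i - j`, not on the absolute positions, which is the property that makes this encoding relative.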

References, and related:
- [Example RETURNN config, using returnn-common models](https://github.com/rwth-i6/returnn_common/wiki/RETURNN-example-config).
- #235
- #234 
- #219
- #132
- #54 / #58
 
