How to adapt knowledge-distillation to the ASR classes? #2041
Replies: 3 comments
Actually, EncDecCTCModel and EncDecCTCBPEModel are LightningModules, and you can call forward just like in the example you provided; they simply have some extra methods to support more capabilities. You should be able to use the same trick from that example with these two models, and if you follow that approach, I don't think you need to worry about the data processing pipeline. You may start by creating a new class that inherits from EncDecCTCModel or EncDecCTCBPEModel and overrides __init__ and training_step. For Conformer or Citrinet, the BPE-based models should be better in terms of accuracy.
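As a concrete illustration of that suggestion (not an official NeMo recipe), here is a minimal, untested sketch of such a subclass for a CTC model. It assumes the NeMo 1.x-style API where forward(input_signal=..., input_signal_length=...) returns (log_probs, encoded_len, predictions) and a training batch is (signal, signal_len, transcript, transcript_len). The teacher checkpoint name is just an example, and `distill_alpha` plus the plain KL term are illustrative placeholders, not NeMo features:

```python
import torch
import torch.nn.functional as F
from omegaconf import DictConfig
from nemo.collections.asr.models import EncDecCTCModel


class DistilEncDecCTCModel(EncDecCTCModel):
    """Hypothetical student model that adds a distillation term to the usual CTC loss."""

    def __init__(self, cfg: DictConfig, trainer=None):
        super().__init__(cfg=cfg, trainer=trainer)
        # Frozen teacher; restore_from(...) would work for a local .nemo file instead.
        self.teacher = EncDecCTCModel.from_pretrained(model_name="stt_en_conformer_ctc_large")
        self.teacher.freeze()
        self.distill_alpha = 0.5  # weight between CTC loss and the KD term (placeholder)

    def training_step(self, batch, batch_idx):
        signal, signal_len, transcript, transcript_len = batch

        # Student forward pass: the model's own preprocessor/augmentation runs inside
        # forward(), so we only hand it the raw audio batch.
        log_probs, encoded_len, _ = self.forward(
            input_signal=signal, input_signal_length=signal_len
        )

        # Regular CTC loss, same call as in the parent class.
        ctc_loss = self.loss(
            log_probs=log_probs,
            targets=transcript,
            input_lengths=encoded_len,
            target_lengths=transcript_len,
        )

        # Teacher forward pass (no gradients). forward() returns log-softmax outputs,
        # so a temperature would require access to pre-softmax logits; here we use a
        # plain frame-level KL divergence instead.
        with torch.no_grad():
            t_log_probs, _, _ = self.teacher(
                input_signal=signal, input_signal_length=signal_len
            )

        # Assumes teacher and student share the vocabulary and frame rate; otherwise
        # the two sequences would need to be aligned or projected first.
        min_t = min(log_probs.shape[1], t_log_probs.shape[1])
        kd_loss = F.kl_div(
            log_probs[:, :min_t, :], t_log_probs[:, :min_t, :],
            log_target=True, reduction="batchmean",
        )

        loss = (1.0 - self.distill_alpha) * ctc_loss + self.distill_alpha * kd_loss
        self.log("train_loss", loss)
        self.log("ctc_loss", ctc_loss)
        self.log("kd_loss", kd_loss)
        return loss
```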
Thanks a lot! I'll give it a try!
I would suggest also reading through the forward step plus the train/validation steps. For RNNT in particular, it is not sufficient to just call forward: a lot of extra work happens after that inside the train and validation steps.
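To see why, here is a simplified view of how the transducer models structure their training step: forward() only runs the preprocessor and acoustic encoder, while the prediction (decoder) network, the joint network, and the transducer loss are applied afterwards inside training_step. The sketch below is illustrative, loosely follows NeMo's EncDecRNNTBPEModel from the 1.x releases, and omits details such as the fused-loss path and WER logging; check the current source for the exact signatures:

```python
from nemo.collections.asr.models import EncDecRNNTBPEModel


class DistilRNNTModel(EncDecRNNTBPEModel):
    """Hypothetical subclass showing where extra (e.g. distillation) terms would go."""

    def training_step(self, batch, batch_idx):
        signal, signal_len, transcript, transcript_len = batch

        # 1) forward() only runs the preprocessor + acoustic encoder.
        encoded, encoded_len = self.forward(
            input_signal=signal, input_signal_length=signal_len
        )

        # 2) The prediction (decoder) network runs on the transcripts.
        decoder_out, target_length, _ = self.decoder(
            targets=transcript, target_length=transcript_len
        )

        # 3) The joint network combines both streams, then the transducer loss.
        joint = self.joint(encoder_outputs=encoded, decoder_outputs=decoder_out)
        loss = self.loss(
            log_probs=joint, targets=transcript,
            input_lengths=encoded_len, target_lengths=target_length,
        )

        # Any distillation term (e.g. against a teacher's joint output) would have to
        # be added here, after all of these steps, not after forward() alone.
        self.log("train_loss", loss)
        return loss
```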
Thanks for this great toolkit!
I'm seriously looking at implementing some knowledge-distillation techniques for the ASR classes. I found this very simple and elegant pytorch-lightning approach here: https://github.com/vrvlive/knowlege-distillation/blob/master/training_module.py and I wonder how I could adapt it to the EncDecCTCModel or EncDecCTCBPEModel classes (as well as to the RNN-T or Conformer cases). I would like a pointer or two, if possible, since the NeMo classes aren't trivial pl.LightningModules. What "worries" me the most is how the audio preprocessing pipeline would affect simply calling forward(), for example. Where should I start?