This work would not have been possible without cloud resources provided by Google's TPU Research Cloud (TRC) program. I also thank the TRC support team for quickly resolving whatever issues I had: you're awesome!
This is a PyTorch implementation of the LEAF audio frontend [1], made using the official TensorFlow implementation as a direct reference.
This implementation supports training on TPUs using `torch-xla`.
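For reference, a typical `torch-xla` training step looks like the following. This is a generic sketch of standard `torch-xla` usage, not this repo's training loop; the `nn.Linear` stand-in model and dummy batch are purely illustrative.

```python
import torch
import torch.nn as nn
import torch_xla.core.xla_model as xm

device = xm.xla_device()                   # acquire a TPU core as a torch device
model = nn.Linear(64, 10).to(device)       # stand-in model, not the repo's classifier
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()

x = torch.randn(8, 64, device=device)      # dummy batch
y = torch.randint(0, 10, (8,), device=device)

optimizer.zero_grad()
loss_fn(model(x), y).backward()
xm.optimizer_step(optimizer)               # all-reduce grads (if replicated) + step
xm.mark_step()                             # cut the XLA graph and execute it
```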
- Will be evaluated on the AudioSet, SpeechCommands and VoxCeleb1 datasets, and pretrained weights will be made available.
- Currently, `torch-xla` has some issues with certain `complex64` operations: `torch.view_as_real(comp)`, `comp.real`, `comp.imag`, as highlighted in issue #3070. These are used primarily for generating Gabor impulse responses. To bypass this shortcoming, an alternate implementation using manual complex-number operations is provided (see the sketch after this list).
- Matched performance on SpeechCommands; experiments on other datasets are ongoing.
- More details and commands to replicate the experiments will be added shortly.
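To illustrate the workaround, here is a minimal sketch of computing a Gabor impulse response with explicit real/imaginary parts instead of `complex64` tensors. The function name, signature, and shapes are illustrative assumptions, not the repo's exact code.

```python
import torch

def gabor_impulse_response_manual(t: torch.Tensor,
                                  center: torch.Tensor,
                                  fwhm: torch.Tensor) -> torch.Tensor:
    """Real/imag parts of Gabor filters, with no complex dtypes (hypothetical).

    t: (kernel_size,) time indices; center, fwhm: (num_filters,).
    Returns (num_filters, kernel_size, 2), matching torch.view_as_real's layout.
    """
    # Gaussian envelope exp(-t^2 / (2 * fwhm^2)), one row per filter
    envelope = torch.exp(-torch.outer(1.0 / (2.0 * fwhm ** 2), t ** 2))
    # exp(1j * center * t) expanded via Euler's formula: cos + i*sin,
    # so no complex64 tensor is ever materialized
    phase = torch.outer(center, t)
    real = envelope * torch.cos(phase)
    imag = envelope * torch.sin(phase)
    return torch.stack([real, imag], dim=-1)
```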
```
torch >= 1.9.0
torchaudio >= 0.9.0
torch-audiomentations==0.9.0
SoundFile==0.10.3.post1
[Optional] torch_xla == 1.9
```
Additional dependencies include [WavAugment](https://github.com/facebookresearch/WavAugment).
All experiments on VoxCeleb1 and SpeechCommands were repeated at least 5 times, and 95% confidence intervals are reported.
Model | Dataset | Metric | Features | Official | This repo | Weights |
---|---|---|---|---|---|---|
EfficientNet-b0 | SpeechCommands v2 | Accuracy | LEAF | 93.4±0.3 | 94.5±0.3 | ckpt |
ResNet-18 | SpeechCommands v2 | Accuracy | LEAF | N/A | 94.05±0.3 | ckpt |
EfficientNet-b0 | VoxCeleb1 | Accuracy | LEAF | 33.1±0.7 | 40.9±1.8 | ckpt |
ResNet-18 | VoxCeleb1 | Accuracy | LEAF | N/A | 44.7±2.9 | ckpt |
- ResNet-18 likely works better on VoxCeleb1 simply because VoxCeleb1 is a more difficult task than SpeechCommands and ResNet-18 has more parameters.
To evaluate how non-Mel initialization schemes for `complex_conv` perform, experiments were repeated with the `xavier_normal`, `kaiming_normal` and `randn` init schemes on the SpeechCommands dataset (a sketch of how such re-initialization might look follows the table below).
Model | Features | Init | Test Accuracy |
---|---|---|---|
EfficientNet-b0 | LEAF | Default (Mel) | 94.5±0.3 |
EfficientNet-b0 | LEAF | randn | 84.7±1.6 |
EfficientNet-b0 | LEAF | kaiming_normal | 84.7±2.3 |
EfficientNet-b0 | LEAF | xavier_normal | 79.1±0.7 |
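For context, swapping the initializer amounts to re-initializing the frontend's filter parameters before training. A hypothetical helper might look like this; the `complex_conv` attribute and its `weight` are assumptions about the module layout, not the repo's verified API.

```python
import torch
import torch.nn as nn

def reinit_complex_conv(frontend: nn.Module, scheme: str = "xavier_normal"):
    # Hypothetical helper: `frontend.complex_conv.weight` is an assumption
    # about where this repo stores the learnable Gabor filter parameters.
    weight = frontend.complex_conv.weight
    with torch.no_grad():
        if scheme == "xavier_normal":
            nn.init.xavier_normal_(weight)
        elif scheme == "kaiming_normal":
            nn.init.kaiming_normal_(weight)
        elif scheme == "randn":
            weight.normal_()
        else:
            raise ValueError(f"unknown init scheme: {scheme}")
```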
- Download and extract the desired ckpt from Results.
```python
import os
import pickle

import torch

from models.classifier import Classifier

results_dir = "<path to results folder>"
hparams_path = os.path.join(results_dir, "hparams.pickle")
ckpt_path = os.path.join(results_dir, "ckpts", "<checkpoint.pth>")

# load the checkpoint and the hyperparameters it was trained with
checkpoint = torch.load(ckpt_path)
with open(hparams_path, "rb") as fp:
    hparams = pickle.load(fp)

# build the classifier from the saved config and restore its weights;
# printing the result shows missing/unexpected keys, if any
model = Classifier(hparams.cfg)
print(model.load_state_dict(checkpoint["model_state_dict"]))

# to access just the pretrained LEAF frontend
frontend = model.features
```
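Assuming 16 kHz mono input (a guess at the expected input shape, to be checked against the repo), extracting features with the loaded frontend would then look like:

```python
# dummy batch of four 1-second mono clips at 16 kHz (assumed input layout)
wav = torch.randn(4, 1, 16000)
with torch.no_grad():
    feats = frontend(wav)
print(feats.shape)  # time-frequency features produced by the LEAF frontend
```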
[1] If you use this repository, kindly cite the LEAF paper:
```bibtex
@article{zeghidour2021leaf,
  title={LEAF: A Learnable Frontend for Audio Classification},
  author={Zeghidour, Neil and Teboul, Olivier and de Chaumont Quitry, F{\'e}lix and Tagliasacchi, Marco},
  journal={ICLR},
  year={2021}
}
```
Please also consider citing this implementation using the citation widget in the sidebar.