docs/supported_metrics.md
4 additions & 3 deletions
@@ -55,8 +55,9 @@ We include x mark if the metric is auto-installed in versa.
| 48 | x | DNSMOS Pro: A Reduced-Size DNN for Probabilistic MOS of Speech | pseudo_mos | dnsmos_pro_bvcc |[DNSMOSPro](https://github.com/fcumlin/DNSMOSPro/tree/main)|[paper](https://www.isca-archive.org/interspeech_2024/cumlin24_interspeech.html)|
| 49 | x | DNSMOS Pro: A Reduced-Size DNN for Probabilistic MOS of Speech | pseudo_mos | dnsmos_pro_nisqa |[DNSMOSPro](https://github.com/fcumlin/DNSMOSPro/tree/main)|[paper](https://www.isca-archive.org/interspeech_2024/cumlin24_interspeech.html)|
| 50 | x | DNSMOS Pro: A Reduced-Size DNN for Probabilistic MOS of Speech | pseudo_mos | dnsmos_pro_vcc2018 |[DNSMOSPro](https://github.com/fcumlin/DNSMOSPro/tree/main)|[paper](https://www.isca-archive.org/interspeech_2024/cumlin24_interspeech.html)|
-| 51 | x | VQScore (Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech) | vqscore | vqscore |[VQScore](https://github.com/JasonSWFu/VQscore)|[paper](https://arxiv.org/abs/2402.16321)|
| 53 | x | VQScore (Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech) | vqscore | vqscore |[VQScore](https://github.com/JasonSWFu/VQscore)|[paper](https://arxiv.org/abs/2402.16321)|
### Dependent Metrics
@@ -68,7 +69,7 @@ We include x mark if the metric is auto-installed in versa.
| 4 | x | Signal-to-interference Ratio (SIR) | signal_metric | sir |[espnet](https://github.com/espnet/espnet)| - |
| 5 | x | Signal-to-artifact Ratio (SAR) | signal_metric | sar |[espnet](https://github.com/espnet/espnet)| - |
| 6 | x | Signal-to-distortion Ratio (SDR) | signal_metric | sdr |[espnet](https://github.com/espnet/espnet)| - |
-| 7 | x | Convolutional scale-invariant signal-to-distortion ratio (CI-SDR) | signal_metric | ci-sdr |[ci_sdr](https://github.com/fgnt/ci_sdr)|[paper](https://arxiv.org/abs/2011.15003)|
+| 7 | x | Convolutional scale-invariant signal-to-distortion ratio (CI-SDR) | signal_metric | ci-sdr |[ci_sdr](https://github.com/fgnt/ci_sdr)|[paper](https://arxiv.org/abs/2011.15003) |
| 8 | x | Scale-invariant signal-to-noise ratio (SI-SNR) | signal_metric | si-snr |[espnet](https://github.com/espnet/espnet)|[paper](https://arxiv.org/abs/1711.00541)|
| 9 | x | Perceptual Evaluation of Speech Quality (PESQ) | pesq | pesq |[pesq](https://pypi.org/project/pesq/)|[paper](https://ieeexplore.ieee.org/document/941023)|
| 10 | x | Short-Time Objective Intelligibility (STOI) | stoi | stoi |[pystoi](https://github.com/mpariente/pystoi)|[paper](https://ieeexplore.ieee.org/document/5495701)|