Skip to content
View roedoejet's full-sized avatar

Highlights

  • Pro

Organizations

@nrc-cnrc @EveryVoiceTTS

Block or report roedoejet

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 5,483 498 Updated Aug 10, 2024

Implementation of F5-TTS in MLX

Python 485 49 Updated Feb 2, 2025

A simple, hackable text-to-speech system in PyTorch and MLX

Python 122 11 Updated Feb 23, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 359 18 Updated Feb 13, 2025

TTS with kokoro and onnx runtime

Python 1,640 150 Updated Feb 15, 2025

An implementation of XLS-R automatic speech recognition as a recognizer for ELAN

Python 7 Updated Feb 14, 2023

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,960 194 Updated Feb 25, 2025
JavaScript 1 Updated Aug 27, 2024

The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these factors with real speech and noise datasets.

Python 29 1 Updated Dec 2, 2024

Inference and training library for high-quality TTS models.

Python 5,042 527 Updated Dec 10, 2024
Python 348 53 Updated Sep 3, 2024

Predicts the level of noise and reverberation on your audiofiles

Jupyter Notebook 144 25 Updated May 22, 2024

JSON-LD processor written in Python

Python 619 133 Updated May 10, 2024

VS Code extension that allows you to preview and play audio files.

TypeScript 150 16 Updated Jul 15, 2024

The EveryVoice TTS Toolkit - Text To Speech for your language

Python 24 2 Updated Feb 14, 2025

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Python 522 64 Updated Oct 26, 2024

A fast, local neural text to speech system

C++ 7,955 592 Updated Oct 21, 2024

Modern, extensible Python project management

Python 6,370 323 Updated Feb 1, 2025

Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models

Python 148 15 Updated Dec 18, 2023

A python package for grapheme aware string handling

Python 110 7 Updated Mar 21, 2022

Simple, safe way to store and distribute tensors

Python 3,133 220 Updated Feb 25, 2025

IPA tokeniser

Python 15 2 Updated Apr 7, 2024

This repository implements SummaryMixing, a simpler, faster and much cheaper replacement to self-attention for automatic speech recognition (see: https://arxiv.org/abs/2307.07421). The code is read…

Python 117 11 Updated Sep 17, 2024

g2p: English Grapheme To Phoneme Conversion

Python 837 129 Updated Jan 5, 2023

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,201 2,699 Updated Feb 25, 2025

Domain-specific programming language for linguistic grammars and transducers — Langage dédié pour les grammaires linguistiques et les transducteurs.

TypeScript 13 1 Updated Feb 20, 2025

AI-based Audio Watermarking Tool

Python 247 33 Updated Jan 7, 2024

open source knowledge for Syllabics font design and development

5 Updated Nov 13, 2024

Official implementation of "Separate Anything You Describe"

Python 1,686 120 Updated Nov 26, 2024
Python 350 66 Updated Sep 12, 2023
Next
Showing results