v0.16.0 #433
ZachNagengast
announced in
Announcements
v0.16.0
#433
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Highlights
This release introduces TTSKit - a brand-new optional library that brings high-quality text-to-speech capabilities on-device using the latest CoreML features such as
MLStateandMLTensorsfor optimal inference on the Apple Neural Engine.With this first release, we're launching Qwen3-TTS CustomVoice models 0.6b and 1.7b with instruction control, with more to come in future releases (including voice cloning).
Download, load, generate, and stream playback in 3 lines of code:
Key Features
.automode adapts based on the inference speed of the device for consistent, smooth playback.See the new TTSKit section in the README.md for full API docs, model selection, and advanced usage.
CLI
Try it out with the following command:
swift run -c release whisperkit-cli tts --text "Hello from TTSKit" --playAlso available via Homebrew upon release:
Gives full control over speaker, language, model variant, style, temperature, chunking strategy, compute units, seed for reproducibility, and more.
Example App
Along with the CLI, we're also releasing a new example app for developers to reference when building TTSKit into their apps. It features real-time waveform visualization, model management, persistent audio file history with metadata, and multi-platform support. Here's a screenshot:

More info about running this app in the example's README.md
Architecture Changes
ArgmaxCoretarget for common utilitiesThank you to @naykutguven and @shura-v for the excellent improvements packaged with this release prior to TTSKit listed below 🚀
What's Changed
prewarmby @chen-argmax in Update doc forprewarm#387New Contributors
Full Changelog: v0.15.0...v0.16.0
This discussion was created from the release v0.16.0.
Beta Was this translation helpful? Give feedback.
All reactions