Skip to content

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

License

Notifications You must be signed in to change notification settings

alphacep/vosk-api

Folders and files

NameName
Last commit message
Last commit date

Latest commit

cf67ed6 · Nov 13, 2024
Aug 22, 2024
May 15, 2022
Apr 22, 2024
May 6, 2024
Nov 10, 2021
Aug 22, 2024
Jun 19, 2024
Nov 13, 2024
Apr 22, 2024
Apr 22, 2024
Jul 4, 2022
Oct 24, 2024
Sep 26, 2024
Aug 23, 2024
Apr 22, 2024
Nov 7, 2021
Feb 9, 2021
Oct 10, 2024
Jan 2, 2020
Jul 4, 2022

Repository files navigation

Vosk Speech Recognition Toolkit

Vosk is an offline open source speech recognition toolkit. It enables speech recognition for 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi, Czech, Polish. More to come.

Vosk models are small (50 Mb) but provide continuous large vocabulary transcription, zero-latency response with streaming API, reconfigurable vocabulary and speaker identification.

Speech recognition bindings implemented for various programming languages like Python, Java, Node.JS, C#, C++, Rust, Go and others.

Vosk supplies speech recognition for chatbots, smart home appliances, virtual assistants. It can also create subtitles for movies, transcription for lectures and interviews.

Vosk scales from small devices like Raspberry Pi or Android smartphone to big clusters.

Documentation

For installation instructions, examples and documentation visit Vosk Website.