Local Speech Transcriber is a minimalist voice typing application that uses OpenAI’s Whisper locally.
This WPF application captures audio from your microphone and uses a local Whisper-based STT server to transcribe speech, automatically typing the result into any active window.
The application is functional and designed with extensibility in mind. Since it already meets my personal usage needs, further development will slow down. Below is a list of potential features I may implement in the future:
- Cross-platform support via a non-WPF presentation layer using MAUI Blazor Hybrid
- Integration of a faster STT solution
- UI options for configuring transcription settings (model, language, etc.)
- Taskbar/tray icon showing the recording state.
- Clone this repository. This will also clone Whisper Streaming and SimulStreaming.
- Verify that Whisper Streaming runs successfully (needs Python and other dependencies).
- Update
appsettings.json
if needed. Refer to WhisperSettings for details on the available configuration options. - Build and run
LocalSpeechTranscriber.sln
. - Start transcription using the global hotkey (default:
Ctrl+Alt+Shift+D
) or by clicking the "Start Recording" button.