Linux Voice Tech

An index of voice technology tools accessible to Linux users

Star counts and last commit dates are shown via shields.io badges and update dynamically.
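
As a rough illustration of how such badges can be generated, the sketch below builds Markdown for one entry. It assumes the shields.io `github/stars` and `github/last-commit` endpoints. The helper name `badge_markdown` is hypothetical and is not taken from this repo's tooling.

```python
from urllib.parse import quote


def badge_markdown(owner: str, repo: str) -> str:
    """Return Markdown for a star-count badge and a last-commit badge.

    Hypothetical helper: the endpoint paths are shields.io's GitHub badges,
    but this index may build its badges differently.
    """
    # Percent-encode each path segment so unusual names stay valid in a URL.
    slug = f"{quote(owner, safe='')}/{quote(repo, safe='')}"
    stars = f"https://img.shields.io/github/stars/{slug}"
    last_commit = f"https://img.shields.io/github/last-commit/{slug}"
    return f"![Stars]({stars}) ![Last commit]({last_commit})"


print(badge_markdown("ggerganov", "whisper.cpp"))
```

Because shields.io renders the image when the page is viewed, the counts stay current without regenerating the list.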

For background, notes on how the repo is organized, and inclusion criteria, see notes.md. For a getting started guide, see starting-points.md.

Keywords

Automatic speech recognition (ASR)
Speech-to-text (STT)
Text-to-speech (TTS)
Linux voice typing
Linux dictation
Linux TTS
Voice control
Transcription

Quick Navigation

Wayland-Compatible STT — Tools with explicit Wayland virtual input support
Voice Typing — GUIs — Desktop apps for dictation and transcription
Voice Typing — CLIs — Command-line dictation and transcription tools
Voice Notes & AI-Enhanced — Note-taking with AI post-processing
Real-Time Streaming STT — Low-latency, live transcription libraries
Self-Hosted / Web UI — Docker/web-based transcription services
Cloud STT / API-Based — Tools using OpenAI, Deepgram, or other cloud APIs
Voice Assistants — Voice-controlled assistant applications
Voice Commands & Automation — Voice-to-action, voice-to-MCP, computer control
Toolkits & Frameworks — Developer libraries for building voice apps
Whisper Variants & Optimizations — Faster/smaller/better Whisper implementations
Complementary Tools — VAD, diarization, noise suppression
Text-to-Speech (TTS) — TTS tools and frameworks
MCP Servers — Model Context Protocol voice servers
Awesome Lists — Other curated voice tech lists
Community Resources — GitHub topics, subreddits

STT Tools with Wayland Support

Projects with explicit Wayland support. These are particularly valuable on modern Linux desktops (GNOME, KDE Plasma on Wayland, Hyprland, Sway, niri, etc.), where X11-based virtual input tools such as xdotool do not work with native Wayland windows.

**dictation-tools** — · · Dictation tools with Wayland support
gfreeau/dictation-tools View on GitHub
**freespeak** — · · Voice dictation with Wayland support
Zetaphor/freespeak View on GitHub
**hyprvoice** — · · Voice dictation for Hyprland
LeonardoTrapani/hyprvoice View on GitHub
**hyprwhspr** — · · Whisper-based voice input for Hyprland
goodroot/hyprwhspr View on GitHub
**local-dictation-assistant** — · · Local dictation assistant with Wayland support
Wiecek-K/local-dictation-assistant View on GitHub
**niri-transcribe** — · · Transcription tool for niri compositor
sevos/niri-transcribe View on GitHub
**swictation** — · · Voice dictation for Sway/Wayland
robertelee78/swictation View on GitHub
**TalkType (ronb1964)** — · · Privacy-first voice dictation for Linux Wayland. Press key to talk, release to type. Whisper AI, 100% offline
ronb1964/TalkType View on GitHub
**vocalinux** — · · Offline voice dictation for Linux. Whisper.cpp, Whisper & VOSK engines, GPU-accelerated, X11 + Wayland
jatinkrmalik/vocalinux View on GitHub
**voice-typing-linux** — · · Voice typing for Linux with Wayland support
GitJuhb/voice-typing-linux View on GitHub
**wayland-voice-dictation** — · · Voice dictation designed for Wayland
MasonRhodesDev/wayland-voice-dictation View on GitHub
**whisper-wayland** — · · Whisper integration for Wayland
Andrewske/whisper-wayland View on GitHub
**whispy** — · · STT tool with Wayland support
daaku/whispy View on GitHub
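
To show why Wayland support has to be explicit, here is a minimal sketch of how a dictation tool might choose a typing backend for the current session. It assumes the `XDG_SESSION_TYPE` environment variable is set by the login session and that `wtype`, `ydotool`, or `xdotool` may be installed. The function name `typing_command` is hypothetical and does not describe how any listed project works.

```python
import os
import shutil
from typing import Callable, Mapping, Optional


def typing_command(
    text: str,
    env: Mapping[str, str] = os.environ,
    which: Callable[[str], Optional[str]] = shutil.which,
) -> list[str]:
    """Pick a command line that types `text` into the focused window.

    Hypothetical helper: `env` and `which` are injectable so the selection
    logic can be exercised without the tools being installed.
    """
    if env.get("XDG_SESSION_TYPE") == "wayland":
        # xdotool cannot reach native Wayland windows, so prefer Wayland tools.
        if which("wtype"):
            return ["wtype", text]
        if which("ydotool"):
            return ["ydotool", "type", text]
        raise RuntimeError("no Wayland typing tool found (tried wtype, ydotool)")
    if which("xdotool"):
        return ["xdotool", "type", text]
    raise RuntimeError("xdotool not found")


# Simulate an X11 session where every tool is installed.
print(typing_command("hello", {"XDG_SESSION_TYPE": "x11"}, lambda name: "/usr/bin/" + name))
```

The returned list can be passed to `subprocess.run(cmd, check=True)`. In practice `wtype` depends on the compositor supporting the virtual keyboard protocol, `ydotool` typically needs its `ydotoold` daemon running, and text that begins with a dash needs extra handling.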

Voice Typing — GUIs

Desktop applications for dictation and transcription with graphical interfaces.

**AI-Typer-V2** — · · Voice dictation with multimodal AI cleanup — speak naturally, get polished text
danielrosehill/AI-Typer-V2 View on GitHub
**aTrain** — · · Offline transcription of speech recordings with a graphical interface
JuergenFleiss/aTrain View on GitHub
**audiov** — · · Speech-to-text, voice-typing, dictation software for Linux distributions
WhiteSmoogy/audiov View on GitHub
**Buzz** — · · Offline audio transcription and translation. Supports Whisper, Whisper.cpp, Faster-Whisper. Available via Flatpak/Snap. Vulkan GPU support
chidiwilliams/buzz View on GitHub
**dsnote** — · · Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation
mkiol/dsnote View on GitHub
**LinuxWhisper** — · · Whisper for Linux
vitali87/LinuxWhisper View on GitHub
**maVoice-Linux** — · · Voice control for Linux
lliWcWill/maVoice-Linux View on GitHub
**mint-whisper** — · · Whisper for Linux Mint
codankra/mint-whisper View on GitHub
**murmure** — · · Fully local, private, cross-platform STT with LLM post-processing
Kieirra/murmure View on GitHub
**OpenFlow** — · · Local speech-to-text app for Linux
logabell/OpenFlow View on GitHub
**OpenWispr** — · · Open source Whisper-based voice assistant
imsidharthj/OpenWispr View on GitHub
**Parakeet-Type-Ubuntu** — · · On-device voice typing for Linux using Parakeet and NeMo ASR models via sherpa-onnx. No cloud, no GPU required
danielrosehill/Parakeet-Type-Ubuntu View on GitHub
**sotto** — · · Local speech-to-text transcription app for Linux using Whisper models
Maciejonos/sotto View on GitHub
**soundvibes** — · · Speech-to-text for Linux that just works
kejne/soundvibes View on GitHub
**TalkType (zyk42)** — · · Cross-platform Electron voice writing assistant. ASR + LLM for understanding, polishing, and Q&A
zyk42/TalkType View on GitHub
**TranscriptionSuite** — · · Fully local, private STT app with speaker diarization, Audio Notebook mode, LM Studio integration, longform and live transcription
homelab-00/TranscriptionSuite View on GitHub
**VoiceType** — · · Fork of Deepgram's Linux starter that turns the CLI into a GUI and adds hotkey support, API key editing, and cost tracking
danielrosehill/VoiceType View on GitHub
**WhisperNow** — · · Real-time Whisper transcription
shinglyu/WhisperNow View on GitHub
**whisper-to-input-desktop** — · · Desktop app using OpenAI's Whisper to transcribe audio and input it as text
Rosbifbr/whisper-to-input-desktop View on GitHub
**whisper-ui** — · · Whisper UI interface
schnoddelbotz/whisper-ui View on GitHub
**whisperer** — · · Whisper-based transcription tool
mike-cr/whisperer View on GitHub
**whisply** — · · A simple GUI for OpenAI Whisper
tsmdt/whisply View on GitHub
**wisper** — · · Voice dictation app for Linux. Type directly at cursor with AI-powered transcription
taraksh01/wisper View on GitHub
**wispr-lite** — · · Lightweight Whisper-based transcription tool
dosment/wispr-lite View on GitHub

Voice Typing — CLIs

Command-line dictation and transcription tools.

**BlahST** — · · Offline, real-time, streaming speech-to-text transcription using OpenAI Whisper
QuantiusBenignus/BlahST View on GitHub
**blurt** — · · Whisper.cpp-based STT tool
QuantiusBenignus/blurt View on GitHub
**dicti** — · · Dictation tool
tksimson/dicti View on GitHub
**froshine** — · · Voice recognition tool
AdrianScott/froshine View on GitHub
**linux-stt-input** — · · Linux STT input method
fengwk/linux-stt-input View on GitHub
**linux-voice-to-text-ai** — · · Linux voice to text AI
trebormc/linux-voice-to-text-ai View on GitHub
**Linux-Dictation-Project** — · · Linux dictation project
wheeler01/Linux-Dictation-Project View on GitHub
**sonori** — · · Voice recognition tool
0xPD33/sonori View on GitHub
**speak-to-ai** — · · Speak to AI assistant
AshBuk/speak-to-ai View on GitHub
**speech-assistant** — · · Faster Whisper-based speech assistant
Mohamad-Hussein/speech-assistant View on GitHub
**speedofsound** — · · Voice typing for the Linux desktop
zugaldia/speedofsound View on GitHub
**stt-linux (afif)** — · · STT for Linux
afif-malghani/stt-linux View on GitHub
**STT-Assistant-linux** — · · STT assistant for Linux
4lext/STT-Assistant-linux View on GitHub
**super-stt** — · · Enhanced STT tool
jorge-menjivar/super-stt View on GitHub
**talktype** — · · Push-to-talk voice typing for the terminal. Local Whisper, cross-platform
lmacan1/talktype View on GitHub
**TermlAi** — · · Terminal AI assistant with voice
rohitkr150015/TermlAi View on GitHub
**transcribeAnywhere** — · · Universal transcription tool
naren200/transcribeAnywhere View on GitHub
**VocalFLow** — · · Voice flow dictation tool
R3DK3LL/VocalFLow View on GitHub
**voice-type** — · · Linux-first system-wide dictation tool. Unmatched accuracy and speed, totally free
eriknovikov/voice-type View on GitHub
**voicekeyboard** — · · Voice keyboard implementation
sam1am/voicekeyboard View on GitHub
**voxd** — · · Voice input daemon
jakovius/voxd View on GitHub
**whisp-away** — · · Whisper-based dictation tool
madjinn/whisp-away View on GitHub
**whisper-dictation (ananjiani)** — · · Whisper-based dictation tool
ananjiani/whisper-dictation View on GitHub
**whisper-hotkey-linux** — · · Whisper hotkey for Linux
atkvishnu/whisper-hotkey-linux View on GitHub
**whisper-toggle** — · · Toggle-based Whisper control
bradjmsu/whisper-toggle View on GitHub
**whisper-transcribe** — · · Whisper transcription tool
geraschenko/whisper-transcribe View on GitHub
**whisprd** — · · A daemon for OpenAI Whisper
AgenticToaster/whisprd View on GitHub
**WhisperVoice** — · · Whisper voice processing tool
SarwadnyaMahajan/WhisperVoice View on GitHub
**WhisperVoiceInput** — · · Whisper voice input tool
V0v1kkk/WhisperVoiceInput View on GitHub
**whispertrigger** — · · Trigger OpenAI Whisper with a hotkey
RetroTrigger/whispertrigger View on GitHub
**whispertux** — · · A simple CLI wrapper for OpenAI's Whisper speech-to-text model
cjams/whispertux View on GitHub
**wvcr** — · · Wave voice control recorder
bakeryproducts/wvcr View on GitHub

Voice Notes & AI-Enhanced Transcription

Tools focused on capturing voice notes with AI post-processing (LLM cleanup, formatting, summarization).

**handsfree** — · · Hands-free computing
achyudh/handsfree View on GitHub
**notesGPT** — · · Voice notes with GPT processing
Nutlope/notesGPT View on GitHub
**obsidian-scribe** — · · Obsidian voice note transcription
Mikodin/obsidian-scribe View on GitHub
**ScribeWizard** — · · Transcription wizard tool
Bklieger/ScribeWizard View on GitHub
**Thought-Pad** — · · Thought capture with STT
danielrosehill/Thought-Pad View on GitHub
**whisper-notes** — · · Whisper-powered note processing
AsyncFuncAI/whisper-notes View on GitHub
**whisper-notes-pro** — · · Professional whisper notes application
mzazakeith/whisper-notes-pro View on GitHub
**Whisper-Notepad-For-Linux** — · · Whisper notepad with post-processing
danielrosehill/Whisper-Notepad-For-Linux View on GitHub
**Whisper-Notepad-Simple** — · · Simplified Whisper notepad using OpenAI API
danielrosehill/Whisper-Notepad-Simple View on GitHub

Real-Time Streaming STT

Libraries and tools for low-latency, live transcription.

**RealtimeSTT** — · · Low-latency STT library with VAD and wake word activation. Uses WebRTCVAD + SileroVAD + Faster-Whisper
KoljaB/RealtimeSTT View on GitHub
**whisper_real_time** — · · Real-time transcription with OpenAI Whisper
davabase/whisper_real_time View on GitHub
**whisper_streaming** — · · Real-time streaming Whisper with self-adaptive latency using local agreement policy
ufal/whisper_streaming View on GitHub
**WhisperLive** — · · Real-time Whisper transcription from Collabora. OpenVINO support, browser extensions, iOS client
collabora/WhisperLive View on GitHub
**WhisperLiveKit** — · · 2025 SOTA streaming STT with speaker diarization. Simul-Whisper for ultra-low latency
QuentinFuxa/WhisperLiveKit View on GitHub
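
The "local agreement policy" mentioned for whisper_streaming can be illustrated with a simplified sketch: a word is committed only once two consecutive hypotheses agree on it as part of their common prefix. This is not the project's implementation, which also works with timestamps and audio buffers. The class name `LocalAgreement` and its methods are hypothetical.

```python
def common_prefix(a: list[str], b: list[str]) -> list[str]:
    """Return the longest shared prefix of two word lists."""
    out = []
    for x, y in zip(a, b):
        if x != y:
            break
        out.append(x)
    return out


class LocalAgreement:
    """Commit only the words that two consecutive hypotheses agree on.

    Simplified illustration: each hypothesis is the full transcript so far.
    """

    def __init__(self) -> None:
        self.committed: list[str] = []
        self.previous: list[str] = []

    def update(self, hypothesis: list[str]) -> list[str]:
        """Feed the latest hypothesis and return the newly confirmed words."""
        agreed = common_prefix(self.previous, hypothesis)
        self.previous = hypothesis
        # Words already committed are never retracted or emitted twice.
        new_words = agreed[len(self.committed):]
        self.committed.extend(new_words)
        return new_words


stream = LocalAgreement()
for hyp in ["the cat", "the cat sat", "the cat sat on"]:
    print(stream.update(hyp.split()))
# prints [], then ['the', 'cat'], then ['sat']
```

The trade-off is that every word waits for one extra recognition pass before it is shown, in exchange for output that does not flicker as the model revises its guess.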

Self-Hosted / Web UI

Docker-deployed tools and web interfaces for self-hosted STT.

**meeting-minutes** — · · Self-hostable meeting transcription and minutes generation
Zackriya-Solutions/meeting-minutes View on GitHub
**Scriberr** — · · Voice transcription tool
rishikanthc/Scriberr View on GitHub
**Whisper-WebUI** — · · A Gradio-based browser interface for Whisper. Easy subtitle generation
jhj0517/Whisper-WebUI View on GitHub
**whisper-fastapi** — · · Whisper FastAPI service
heimoshuiyu/whisper-fastapi View on GitHub

Cloud STT / API-Based Tools

Projects that use cloud STT APIs for transcription.

OpenAI Whisper API

**speech2keys** — · · Speech to keystrokes using OpenAI Whisper API
vanviegen/speech2keys View on GitHub

Deepgram API

**Deepgram-Voice-Keyboard-Ubuntu** — · · STT project using Deepgram API for Ubuntu
danielrosehill/Deepgram-Voice-Keyboard-Ubuntu View on GitHub
**fortuna** — · · Deepgram Fortuna project
deepgram-devs/fortuna View on GitHub
**voice-keyboard-linux** — · · Deepgram voice keyboard for Linux
deepgram/voice-keyboard-linux View on GitHub

Hugging Face ASR Models

ASR Models (Trending) — Trending automatic speech recognition models on Hugging Face
Whisper on Hugging Face — OpenAI Whisper on Hugging Face

Voice Assistants

Privacy-Focused

Open source voice assistants emphasizing local processing and privacy.

**Neon AI** — · · Privacy-first voice assistant. Offline-capable, customizable. Maintains Mycroft community forums
NeonGeckoCom/NeonCore View on GitHub
**OpenVoiceOS** — · · Community-driven voice assistant framework. Local processing, privacy-focused. Continuation of Mycroft
OpenVoiceOS/ovos-core View on GitHub
**Project Alice** — · · Modular smart assistant, fully offline. Built around Snips, guarantees privacy
project-alice-assistant/ProjectAlice View on GitHub
**SEPIA Framework** — · · Self-hosted, privacy-compliant voice assistant ecosystem
SEPIA-Framework/sepia-assist-server View on GitHub

General

**jarvis_linux** — · · Jarvis for Linux
morrolinux/jarvis_linux View on GitHub
**linux-voice-control** — · · Linux voice control system
omegaui/linux-voice-control View on GitHub
**LinuxVoiceAssistant** — · · Linux voice assistant
aydinnyunus/LinuxVoiceAssistant View on GitHub
**Local-Voice** — · · Local voice assistant
shashank2122/Local-Voice View on GitHub
**Personal-Voice-Assistent** — · · Personal voice assistant
Cyborgscode/Personal-Voice-Assistent View on GitHub
**tempest** — · · Voice assistant framework
lavafroth/tempest View on GitHub
**vosk-cli-dictation** — · · Vosk CLI dictation
RonanDavalan/vosk-cli-dictation View on GitHub

Voice Commands & Automation

Tools that translate voice into actions — computer control, voice-to-commands, voice-to-JSON, etc.

**Handy** — · · Voice-controlled computer interface - handy.computer
cjpais/Handy View on GitHub
**home-assistant-assist-desktop** — · · Home Assistant desktop client
timmo001/home-assistant-assist-desktop View on GitHub
**JustSayIt.jl** — · · Offline, low-latency translation of speech to computer commands or text. Julia-based
omlins/JustSayIt.jl View on GitHub
**numen** — N/A · N/A · Voice-controlled interface (hosted on SourceHut)
**voice2json** — · · Voice to JSON converter
synesthesiam/voice2json View on GitHub

Voice Operating Systems

**ovos-buildroot** — · · OpenVoiceOS - A minimalistic Linux OS bringing the open source voice assistant to IoT and embedded devices
OpenVoiceOS/ovos-buildroot View on GitHub

Subtitle Generation

**auto-subs** — · · Automatic subtitle generation
tmoroney/auto-subs View on GitHub
**whisper-subs** — · · Whisper subtitle generation
GhostNaN/whisper-subs View on GitHub

Service-Specific Voice Tools

**deepin-voice-note** — · · Deepin voice note application
linuxdeepin/deepin-voice-note View on GitHub
**overlayed** — · · Voice overlay for Discord on Linux
overlayeddev/overlayed View on GitHub
**whatsapp_voice_transcription** — · · WhatsApp voice message transcription
nerveband/whatsapp_voice_transcription View on GitHub

Voice Biometrics

**voiceprint** — · · Voice biometric authentication for Linux
Raymo111/voiceprint View on GitHub

Developer Tools

**mt_stt** — · · C wrapper for speech-to-text
RhinoDevel/mt_stt View on GitHub
**whisper.cpp-cli** — · · Whisper.cpp CLI wrapper
charliermarsh/whisper.cpp-cli View on GitHub
**whisper (Nutlope)** — · · Whisper implementation
Nutlope/whisper View on GitHub

Proof of Concepts

**stt-linux (samcole8)** — · · STT Linux proof of concept
samcole8/stt-linux View on GitHub
**whisperai** — · · Whisper AI proof of concept
jorgecastro05/whisperai View on GitHub

Complementary Tools

Tools that aren't STT themselves, but help make the most of voice workflows.

Noise Suppression & Audio Processing

**easyeffects** — · · Audio effects for PipeWire applications - noise reduction, equalization, and more
wwmm/easyeffects View on GitHub
**NoiseTorch** — · · Real-time microphone noise suppression on Linux
noisetorch/NoiseTorch View on GitHub

Voice Activity Detection (VAD) & Diarization

**pyannote-audio** — · · Neural building blocks for speaker diarization: speech activity detection, speaker embedding, clustering
pyannote/pyannote-audio View on GitHub
**Silero VAD** — · · Enterprise-grade Voice Activity Detector. MIT license; processes one audio chunk in under 1 ms on a single CPU thread
snakers4/silero-vad View on GitHub
**WebRTC VAD** — · · Python interface to WebRTC Voice Activity Detector
wiseman/py-webrtcvad View on GitHub
**wyoming-openwakeword** — · · Custom wake word detection for Home Assistant
rhasspy/wyoming-openwakeword View on GitHub

Toolkits & Frameworks

ASR/STT toolkits and frameworks for building voice applications. Developer libraries rather than end-user applications.

**Coqui STT** — · · Deep learning STT toolkit (continuation of Mozilla DeepSpeech). Custom model training
coqui-ai/STT View on GitHub
**fairseq** — · · Meta's sequence modeling toolkit. Includes Wav2Vec 2.0 for self-supervised ASR
facebookresearch/fairseq View on GitHub
**FunASR** — · · End-to-end speech recognition toolkit from Alibaba. Industrial-grade models
modelscope/FunASR View on GitHub
**NVIDIA NeMo** — · · Enterprise ASR toolkit with Conformer/Parakeet models. GPU-accelerated training and inference
NVIDIA/NeMo View on GitHub
**sherpa-onnx** — · · STT, TTS, speaker diarization, VAD using next-gen Kaldi with ONNX Runtime. Offline, 12 programming languages
k2-fsa/sherpa-onnx View on GitHub
**sherpa-onnx-go** — · · Go package for sherpa-onnx speech recognition without network access
k2-fsa/sherpa-onnx-go View on GitHub
**SpeechBrain** — · · PyTorch-based speech toolkit for ASR, speaker recognition, speech enhancement
speechbrain/speechbrain View on GitHub
**Vosk** — · · Offline speech recognition API. Lightweight, 20+ languages, works on Raspberry Pi
alphacep/vosk-api View on GitHub

Whisper Variants & Optimizations

Optimized implementations and variants of OpenAI's Whisper model.

**distil-whisper** — · · Hugging Face's distilled Whisper. 6x faster, 49% smaller, and within 1% WER of Whisper on out-of-distribution evaluation sets
huggingface/distil-whisper View on GitHub
**faster-whisper** — · · CTranslate2 reimplementation of Whisper. Up to 4x faster than openai/whisper with lower memory use; 8-bit quantization support
SYSTRAN/faster-whisper View on GitHub
**insanely-fast-whisper** — · · CLI for fastest Whisper inference. Batching, flash attention, distil-whisper support
Vaibhavs10/insanely-fast-whisper View on GitHub
**whisper.cpp** — · · C/C++ port of Whisper. CPU inference, minimal dependencies, runs on edge devices
ggerganov/whisper.cpp View on GitHub
**whisper-plus** — · · Advanced Whisper pipelines with diarization, translation, and video transcription support
kadirnar/whisper-plus View on GitHub
**wyoming-faster-whisper** — · · Wyoming protocol server for faster-whisper. Home Assistant integration
rhasspy/wyoming-faster-whisper View on GitHub
**wyoming-whisper-api-client** — · · Wyoming protocol client for Whisper APIs. Centralizes STT for Home Assistant
ser/wyoming-whisper-api-client View on GitHub
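
Several entries above compare accuracy in terms of word error rate (WER). As a reference point, WER is the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The sketch below is a plain-Python illustration with no text normalization. Published benchmarks usually normalize casing and punctuation first, so their numbers will differ. The function name `word_error_rate` is hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Compute WER = (substitutions + deletions + insertions) / reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words consumed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# One dropped word out of six reference words gives a WER of 1/6.
print(round(word_error_rate("the cat sat on the mat", "the cat sat on mat"), 3))
```

A model "within 1% WER" of another would differ by at most one word error per hundred reference words on the same test set.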

Text-to-Speech (TTS)

**claude-tts** — · · TTS plugin for Claude Code — multi-provider support (ElevenLabs, OpenAI, Google, Amazon Polly, Azure, local system TTS)
MatiousCorp/claude-tts View on GitHub

MCP Servers

MCP (Model Context Protocol) servers that provide STT capabilities.

**stt-mcp-server-linux** — · · Local speech-to-text MCP server for Tmux on Linux (for use with Claude Code and other MCP clients)
marcindulak/stt-mcp-server-linux View on GitHub

Awesome Lists

**awesome-voice-typing** — · · Curated list of open-source STT and voice typing tools for Linux, macOS, Windows, Android, and iOS
primaprashant/awesome-voice-typing View on GitHub
**Voice-Apps-Index** — · · Index for STT and dictation apps and WIPs
danielrosehill/Voice-Apps-Index View on GitHub

Ideas & Specifications

Projects at the concept or specification stage.

**VoiceBox** — · · Idea for a speech tech solution — specced out by Claude
danielrosehill/VoiceBox View on GitHub

Archived Projects

Notable projects that are no longer actively maintained.

**AI-Transcription-Notepad** — · · Voice note taking utility using cloud audio multimodal models for single-pass transcription and text cleanup (archived)
danielrosehill/AI-Transcription-Notepad View on GitHub

Community Resources

GitHub Topics

asr — Automatic speech recognition
topics/asr View on GitHub
dictation — Dictation tools and applications
topics/dictation View on GitHub
speech-to-text — General speech-to-text projects
topics/speech-to-text View on GitHub
transcription — Audio/video transcription tools
topics/transcription View on GitHub
voice — General voice technology projects
topics/voice View on GitHub
voice-assistant — Voice assistant applications
topics/voice-assistant View on GitHub
voice-commands — Voice command implementations
topics/voice-commands View on GitHub
voice-control — Voice control tools
topics/voice-control View on GitHub
voice-dictation — Voice dictation specific projects
topics/voice-dictation View on GitHub
voice-recognition — Voice recognition systems
topics/voice-recognition View on GitHub

Subreddits

r/accessibility — Accessibility tools including voice control
r/LocalLLaMA — Local LLMs (frequently covers voice topics)
r/opensource — Open source projects including voice tools
r/speechrecognition — Speech recognition systems and discussion
r/TextToSpeech — TTS technology (complementary to STT)
r/VoiceTech — Voice technology and applications