Linux Voice Tech

Star counts and last commit dates are shown via shields.io badges and update dynamically.

Last updated: 06/04/2026

An index of voice technology tools accessible to Linux users

For background, notes on how the repo is organized, and the inclusion criteria, see notes.md. For a getting-started guide, see starting-points.md.

Keywords

Quick Navigation

STT Tools with Wayland Support

Projects with explicit Wayland support. These are particularly valuable on modern Linux desktops running Wayland (GNOME, KDE Plasma, Hyprland, Sway, niri, etc.), where X11-based virtual input methods don't work.

Voice Typing — GUIs

Desktop applications for dictation and transcription with graphical interfaces.

Voice Typing — CLIs

Command-line dictation and transcription tools.

Voice Notes & AI-Enhanced Transcription

Tools focused on capturing voice notes with AI post-processing (LLM cleanup, formatting, summarization).

Real-Time Streaming STT

Libraries and tools for low-latency, live transcription.

Self-Hosted / Web UI

Docker-deployed tools and web interfaces for self-hosted STT.

Cloud STT / API-Based Tools

Projects that use cloud STT APIs for transcription.

OpenAI Whisper API

Deepgram API

Hugging Face ASR Models

Voice Assistants

Privacy-Focused

Open source voice assistants emphasizing local processing and privacy.

General

Voice Commands & Automation

Tools that translate voice into actions — computer control, voice-to-commands, voice-to-JSON, etc.

Voice Operating Systems

Subtitle Generation

Service-Specific Voice Tools

Voice Biometrics

Developer Tools

Proof of Concepts

Complementary Tools

Tools that aren't STT themselves, but help make the most of voice workflows.

Noise Suppression & Audio Processing

Voice Activity Detection (VAD) & Diarisation

Toolkits & Frameworks

ASR/STT toolkits and frameworks for building voice applications. Developer libraries rather than end-user applications.

Whisper Variants & Optimizations

Optimized implementations and variants of OpenAI's Whisper model.

Text-to-Speech (TTS)

MCP Servers

MCP (Model Context Protocol) servers that provide STT capabilities.

Awesome Lists

Ideas & Specifications

Projects at the concept or specification stage.

Archived Projects

Notable projects that are no longer actively maintained.

Community Resources

GitHub Topics

Subreddits