> ## Documentation Index > Fetch the complete documentation index at: https://kindling.birklid.com/llms.txt > Use this file to discover all available pages before exploring further. # Voicebox > Free, local-first AI voice studio — voice cloning, TTS across 23 languages, Whisper dictation, and MCP integration. All on-device, fully private.

Open Source Applications Layer

# Voicebox **ElevenLabs-quality voice tools, completely on-device — clone voices, generate speech across 23 languages, and dictate with Whisper, all without a subscription.** Voicebox GitHub

Open Source (MIT) Applications TypeScript 24.9k+ ## What it is Voicebox is a free, open-source, local-first AI voice studio that bundles voice cloning, text-to-speech across 23 languages with seven TTS engine options, and Whisper-powered speech-to-text into a single native desktop app. Built with Tauri rather than Electron, it runs natively on Apple Silicon, NVIDIA CUDA, and AMD ROCm, keeping all audio data on your machine with no cloud dependency. It includes a multi-track story editor for podcast production, post-processing audio effects, and an MCP server for AI agent integration. The privacy advantage is the headline: everything happens locally. No audio leaves your device, no subscription is required, and no usage limits apply. For developers, the MCP server makes it callable from agent workflows that need voice output without routing through a third-party API. **Use this when** you need ElevenLabs-style voice capabilities but have privacy requirements, cost constraints, or offline needs — or when you want to integrate speech synthesis into an AI agent workflow via MCP without third-party API dependencies. ## Get started Source, releases for Mac/Windows/Linux, and MCP documentation. ## Related tools The cloud-based standard — useful comparison point for voice quality. Programmatic podcast generation — can use Voicebox-compatible TTS engines.