Download
Voicebox is available for macOS and Windows, with Linux builds coming soon.
macOS
Download: voicebox_aarch64.app.tar.gz
# Extract the archive
tar -xzf voicebox_aarch64.app.tar.gz
Move to Applications
mv Voicebox.app /Applications/
Download: voicebox_x64.app.tar.gz
# Extract the archive
tar -xzf voicebox_x64.app.tar.gz
Move to Applications
mv Voicebox.app /Applications/
Windows
Download: voicebox_x64_en-US.msi
Double-click the MSI file and follow the installation wizard.
Download: voicebox_x64-setup.exe
Run the executable and follow the installation wizard.
Linux
First Launch
When you launch Voicebox for the first time:
Model Download — The TTS engine you generate with first will download its model automatically. Sizes range from
350 MB (Kokoro) to ~8 GB (TADA 3B). Most users start with Qwen 1.7B (3.5 GB).Data Directory — Voice profiles and generated audio are stored in:
- macOS:
~/Library/Application Support/sh.voicebox.app/ - Windows:
%APPDATA%/sh.voicebox.app/ - Linux:
~/.config/sh.voicebox.app/
- macOS:
Backend Server — The bundled Python server starts automatically
System Requirements
Minimum
- OS: macOS 11+, Windows 10+, or Linux
- RAM: 8GB
- Storage: 5GB free space (for models and data)
- CPU: Modern multi-core processor
Recommended
- RAM: 16GB+
- GPU: CUDA-capable NVIDIA GPU (for faster generation)
- Storage: 10GB+ free space
Verification
After installation, verify everything works:
- Launch Voicebox
- Check the server status indicator in the bottom-left corner (should be green)
- Navigate to Profiles and create a test profile
- Generate a short audio clip to verify the TTS engine works
Next Steps
Create your first voice profile and generate speech