Voicebox Meta
Meta AI's generative speech model for in-context text-to-speech and style transfer
Voicebox Meta
Meta AI's generative speech model for in-context text-to-speech and style transfer
Voicebox is Meta AI's generative speech model trained on a large dataset of audiobooks in multiple languages. It performs in-context text-to-speech synthesis, noise removal, content editing, and style transfer without task-specific fine-tuning. As a research model, it represents a significant leap toward generalist AI voice systems that can match speaker styles across diverse samples.
Key Features
- ✓In-context TTS
- ✓Multi-language support
- ✓Style transfer
- ✓Noise removal
- ✓Content editing
Quick Info
- Category
- Voice & Audio
- Pricing
- Free
More Voice & Audio Tools
Poly AI
Voice & AudioEnterprise AI voice agents for customer service that sound like humans
Vall-E
Voice & AudioMicrosoft's neural codec language model for zero-shot voice synthesis
SpeechBrain
Voice & AudioOpen-source PyTorch toolkit for conversational AI, speech recognition, and speaker verification
MacWhisper
Voice & AudioMac app using OpenAI Whisper for local, private audio and video transcription on Mac