🗣️

Moshi AI

Kyutai's open-source real-time speech-to-speech conversational AI

Audio & Speech

Moshi AI

Kyutai's open-source real-time speech-to-speech conversational AI

Audio & SpeechFree

Moshi is an open-weight real-time speech-to-speech conversational AI model developed by Kyutai, a French AI research lab. Unlike LLM-based voice assistants that convert speech to text before processing, Moshi operates directly on audio streams, enabling fully natural real-time conversations with latency under 200ms. Moshi can listen and speak simultaneously, interrupting itself or being interrupted naturally. The model weights are fully open, enabling researchers and developers to run and fine-tune it.