Moshi: a speech-text foundation model for real-time dialogue Paper • 2410.00037 • Published Sep 17, 2024 • 13
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 16 items • Updated Dec 24, 2025 • 244
NeuTTS Nano Multilingual Collection Collection NeuTTS Nano is a TTS model, 3x smaller than NeuTTS Air, that runs on CPU in real-time - now in English, Spanish, French, and German versions! • 12 items • Updated 9 days ago • 16
🎵 The MusicBox Collection A collection full of musical tasks demos, for musicians & music enthusiasts • 38 items • Updated Jun 21, 2025 • 32
Running on Zero 77 Vocal Separation SOTA 🎤 77 Separate vocals and music from any audio file or YouTube video
Running on CPU Upgrade Featured 3k The Smol Training Playbook 📚 3k The secrets to building world-class LLMs