Give AlbumentationsX a star on GitHub — it powers this leaderboard

Star on GitHub

kyutai-labs/moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Stars: 9,747Language: Python
kyutai-labs/moshi - GitHub Repository | PyPI Leaderboard