LLM-Codec

Efficient Cross-Modal Audio Learning with LLM-driven Codec Models

Product Description

UniAudio 1.5 presents an LLM-driven audio codec model that converts audio data into textual tokens, allowing LLMs to perform audio tasks such as emotion classification and text-to-speech (TTS) generation without fine-tuning. The open-source model treats audio as a new language for the LLM, which lets it learn these tasks effectively in a few-shot setting from only a handful of examples.
Project Details