MotionGPT
MotionGPT is a motion-language model that treats human motion as a language. It uses discrete vector quantization to convert motion into tokens, so motion and text can be handled by a single model. Pre-trained on extensive data and drawing on advances in language modeling and instruction tuning, it supports diverse tasks, including motion generation, motion prediction, motion captioning, and in-between motion generation.
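
To make the idea of treating motion as a language through discrete vector quantization more concrete, here is a minimal sketch in plain Python. It is not MotionGPT's implementation. The codebook is fixed by hand rather than learned, the 2-D "frames" are made-up values standing in for real motion features, and the `<motion_N>` token format and all function names are hypothetical. It only shows the core step: each motion feature vector is replaced by the index of its nearest codebook entry, which yields a token sequence a language model could read alongside text.

```python
import math


def nearest_code(frame, codebook):
    """Return the index of the codebook vector closest to `frame`."""
    best_index, best_dist = 0, math.inf
    for index, code in enumerate(codebook):
        # Squared Euclidean distance between the frame and this code.
        dist = sum((f - c) ** 2 for f, c in zip(frame, code))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def motion_to_tokens(frames, codebook):
    """Map each motion feature vector to a discrete token string."""
    # The "<motion_N>" naming is a hypothetical convention for this sketch.
    return [f"<motion_{nearest_code(frame, codebook)}>" for frame in frames]


def tokens_to_motion(tokens, codebook):
    """Decode token strings back to codebook vectors (lossy reconstruction)."""
    prefix = "<motion_"
    return [codebook[int(token[len(prefix):-1])] for token in tokens]


# Toy hand-written codebook and motion clip (assumed values, not real data).
CODEBOOK = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
CLIP = [(0.1, -0.1), (0.9, 0.2), (0.8, 0.9), (0.1, 0.7)]

TOKENS = motion_to_tokens(CLIP, CODEBOOK)
RECONSTRUCTED = tokens_to_motion(TOKENS, CODEBOOK)

if __name__ == "__main__":
    print(TOKENS)
    print(RECONSTRUCTED)
```

Because quantization snaps every frame to a codebook entry, decoding recovers only an approximation of the original motion. In exchange, motion becomes a finite vocabulary that can share a token stream with words.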