3D-Speaker

Comprehensive Open-source Toolkit for Multi-modal Speaker Tasks

Product Description

3D-Speaker is an open-source toolkit for single- and multi-modal speaker verification, speaker recognition, and speaker diarization. Pretrained models are available on ModelScope, and the large-scale 3D-Speaker speech corpus supports research on speech representation. The toolkit includes multiple training and inference recipes for datasets such as 3D-Speaker, VoxCeleb, and CN-Celeb, with models such as CAM++, ERes2Net, ERes2NetV2, and ECAPA-TDNN. Regular releases and comprehensive documentation make it a useful resource for researchers and developers working in speech technology.
Project Details