Project Icon

cambrian

Exploration of Open-source Vision-focused Multimodal Language Models

Product DescriptionCambrian project explores open-source vision-centric multimodal language models with state-of-the-art capabilities in 8B, 13B, and 34B sizes. It provides comprehensive benchmarks and datasets like Cambrian-10M for instruction tuning, allowing easy adoption and performance comparison with proprietary models such as GPT-4V. The project emphasizes two-stage training techniques for model robustness.
Project Details