en

#VLMEvalKit

VLMEvalKit is an open-source toolkit designed for evaluating large vision-language models (LVLMs) efficiently with a single command. It enables both exact matching and LLM-based answer extraction, simplifying the evaluation across diverse datasets without extensive data preparation. Recent updates include support for models such as Ovis1.6-Llama3.2-3B and Xinyuan-VL-2B, reflecting ongoing enhancements and community contributions. Multimodal leaderboards and datasets further augment its application for researchers and developers assessing LVLM performance comprehensively.

Terms of Use Privacy Policy Advertising Services

Feedback Email: [email protected]