VLMEvalKit
VLMEvalKit is an open-source toolkit designed for evaluating large vision-language models (LVLMs) efficiently with a single command. It enables both exact matching and LLM-based answer extraction, simplifying the evaluation across diverse datasets without extensive data preparation. Recent updates include support for models such as Ovis1.6-Llama3.2-3B and Xinyuan-VL-2B, reflecting ongoing enhancements and community contributions. Multimodal leaderboards and datasets further augment its application for researchers and developers assessing LVLM performance comprehensively.