Project Icon

mlx-vlm

Use MLX-VLM for Efficient Vision-Language Model Inference and Fine-Tuning on Mac

Product DescriptionMLX-VLM provides tools to perform inference and fine-tune vision-language models on macOS. It supports efficient interaction through a command-line interface and Gradio chat UI, and is compatible with models like Idefics 2 and Phi3-Vision. With features like multi-image chat support and model enhancement using LoRA and QLoRA, MLX-VLM facilitates comprehensive image analysis. Installation is straightforward via pip.
Project Details