VLDet

Improving Object Detection with Object-Language Alignment

Product Description

VLDet is an open-source project that aligns object detection with language to improve open-vocabulary detection. It learns directly from image-text pairs by formulating object-language alignment as a bipartite matching problem between image regions and words, and it performs well on the open-vocabulary LVIS and COCO benchmarks. The design is scalable and extends readily to new vocabularies. VLDet runs on Linux or macOS with Python ≥ 3.7 and PyTorch ≥ 1.9. It is built on the Detectron2 framework and supports fine-tuning and evaluation of pretrained models, so it can be adapted to new object categories.
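
To make the bipartite matching idea concrete, here is a minimal, self-contained sketch that assigns each word to a distinct image region so that total cosine similarity is maximized. It is an illustration only and not VLDet's code: the function names (`cosine`, `match_words_to_regions`) and the toy two-dimensional vectors are hypothetical, and the brute-force search over permutations is a stand-in that only works for tiny inputs. A practical implementation would more likely use a Hungarian-style solver such as `scipy.optimize.linear_sum_assignment`.

```python
from itertools import permutations
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def match_words_to_regions(region_feats, word_embs):
    """Assign each word to a distinct region, maximizing total similarity.

    Brute force over permutations, so only viable for toy sizes.
    Requires len(word_embs) <= len(region_feats).
    """
    if len(word_embs) > len(region_feats):
        raise ValueError("need at least as many regions as words")
    # sim[i][j] = similarity between word i and region j
    sim = [[cosine(w, r) for r in region_feats] for w in word_embs]
    best_score, best_assign = float("-inf"), ()
    for perm in permutations(range(len(region_feats)), len(word_embs)):
        score = sum(sim[i][j] for i, j in enumerate(perm))
        if score > best_score:
            best_score, best_assign = score, perm
    return list(best_assign), best_score


# Hypothetical toy data: three region features, two word embeddings.
regions = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
words = [[0.1, 0.9], [0.9, 0.1]]
assignment, score = match_words_to_regions(regions, words)
print(assignment)  # [1, 0]: word 0 -> region 1, word 1 -> region 0
```

The third region is left unmatched here, which mirrors the common situation where an image contains more candidate regions than the caption has words.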
Project Details