Project Icon

ml-fastvit

FastViT Speedy and Accurate Vision Transformers Powered by Structural Reparameterization

Product DescriptionThis repository features FastViT, a rapid hybrid vision transformer utilizing structural reparameterization to boost image classification accuracy. Models have been trained on ImageNet-1K and benchmarked for latency on an iPhone 12 Pro via the ModelBench app. The repository includes setup guides for configuring environments, training, and evaluating models, along with scripts for implementation. It provides a varied collection of pre-trained models tailored for classification tasks, including the option for knowledge distillation. Comprehensive dataset preparation and model export instructions are available, making this a versatile tool for tasks ranging from training to fine-tuning in machine learning.
Project Details