Overview of NanoDet
NanoDet is a lightweight, real-time object detection model designed for mobile and edge devices. Its strengths are a tiny model file, fast inference, and competitive accuracy for its size. This overview covers its key features, architecture, benchmarks, and deployment options, which together make it an appealing choice for developers who need efficient object detection.
Key Features
- Lightweight and Fast: The model file is remarkably small, about 980KB for INT8 and 1.8MB for FP16 weights, and inference runs at roughly 97 frames per second on a mobile ARM CPU.
- High Accuracy: NanoDet-Plus reaches up to 34.3 mAP on the COCO val set while remaining fast enough for CPU and mobile inference.
- Training Friendly: It needs far less GPU memory than comparable detectors; a batch size of 80 fits on a GTX 1060 6G GPU.
- Easy Deployment: It supports various backends like ncnn, MNN, and OpenVINO, and even includes an Android demo.
Architecture
NanoDet is an FCOS-style, one-stage, anchor-free object detection model. It uses Generalized Focal Loss for both classification and box regression. The NanoDet-Plus variant introduces a new label assignment strategy built around an Assign Guidance Module (AGM) and a Dynamic Soft Label Assigner (DSLA) to improve label assignment during training. It also adds a light feature pyramid, Ghost-PAN, for multi-layer feature fusion. Together these changes boost detection accuracy by about 7 mAP on COCO over the original NanoDet.
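To make the classification loss mentioned above concrete, here is a minimal PyTorch sketch of the Quality Focal Loss term of Generalized Focal Loss. It only illustrates the core formula; NanoDet's actual implementation lives in the repository and also includes a Distribution Focal Loss for box regression, so treat this as an illustration rather than the project's code.

```python
import torch
import torch.nn.functional as F

def quality_focal_loss(pred_logits: torch.Tensor,
                       soft_targets: torch.Tensor,
                       beta: float = 2.0) -> torch.Tensor:
    """Sketch of the Quality Focal Loss term of Generalized Focal Loss.

    pred_logits:  raw classification logits, shape (N, num_classes)
    soft_targets: IoU-aware soft labels in [0, 1], same shape; 0 for negatives
    beta:         focusing parameter (the paper uses 2.0)
    """
    pred_sigmoid = pred_logits.sigmoid()
    # Binary cross-entropy against the soft (quality) target ...
    bce = F.binary_cross_entropy_with_logits(
        pred_logits, soft_targets, reduction="none")
    # ... scaled by |y - sigma|^beta so well-estimated samples are down-weighted.
    modulating_factor = (soft_targets - pred_sigmoid).abs().pow(beta)
    return (modulating_factor * bce).sum()
```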
Benchmarks
NanoDet's performance is benchmarked against various models:
- NanoDet-m: At a 320x320 input resolution, it reaches 20.6 mAP with 0.72 GFLOPs and 0.95M parameters.
- NanoDet-Plus-m and its 1.5x variants raise mAP into the roughly 27-34 range (at 320x320 and 416x416 input resolutions) while staying smaller and faster than models such as YOLOv3-Tiny and YOLOv4-Tiny, which have larger file sizes and higher latency.
Recent Updates
NanoDet remains actively maintained with regular updates:
- In January 2023, it upgraded to PyTorch Lightning 1.9 and added support for FP16 training and for ignore labels in annotations.
- In December 2021, the NanoDet-Plus model was released, significantly enhancing accuracy with minimal additional computational costs.
Deployment
NanoDet supports multiple platforms and demos:
- Android Demo: Demonstrates real-time performance on mobile devices.
- NCNN C++ and OpenVINO Demos: Provide native C++ inference for desktop and embedded CPUs.
- Web Browser Demo: Available via WebAssembly.
- PyTorch Demo: Runs inference with COCO pre-trained weights on image, video, or webcam inputs; a sketch of this flow follows this list.
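The snippet below sketches what the PyTorch demo flow can look like when driven from a script. The helper names (load_config, cfg, Logger, the Predictor class) and the config and weight paths mirror what the repository's demo script exposes, but they are assumptions here; check demo/demo.py in the repo for the exact API before relying on them.

```python
# Sketch of single-image inference with the repo's demo helpers.
# NOTE: Predictor, load_config, cfg and Logger are assumed to match the
# upstream demo/demo.py; verify the names and signatures against the repo.
import torch
from nanodet.util import cfg, load_config, Logger
from demo.demo import Predictor  # assumed import path; run from the repo root

config_path = "config/nanodet-plus-m_416.yml"          # example config in the repo
model_path = "nanodet-plus-m_416_checkpoint.ckpt"      # downloaded COCO weights

load_config(cfg, config_path)
logger = Logger(local_rank=0, use_tensorboard=False)
device = "cuda:0" if torch.cuda.is_available() else "cpu"

predictor = Predictor(cfg, model_path, logger, device=device)
meta, results = predictor.inference("street.jpg")      # path to a test image
predictor.visualize(results[0], meta, cfg.class_names, 0.35)  # score threshold
```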
Installation and Training
To get started with NanoDet:
- Create and activate a virtual environment using Conda.
- Install PyTorch and clone the NanoDet repository.
- Install the remaining requirements and set up the package (a quick post-install sanity check is sketched after this list).
- Configure datasets and adjust the model settings based on your project requirements.
- Start training with the provided training script, which uses PyTorch Lightning under the hood.
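As a quick post-install sanity check, the following sketch simply confirms that PyTorch and the nanodet package import cleanly and that a config file loads. The config path and the load_config/cfg helpers are assumptions based on the repository layout, so adjust them to match your checkout.

```python
# Post-install sanity check (a sketch; adjust paths to your checkout).
import torch
from nanodet.util import cfg, load_config  # assumed helpers from the nanodet package

print("PyTorch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())

# Load one of the example configs shipped with the repo (path is an assumption).
load_config(cfg, "config/nanodet-plus-m_416.yml")
print("Backbone:", cfg.model.arch.backbone.name)
print("Train input size:", cfg.data.train.input_size)
```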
Model Zoo
NanoDet supports various backbones, and its models are available for download with pre-trained weights. Each model varies by backbone type, resolution, and computational requirements, allowing flexibility based on project needs.
Conclusion
NanoDet offers a strong balance between performance, ease of use, and deployment flexibility, making it an excellent choice for developers who need lightweight yet accurate object detection. Its ongoing maintenance and broad backend support keep it a relevant tool in object detection.