EdgeSAM
EdgeSAM is an accelerated variant of the Segment Anything Model (SAM) built for edge devices. It runs about 40 times faster than the original SAM with minimal loss in segmentation quality, and it surpasses models such as MobileSAM in mIoU on the COCO and LVIS datasets. On an iPhone 14, EdgeSAM runs at more than 30 FPS. The model is trained with a distillation process that keeps the prompt encoder and mask decoder in the loop, so the distilled model better captures the interaction between user prompts and mask generation. Training and evaluation code is available, and the model can be exported to ONNX and CoreML for deployment in a range of applications.
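
For readers unfamiliar with the mIoU metric mentioned above, the following is a minimal sketch of how a mean intersection-over-union score can be computed over predicted and ground-truth binary masks. It is a generic illustration in plain Python, not EdgeSAM's evaluation code: the helper names `mask_iou` and `mean_iou` are hypothetical, and the exact protocol used for the COCO and LVIS numbers (prompt types, how masks are paired, how empty masks are handled) may differ.

```python
from typing import Sequence

# A binary mask as rows of 0/1 values, where 1 marks foreground pixels.
Mask = Sequence[Sequence[int]]


def mask_iou(pred: Mask, target: Mask) -> float:
    """Intersection over union of two binary masks of the same shape."""
    intersection = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                intersection += 1
            if p or t:
                union += 1
    # Assumption for this sketch: two empty masks count as a perfect match.
    return intersection / union if union else 1.0


def mean_iou(preds: Sequence[Mask], targets: Sequence[Mask]) -> float:
    """Average the per-mask IoU over paired predictions and ground truths."""
    if not preds or len(preds) != len(targets):
        raise ValueError("preds and targets must be non-empty and equal in length")
    return sum(mask_iou(p, t) for p, t in zip(preds, targets)) / len(preds)


if __name__ == "__main__":
    truth = [[1, 0], [0, 0]]
    partial = [[1, 1], [0, 0]]  # one extra foreground pixel
    print(mask_iou(partial, truth))                   # IoU of a single pair
    print(mean_iou([partial, truth], [truth, truth])) # mean over two pairs
```

A higher mIoU means the predicted masks overlap the ground-truth masks more closely on average, which is the sense in which the comparison with MobileSAM above should be read.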