fastdup
Fastdup is a free, unsupervised tool designed for thorough analysis of image and video datasets, detecting duplicates, outliers, and mislabels effectively. Capable of processing up to 400 million images with a single CPU, it utilizes a C++ engine for speed and supports data privacy by local or cloud execution. Compatible with MacOS, Linux, and Windows, it supports labeled and unlabeled data formats. Suitable for extensive projects, it offers both interactive and static galleries and integrates with TIMM and ONNX for feature extraction.