nncf
NNCF provides a suite of algorithms for optimizing neural network inference, supporting PyTorch, TensorFlow, ONNX, and OpenVINO. Key features include quantization, sparsity, and pruning, aimed at efficient model optimization with minimal accuracy loss.