Project Icon

ppq

Scalable Neural Network Quantization for Diverse Industrial Uses

Product DescriptionThis advanced framework facilitates neural network quantization across various hardware platforms by transforming floating-point operations into fixed-point, enhancing chip design efficiency. It offers customizable quantization processes compatible with TensorRT and OpenVINO. The 0.6.6 version introduces FP8 quantization, upgraded Python APIs, and sophisticated graph fusion, providing adaptable solutions for evolving AI applications.
Project Details