intel-extension-for-pytorch
The Intel® Extension for PyTorch* enhances PyTorch with optimized features for improved performance on Intel hardware. Leveraging advanced instructions like Intel® AVX-512 and AI engines such as XMX, it supports CPU and GPU acceleration to maximize efficiency. Specific optimizations provide up to 30% performance improvement on Large Language Models (LLMs), starting from version 2.1.0, notably improving accuracy of renowned models including LLAMA and GPT. Additionally, module-level optimization APIs from release 2.3.0 offer enhanced alternatives for customized LLMs, ensuring continued advancements in Generative AI applications.