extension-cpp
Learn how to create custom C++ and CUDA extensions in PyTorch to improve computational efficiency. The project features a 'mymuladd' operation with support for both CPU and CUDA, compatible with PyTorch 2.4+. Includes straightforward build and test commands and a benchmark comparison across Python, C++, and CUDA implementations. Developed by Peter Goldsborough and Richard Zou, it supports developers in enhancing PyTorch application performance.