lightning-thunder
The Lightning Thunder project enhances PyTorch models' performance by utilizing a source-to-source compiler. Supporting both single and multi-GPU architectures, it integrates advanced executors like nvFuser, torch.compile, and cuDNN. Achieving up to a 40% increase in training speed, Thunder offers substantial efficiency improvements, making it a valuable asset for machine learning development. As the tool is currently in its alpha stage, it encourages contributions and exploration of its capabilities.