matmulfreellm
Discover the MatMul-Free LM, a groundbreaking architecture that removes matrix multiplication, optimized for the Transformers library. Leveraging efficient ternary weights, it outperforms traditional models such as Transformer++ in computational efficiency. This model ranges from 370M to 2.7B parameters, ensuring easy integration with PyTorch, Triton, and einops for seamless language model deployment.