s4
This repository offers insights into structured state space models, including S4 and its variants, for sequence modeling. It comprises implementations, training scripts, and kernel optimizations for long sequence processing with PyTorch. Features include customizable Hydra configurations, flexible integration with other repositories, and sophisticated generation capabilities. It supports model training and testing on datasets like MNIST, CIFAR, and WikiText, utilizing CUDA and Pykeops kernels for improved performance.