#sequence modeling
keras-tcn
Keras Temporal Convolutional Network provides a compelling option to LSTM and GRU for extended time series tasks like Sequential MNIST and Word-level PTB. It is known for its enhanced memory retention, superior performance, and stable gradients, supported by parallel architecture and flexible receptive fields. Suitable for TensorFlow 2.9 to 2.17, Keras TCN is designed for cross-platform installations with flexible configurations, including skip connections and batch normalization. Discover its potential through practical implementations like adding tasks and copy memory tasks for comprehensive sequence modeling.
s4
This repository offers insights into structured state space models, including S4 and its variants, for sequence modeling. It comprises implementations, training scripts, and kernel optimizations for long sequence processing with PyTorch. Features include customizable Hydra configurations, flexible integration with other repositories, and sophisticated generation capabilities. It supports model training and testing on datasets like MNIST, CIFAR, and WikiText, utilizing CUDA and Pykeops kernels for improved performance.
Feedback Email: [email protected]