mae_st
A PyTorch implementation of 'Masked Autoencoders As Spatiotemporal Learners' for video. The repository provides pre-trained checkpoints for the Kinetics series of datasets, interactive visualization demos, fine-tuning support, and pre-training instructions. It is a modified version of the MAE repository and targets PyTorch 1.8.1+. The visualization demos let you inspect model outputs at different mask rates.
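
To make "mask rate" concrete, here is a minimal pure-Python sketch of random masking over a grid of spacetime patches. It is an illustration only, not code from this repository: the function name `random_spacetime_mask`, its parameters, and the 8 x 14 x 14 patch grid are all assumptions chosen for the example.

```python
import random


def random_spacetime_mask(num_frames, grid_h, grid_w, mask_ratio, seed=None):
    """Randomly split flat spacetime patch indices into kept and masked sets.

    Hypothetical helper for illustration; not part of the mae_st code base.
    Patches are indexed 0 .. num_frames * grid_h * grid_w - 1.
    """
    if not 0.0 <= mask_ratio < 1.0:
        raise ValueError("mask_ratio must be in [0, 1)")
    total = num_frames * grid_h * grid_w
    # Keep at least one patch so an encoder would always have some input.
    num_keep = max(1, int(total * (1.0 - mask_ratio)))
    rng = random.Random(seed)
    order = list(range(total))
    rng.shuffle(order)  # uniform random permutation of patch indices
    kept = sorted(order[:num_keep])
    masked = sorted(order[num_keep:])
    return kept, masked


if __name__ == "__main__":
    # Compare how many patches stay visible at several mask rates.
    for ratio in (0.5, 0.75, 0.9):
        kept, masked = random_spacetime_mask(8, 14, 14, ratio, seed=0)
        print(f"mask_ratio={ratio}: kept={len(kept)} masked={len(masked)}")
```

A higher mask rate leaves fewer visible patches for the encoder and more patches for the decoder to reconstruct, which is the effect the visualization demos let you compare.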