Vision-RWKV
Vision-RWKV is an AI project offering efficient and scalable solutions for visual perception through RWKV-like architectures. It excels in high-resolution image processing with a global receptive field, achieving superior performance and stability, especially after pre-training on large datasets. Outperforming window-based and global attention ViTs in classification tasks, it boasts lower flops and faster speeds. Recent support for RWKV6 further boosts classification performance. The project provides multiple pre-trained models on ImageNet, suited for object detection and semantic segmentation, with straightforward access to checkpoints and configuration files for customization.