VMamba
VMamba presents an innovative Visual State Space (VSS) model designed to integrate the Mamba state-space language model into a computationally efficient vision backbone. Utilizing the 2D Selective Scan (SS2D) module, VMamba facilitates contextual information gathering from both one-dimensional and two-dimensional data. It excels in visual perception tasks, offering enhanced input scaling efficiency on recognized benchmarks. Recent updates optimize the code for readability and introduce mamba2 support, with the model recently featured at NeurIPS2024.