1d-tokenizer
The 1d-tokenizer encodes a 256x256 image into 32 tokens, significantly speeding up the process and achieving approximately 410 times faster results than traditional models while maintaining quality. Accepted by NeurIPS 2024, this project introduces a compact framework that overcomes 2D constraints, enhancing image representation efficiency. It includes updates and multiple model sizes for both VQ and VAE, aiding research advancement in image tokenization with training and evaluation resources.