CogView
CogView uses a 4 billion parameter transformer model for general text-to-image generation. It includes code releases and demos, with PB-relax and Sandwich-LN techniques for stable transformer training. While supporting multiple languages, CogView primarily uses Chinese text input with recommended English translations. It offers pretrained models, inference, and super-resolution features, along with detailed setup instructions for various environments, suitable for complex AI tasks, including both single and multi-node training.