BentoDiffusion
This guide illustrates the deployment and self-hosting of diffusion models with BentoML, specifically focusing on Stable Diffusion models for generating images and video from text prompts. It provides instructions to set up the SDXL Turbo model with an Nvidia GPU (minimum 12GB VRAM), details dependency installation, and local BentoML service execution. Interaction is possible through Swagger UI or cURL. For scalable solutions, it includes guidance on deploying to BentoCloud. The repository supports various models such as ControlNet, Latent Consistency Model, and Stable Video Diffusion, ensuring efficient deployment for both local and cloud environments.