EDGS
Create a 3D model from video or images
Generative approaches for visual synthesis, Invertible deep models for explainable AI, Deep metric and representation learning, self-supervised learning paradigms
Welcome to CompVis!
We host public weights for Latent Diffusion and Stable Diffusion models. There are several options to choose from, please check the details below.
Stable Diffusion Models
Stable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input. For more information about how Stable Diffusion works, please have a look at π€'s Stable Diffusion with 𧨠Diffusers blog.
We recommend you use Stable Diffusion with π€ Diffusers library. You can also use the original CompVis code. There are variants of the weights depending on:
Please, refer to the details in the following table to choose the weights appropriate for your use.
Model | Library | Details |
---|---|---|
stable-diffusion-v1-1 | π€ Diffusers | 237k steps at resolution 256x256 on laion2B-en. 194k steps at resolution 512x512 on laion-high-resolution. |
stable-diffusion-v1-2 | π€ Diffusers | v1-1 plus: 515k steps at 512x512 on "laion-improved-aesthetics". |
stable-diffusion-v1-3 | π€ Diffusers | v1-2 plus: 195k steps at 512x512 on "laion-improved-aesthetics", with 10% dropping of text-conditioning. |
stable-diffusion-v1-4 | π€ Diffusers | v1-2 plus: 225k steps at 512x512 on "laion-aesthetics v2 5+", with 10% dropping of text conditioning. |
stable-diffusion-v-1-1-original | CompVis | 237k steps at resolution 256x256 on laion2B-en. 194k steps at resolution 512x512 on laion-high-resolution. |
stable-diffusion-v-1-2-original | CompVis | v1-1 plus: 515k steps at 512x512 on "laion-improved-aesthetics". |
stable-diffusion-v-1-3-original | CompVis | v1-2 plus: 195k steps at 512x512 on "laion-improved-aesthetics", with 10% dropping of text-conditioning. |
stable-diffusion-v-1-4-original | CompVis | v1-2 plus: 225k steps at 512x512 on "laion-aesthetics v2 5+", with 10% dropping of text conditioning. |