Search results
Feb 22, 2024 · Announcing Stable Diffusion 3 in early preview, our most capable text-to-image model with greatly improved performance in multi-subject prompts, image quality, and spelling abilities.
Stable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images from any text input. It cultivates autonomous freedom to produce incredible imagery and empowers billions of people to create stunning art within seconds.
New Stable Diffusion model (Stable Diffusion 2.0-v) at 768x768 resolution. It has the same number of parameters in the U-Net as 1.5, but uses OpenCLIP-ViT/H as the text encoder and is trained from scratch. SD 2.0-v is a so-called v-prediction model.
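For context on the term "v-prediction": in the parameterization proposed by Salimans and Ho (2022), the network predicts a velocity-like target v = alpha_t * eps - sigma_t * x0 rather than the noise eps. The following is a minimal sketch, assuming a variance-preserving schedule (alpha_t^2 + sigma_t^2 = 1). The function names and sample values are illustrative and are not taken from the Stable Diffusion codebase.

```python
import math

def noisy_sample(x0, eps, alpha, sigma):
    """Forward diffusion: x_t = alpha * x0 + sigma * eps (elementwise)."""
    return [alpha * x + sigma * e for x, e in zip(x0, eps)]

def v_target(x0, eps, alpha, sigma):
    """v-prediction training target: v = alpha * eps - sigma * x0."""
    return [alpha * e - sigma * x for x, e in zip(x0, eps)]

def recover_x0(x_t, v, alpha, sigma):
    """Given x_t and v, the clean sample is x0 = alpha * x_t - sigma * v."""
    return [alpha * xt - sigma * vi for xt, vi in zip(x_t, v)]

def recover_eps(x_t, v, alpha, sigma):
    """Given x_t and v, the noise is eps = sigma * x_t + alpha * v."""
    return [sigma * xt + alpha * vi for xt, vi in zip(x_t, v)]

# Hypothetical toy values: a 3-element "image" and a noise vector.
phi = 0.3                                  # angle parameterizing the noise level
alpha, sigma = math.cos(phi), math.sin(phi)  # satisfies alpha^2 + sigma^2 = 1
x0 = [0.5, -1.0, 2.0]
eps = [0.1, 0.7, -0.4]

x_t = noisy_sample(x0, eps, alpha, sigma)
v = v_target(x0, eps, alpha, sigma)
x0_back = recover_x0(x_t, v, alpha, sigma)
eps_back = recover_eps(x_t, v, alpha, sigma)
```

Because both x0 and eps can be recovered from (x_t, v), a model trained on this target carries the same information as a noise-prediction model; the two differ in how the loss is weighted across noise levels.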
Jun 1, 2024 · Stable Diffusion is a technique in the field of generative artificial intelligence (AI) that aims to generate high-quality images. It is an extension of diffusion probabilistic models, which are a class of generative models used for image generation.
Stable Diffusion is a text-to-image model that generates photo-realistic images given any text input. What makes Stable Diffusion unique? It is completely open source.
Stable Diffusion is a deep learning, text-to-image model released in 2022 based on diffusion techniques. The generative artificial intelligence technology is the premier product of Stability AI and is considered to be a part of the ongoing artificial intelligence boom.
Discover amazing ML apps made by the community.
Stable Diffusion 3 Medium is the latest and most advanced text-to-image AI model in our Stable Diffusion 3 series, comprising two billion parameters. It excels in photorealism, processes complex prompts, and generates clear text.
Stable Diffusion is a latent text-to-image diffusion model. Thanks to a generous compute donation from Stability AI and support from LAION, we were able to train a Latent Diffusion Model on 512x512 images from a subset of the LAION-5B database.
Stable Diffusion 3 is an advanced text-to-image model designed to create detailed and realistic images based on user-generated text prompts. It leverages a diffusion transformer architecture and flow matching technology to enhance image quality and speed of generation, making it a powerful tool for artists, designers, and content creators.
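To illustrate what "flow matching" refers to, here is a minimal sketch of the straight-line (rectified flow) variant, assuming the interpolation x_t = (1 - t) * x0 + t * eps with regression target eps - x0. The function names are hypothetical, and an oracle velocity stands in for the trained network; this is not Stability AI's implementation.

```python
def interpolate(x0, eps, t):
    """Straight-line path between data (t=0) and noise (t=1)."""
    return [(1.0 - t) * x + t * e for x, e in zip(x0, eps)]

def velocity_target(x0, eps):
    """Regression target: d(x_t)/dt = eps - x0, constant along the path."""
    return [e - x for x, e in zip(x0, eps)]

def euler_sample(x_start, velocity_fn, steps):
    """Integrate dx/dt = v(x, t) from t=1 (noise) down to t=0 (data)."""
    x = list(x_start)
    dt = 1.0 / steps
    for i in range(steps):
        t = 1.0 - i * dt
        v = velocity_fn(x, t)
        x = [xi - dt * vi for xi, vi in zip(x, v)]
    return x

# Hypothetical toy values; a real model would learn the velocity field.
x0 = [0.5, -1.0, 2.0]
eps = [0.1, 0.7, -0.4]

def oracle_velocity(x, t):
    # Stand-in for a trained network: returns the exact target velocity.
    return velocity_target(x0, eps)

sample = euler_sample(eps, oracle_velocity, steps=10)
```

With the exact velocity, integrating from the noise sample back to t = 0 lands on x0 up to floating-point error. Straight paths of this kind are what allow sampling in relatively few steps.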