What is Stable Diffusion
Model for generating images from text descriptions
Stable Diffusion is an open-source machine learning model for generating images from text descriptions, developed by Stability AI.
How It Works
- Text prompt is converted to embedding
- Model gradually removes noise from random image
- Guided by text description (CLIP)
- Result is an image matching the prompt
- Latent diffusion: works in compressed space
Capabilities
- Text-to-image generation
- Image editing (inpainting)
- Style transfer (img2img)
- Resolution upscaling
- Variation generation
Advantages
- Open source
- Runs on consumer GPUs
- High quality images
- Active community
- Many extensions and models
Business Applications
- Marketing materials creation
- Design prototyping
- Social media content generation
- Concept art and visualization
- Personalized images
Tools
- Automatic1111 WebUI
- ComfyUI
- InvokeAI
- DiffusionBee (macOS)
- Draw Things (iOS)
Versions
- SD 1.5 — base stable version
- SD 2.0/2.1 — improved quality
- SDXL — high resolution (1024x1024)
- SD 3 — newest architecture