What is SDXL?
Stable Diffusion XL — the standard 1024×1024 resolution image generation model. Requires 8+ GB VRAM for practical GPU-accelerated generation. Benchmark: generation time in seconds.
Full Explanation
Stable Diffusion XL (SDXL) is Stability AI's 6.6B parameter image generation model, producing 1024×1024 images. It requires approximately 8 GB VRAM to run at full resolution without memory optimization tricks. On the RTX 5070, a 20-step SDXL generation completes in ~2.5 seconds. On the RX 9060 XT with ROCm, ~4 seconds. On a CPU-only mini PC, the same generation takes 3–8 minutes, making iterative creative work impractical. Flux.1 (2024) has largely superseded SDXL in quality but requires similar hardware.
Why It Matters for Local AI
SDXL generation time is the standard GPU benchmark for image generation workloads. If you plan to use Stable Diffusion or Flux seriously, target under 10 seconds per image — which means 8+ GB VRAM GPU hardware is mandatory.
Hardware Relevant to SDXL
gpu · Check Price on Amazon · 12 GB VRAM · 672 GB/s
gpu · Check Price on Amazon · 12 GB VRAM · 672 GB/s
gpu · Check Price on Amazon · 16 GB VRAM · 288 GB/s
Related Terms
VRAM→
Video RAM — dedicated memory on a GPU. Determines the maximum model size you can run with full GPU acceleration. Once a model exceeds VRAM, it spills to system RAM over the slow PCIe bus.
CUDA→
NVIDIA's proprietary parallel computing platform. Industry standard for AI/ML. Nearly every AI framework (PyTorch, Ollama, ComfyUI) supports CUDA natively and first.
ROCm→
AMD's open-source GPU compute platform — AMD's answer to NVIDIA CUDA. Required for GPU-accelerated AI on AMD cards. Mature on Linux; less reliable on Windows.
ComfyUI→
The node-based GUI for Stable Diffusion and Flux image generation. Industry standard for advanced AI image workflows. Requires a CUDA GPU for practical speeds; AMD ROCm on Linux works.