Hardware & Architecture

What is NVMe SSD?

High-speed solid-state storage that uses the PCIe bus. It affects how quickly models load into memory at startup: a PCIe 4.0 NVMe drive loads a 7B model (about 4 GB at Q4) in ~2 seconds, versus ~15 seconds on a SATA SSD.

Full Explanation

NVMe (Non-Volatile Memory Express) SSDs connect directly to the PCIe bus. Sequential read speeds range from about 3,500 MB/s on PCIe 3.0 drives to 5,000–7,500 MB/s on PCIe 4.0 and up to roughly 14,000 MB/s on PCIe 5.0, versus 500–600 MB/s for SATA SSDs. For local AI, storage speed mainly affects model load time: the delay between starting Ollama with a model that is not yet in memory and generating the first token. A 4 GB Q4 7B model loads in roughly 1–2 seconds on a PCIe 4.0 NVMe drive, 3–5 seconds on PCIe 3.0, and 12–18 seconds on a SATA SSD. Treat these as rough figures; actual times vary with the drive, the system, and the loader. Once the model is loaded, inference speed is determined by VRAM/RAM bandwidth, not storage.
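The relationship between file size, sequential read speed, and load time can be sanity-checked with simple arithmetic. The sketch below computes only the raw transfer time, which is a lower bound on load time. The function name and the throughput values are illustrative assumptions, not measurements.

```python
# Back-of-envelope lower bound on model load time: file size divided by
# sequential read throughput. Real load times are higher than this.

def min_load_seconds(model_size_gb: float, read_mb_per_s: float) -> float:
    """Return the raw transfer time in seconds (using 1 GB = 1000 MB)."""
    if read_mb_per_s <= 0:
        raise ValueError("read_mb_per_s must be positive")
    return model_size_gb * 1000 / read_mb_per_s


# Assumed typical sequential read speeds in MB/s (illustrative only).
ASSUMED_READ_MB_PER_S = {
    "SATA SSD": 550,
    "PCIe 3.0 NVMe": 3500,
    "PCIe 4.0 NVMe": 7000,
}

if __name__ == "__main__":
    for name, speed in ASSUMED_READ_MB_PER_S.items():
        seconds = min_load_seconds(4.0, speed)
        print(f"{name}: at least {seconds:.1f} s for a 4 GB model")
```

With these assumed speeds, a 4 GB file needs at least about 7.3 s on SATA, 1.1 s on PCIe 3.0, and 0.6 s on PCIe 4.0. The load times quoted above are higher than these bounds, which suggests that loading involves more than the raw file read.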

Why It Matters for Local AI

NVMe matters most if you frequently switch between models, a common pattern when running several specialized models for different tasks. If you load one model and keep it running, a SATA SSD is adequate. Nearly all current mini PCs and Macs ship with NVMe storage; when comparing storage specs, check the PCIe generation (4.0 preferred).
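On a Linux machine you already own, one way to check the PCIe generation is to read the negotiated link speed of the NVMe controller. The sketch below makes several assumptions: the controller is named nvme0, sysfs exposes current_link_speed under /sys/class/nvme/<controller>/device, and the helper names are hypothetical. The result is the link the drive negotiated with its slot, which can be lower than the drive's rated generation.

```python
from pathlib import Path
from typing import Optional

# Per-lane transfer rate in GT/s for each PCIe generation.
GT_PER_S_TO_GENERATION = {
    2.5: "1.0",
    5.0: "2.0",
    8.0: "3.0",
    16.0: "4.0",
    32.0: "5.0",
}


def pcie_generation(link_speed: str) -> Optional[str]:
    """Map a string such as '16.0 GT/s PCIe' to a generation such as '4.0'."""
    try:
        rate = float(link_speed.split()[0])
    except (IndexError, ValueError):
        return None
    return GT_PER_S_TO_GENERATION.get(rate)


def nvme_link_generation(controller: str = "nvme0") -> Optional[str]:
    """Read the negotiated link speed from sysfs (assumed Linux path)."""
    path = Path("/sys/class/nvme") / controller / "device" / "current_link_speed"
    try:
        return pcie_generation(path.read_text().strip())
    except OSError:
        # Not Linux, no such controller, or the attribute is missing.
        return None


if __name__ == "__main__":
    print(nvme_link_generation() or "unknown")
```

When the result is None, the script could not determine the generation; the drive's product page or the system's spec sheet remains the fallback.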

Hardware Relevant to NVMe SSD

Samsung 990 PRO Heatsink SSD 4TB NVMe M.2

Samsung SSD 9100 PRO 2TB PCIe 5.0 NVMe

OWC Envoy Express Thunderbolt NVMe Enclosure

Related Terms