Memory & Storage

What is Unified Memory?

Apple Silicon uses a single pool of fast RAM shared between CPU and GPU. Larger unified memory = larger models run entirely at full bandwidth — no PCIe bottleneck.

Full Explanation

Apple's unified memory architecture places CPU, GPU, and Neural Engine on the same die with a shared high-bandwidth memory pool. On the M4 Pro, this pool runs at 273 GB/s — slower than an RTX 5070's GDDR7 but dramatically faster than any discrete GPU's PCIe bus overflow path. The critical advantage is capacity: a Mac Mini M4 Pro with 48 GB unified memory can fully accelerate a 70B parameter model at Q4, something no consumer GPU under $1,000 can do.

Why It Matters for Local AI

For running 70B models, unified memory Macs are currently the only sub-$2,000 option. A 16 GB M4 Mac Mini tops out at 13B models. The 24 GB M4 Pro comfortably runs 13B models and barely fits some 32B at Q4. The 48 GB M4 Pro config is the practical ceiling for consumer local AI.

Hardware Relevant to Unified Memory

Apple Mac Mini (M4, 2024)

mini-pc · Check Price on Amazon · 16 GB Unified · 120 GB/s

Buy on AmazonAffiliate link — no extra cost to you
Apple Mac Mini (M4 Pro, 2024)

mini-pc · Check Price on Amazon · 24 GB Unified · 273 GB/s

Buy on AmazonAffiliate link — no extra cost to you

Related Terms