What is Unified Memory?
Apple Silicon uses a single pool of fast RAM shared between the CPU and GPU. More unified memory means larger models can run entirely at full bandwidth, with no PCIe bottleneck.
Full Explanation
Apple's unified memory architecture places the CPU, GPU, and Neural Engine on the same die with a shared high-bandwidth memory pool. On the M4 Pro, this pool runs at 273 GB/s: slower than an RTX 5070's GDDR7, but dramatically faster than the PCIe path a discrete GPU falls back on when a model overflows its VRAM. The critical advantage is capacity: a Mac Mini M4 Pro with 48 GB of unified memory can fully accelerate a 70B-parameter model at Q4, something no consumer GPU under $1,000 can do.
Why It Matters for Local AI
For running 70B models, unified memory Macs are currently the only sub-$2,000 option. A 16 GB M4 Mac Mini tops out at 13B models. The 24 GB M4 Pro comfortably runs 13B models and barely fits some 32B at Q4. The 48 GB M4 Pro config is the practical ceiling for consumer local AI.
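The size claims above can be sanity-checked with a rough sketch. It assumes Q4 weights take about 0.5 GB per billion parameters, plus roughly 15% overhead for KV cache and activations, and reserves a few GB for macOS itself; all three figures are assumptions for illustration, not Apple specs.

```python
# Hypothetical fit check: which Mac configs can hold which Q4 models?
# Assumptions: Q4 weights ~= 0.5 GB per billion params, ~15% runtime
# overhead, and ~4 GB reserved for the OS (all rough estimates).
CONFIGS_GB = {"M4 16GB": 16, "M4 Pro 24GB": 24, "M4 Pro 48GB": 48}
MODELS_B = {"13B": 13, "32B": 32, "70B": 70}
OS_RESERVE_GB = 4

for config, mem in CONFIGS_GB.items():
    fits = [name for name, params in MODELS_B.items()
            if params * 0.5 * 1.15 <= mem - OS_RESERVE_GB]
    print(f"{config}: {', '.join(fits) or 'none'}")
```

Under these assumptions the 16 GB config stops at 13B, the 24 GB config just squeezes in 32B (about 18 GB of weights plus overhead against about 20 GB of usable memory), and only the 48 GB config holds 70B at Q4.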
Hardware Relevant to Unified Memory
Mac Mini M4 (mini-PC) · 16 GB unified memory · 120 GB/s
Mac Mini M4 Pro (mini-PC) · 24 GB unified memory · 273 GB/s
Related Terms
VRAM
Video RAM — dedicated memory on a GPU. Determines the maximum model size you can run with full GPU acceleration. Once a model exceeds VRAM, it spills to system RAM over the slow PCIe bus.
Memory Bandwidth
How fast data moves between memory and the processor, measured in GB/s. Tokens per second scales nearly linearly with bandwidth, which makes this the single most important GPU spec for LLM generation speed.
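The near-linear scaling follows from decoding being memory-bound: each generated token reads every weight once, so the speed ceiling is roughly bandwidth divided by model size. A minimal sketch, where the 0.7 utilization factor is an assumption (real efficiency varies by framework and chip):

```python
def est_tokens_per_sec(bandwidth_gbs: float, model_gb: float,
                       efficiency: float = 0.7) -> float:
    """Rough decode-speed ceiling for a memory-bound LLM.

    Each token requires reading all weights once, so the ceiling is
    bandwidth / model size. The 0.7 efficiency factor is an assumed
    utilization figure, not a measured constant.
    """
    return bandwidth_gbs / model_gb * efficiency

# M4 Pro (273 GB/s) on a ~19 GB 32B Q4 model: roughly 10 tok/s
print(round(est_tokens_per_sec(273, 19), 1))
```

Doubling bandwidth at the same model size roughly doubles the estimate, which is why bandwidth, not compute, dominates generation speed.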
Quantization
Compressing a model by reducing numeric precision. Q4 = 4-bit (smallest, fastest), Q8 = 8-bit (balanced), FP16 = full precision. Fewer bits mean less VRAM is required, at a slight cost in quality.
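The VRAM savings fall straight out of the bit widths: weight size is parameters × bits / 8 bytes. A quick illustration for a hypothetical 7B model (weights only, ignoring KV cache and runtime overhead):

```python
# Weight-only footprint of a hypothetical 7B model at each precision.
# size_gb = params_in_billions * bits / 8 (1B params at 8 bits ~= 1 GB)
PARAMS_B = 7  # billions of parameters

for name, bits in [("Q4", 4), ("Q8", 8), ("FP16", 16)]:
    gb = PARAMS_B * bits / 8
    print(f"{name}: {gb:.1f} GB")
```

Q4 cuts the footprint to a quarter of FP16 (3.5 GB vs 14 GB here), which is what lets mid-size models fit on consumer hardware.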
MLX
Apple's open-source machine learning framework optimized for Apple Silicon. Enables fast LLM inference on M-series chips using the unified memory architecture natively.