What is eGPU?
External GPU — a discrete GPU connected via Thunderbolt to a laptop or mini PC. Enables GPU-accelerated LLM inference on machines without a built-in GPU slot.
Full Explanation
An eGPU enclosure houses a full-size PCIe GPU and connects to a host computer via Thunderbolt 3/4/5. The GPU handles inference while the host CPU runs the OS and applications. Thunderbolt 5 (80 Gbps symmetric, up to 120 Gbps one way with Bandwidth Boost) is the practical minimum for serious eGPU LLM use; older Thunderbolt 3/4 at 40 Gbps creates a bandwidth bottleneck that slows model loading and layer offloading and can roughly halve effective tokens per second compared to a native PCIe slot. Apple Silicon Macs do not support eGPUs at all (only Intel Macs did); eGPU is a Windows/Linux strategy.
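The link-bandwidth point above is easy to see with back-of-envelope arithmetic. A minimal sketch (the model size and the nominal link rates below are illustrative assumptions; real-world throughput is lower due to protocol overhead):

```python
# Rough estimate of how long it takes to move model weights into VRAM
# over different links. Rates are nominal; assume real throughput is lower.

def transfer_seconds(model_gb: float, link_gbps: float) -> float:
    """Seconds to move model_gb gigabytes over a link rated at link_gbps gigabits/s."""
    return model_gb * 8 / link_gbps

model_gb = 13.0  # e.g. a ~13 GB quantized model file (illustrative)
for name, gbps in [("Thunderbolt 3/4 (40 Gbps)", 40),
                   ("Thunderbolt 5  (80 Gbps)", 80),
                   ("PCIe 4.0 x16 (~256 Gbps)", 256)]:
    print(f"{name}: {transfer_seconds(model_gb, gbps):.1f} s")
```

The same gap applies whenever layers spill to system RAM mid-inference, which is why link speed shows up in tokens per second and not just in load times.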
Why It Matters for Local AI
eGPU is the path to GPU-accelerated LLM inference on small-form-factor Windows PCs or mini ITX builds that lack a PCIe x16 slot. With Thunderbolt 5, a mini PC like the Geekom A6 can drive an RTX 5070 externally with acceptable bandwidth overhead — expanding its AI capability significantly.
Related Terms
Thunderbolt 5
Intel's latest Thunderbolt standard — 80 Gbps symmetric bandwidth, up to 120 Gbps one way with Bandwidth Boost. Enables high-bandwidth eGPU enclosures, fast NVMe storage, and 8K display output from compact AI machines.
PCIe
Peripheral Component Interconnect Express — the bus that connects a discrete GPU to the motherboard. PCIe 4.0 or 5.0 is needed for fast model offloading when VRAM is exceeded.
VRAM
Video RAM — dedicated memory on a GPU. Determines the maximum model size you can run with full GPU acceleration. Once a model exceeds VRAM, it spills to system RAM over the slow PCIe bus.
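The "fits in VRAM" question can be estimated before buying anything. A hedged rule-of-thumb sketch (the 20% overhead allowance for KV cache and activations is an assumption, not a guarantee):

```python
# Back-of-envelope check: will a quantized model fit in a GPU's VRAM?
# Assumed rule of thumb: weight bytes ≈ params × bits_per_weight / 8,
# plus ~20% headroom for KV cache and activations.

def fits_in_vram(params_b: float, bits_per_weight: float, vram_gb: float,
                 overhead: float = 0.20) -> bool:
    weights_gb = params_b * bits_per_weight / 8  # billions of params -> GB
    return weights_gb * (1 + overhead) <= vram_gb

print(fits_in_vram(7, 4, 12))    # 7B model at 4-bit in a 12 GB card
print(fits_in_vram(70, 4, 12))   # 70B at 4-bit clearly spills to system RAM
```

Anything that fails this check will run, but with layers spilling over the PCIe (or Thunderbolt) bus, which is exactly where the eGPU bandwidth penalty bites hardest.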
Tensor Cores
Specialized hardware units on NVIDIA GPUs designed for matrix multiplication — the core math operation in neural networks. 5th-gen Tensor Cores (Blackwell) are significantly faster than 4th-gen (Ada Lovelace) for AI inference.