The Definitive Guide to AI Data Centers

KV cache · Key-Value cache

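Attention key and value tensors stored for tokens already processed and reused at every decode step, so they are not recomputed for each new token. The cache grows linearly with context length and with the number of concurrent sequences, and at long contexts or high concurrency it often dominates inference memory.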
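
The scaling described above can be made concrete with a rough size estimate. The sketch below assumes standard multi-head or grouped-query attention, where each layer stores one key and one value vector per KV head for every token, and no cache compression beyond the chosen element width. The function name and the model shape in the example are hypothetical, picked only for illustration.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim,
                   context_len, concurrent_seqs, bytes_per_elem=2):
    """Estimate KV cache size in bytes (assumes no cache compression)."""
    # Per token: one key and one value vector (factor of 2)
    # for every KV head in every layer.
    per_token = 2 * num_layers * num_kv_heads * head_dim * bytes_per_elem
    # Linear in both context length and number of concurrent sequences.
    return per_token * context_len * concurrent_seqs


# Hypothetical model shape: 32 layers, 8 KV heads, head dimension 128,
# 16-bit elements, 8192-token context, 16 concurrent sequences.
size = kv_cache_bytes(32, 8, 128, context_len=8192, concurrent_seqs=16)
print(f"{size / 2**30:.1f} GiB")  # prints "16.0 GiB"
```

Doubling either the context length or the number of concurrent sequences doubles the estimate, which is why the cache, rather than the model weights, tends to set the memory limit as load increases.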
