The Definitive Guide to AI Data Centers

TTFT · Time To First Token

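How long an inference request waits before its first output token appears. It is a key latency SLO, determined mainly by the prefill phase (processing the input prompt), plus any time the request spends queued.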
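As a rough illustration, TTFT can be measured on the client side by timing the gap between issuing a request and receiving the first streamed token. The sketch below is a minimal example under stated assumptions: `measure_ttft` and `fake_stream` are hypothetical names, and the stream is a stand-in generator whose initial delay imitates queueing and prefill, not a real inference API.

```python
import time
from typing import Callable, Iterable, Iterator, Optional, Tuple


def measure_ttft(
    stream: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Optional[float], Optional[str]]:
    """Return (seconds until the first token, first token).

    Returns (None, None) if the stream ends without producing a token.
    Assumes the stream is lazy, so work only starts once iteration begins.
    """
    start = clock()
    for token in stream:
        # Stop at the first token: everything before it counts as TTFT.
        return clock() - start, token
    return None, None


def fake_stream(prefill_seconds: float = 0.05) -> Iterator[str]:
    """Hypothetical stand-in for a streaming inference response."""
    time.sleep(prefill_seconds)  # imitates queueing + prefill before token 1
    for token in ("Hello", ",", " world"):
        yield token


if __name__ == "__main__":
    ttft, first = measure_ttft(fake_stream(0.05))
    print(f"first token {first!r} after {ttft * 1000:.1f} ms")
```

With a real service, the clock would start when the request is sent, so a client-side measurement also includes network time in addition to queueing and prefill.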
