The Definitive Guide to AI Data Centers

Expert parallelism · EP

Distributing a Mixture-of-Experts model's experts across GPUs, routing tokens to whichever GPU holds the chosen expert.
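
The routing described above can be sketched in a few lines of plain Python. This is a minimal illustration, not a real framework's API: the function names `expert_to_gpu` and `dispatch`, the contiguous expert placement, and top-1 routing are all assumptions made for the example. Production systems typically perform the actual token exchange with an all-to-all collective between GPUs, which is not modeled here.

```python
from collections import defaultdict


def expert_to_gpu(expert_id, num_experts, num_gpus):
    """Hypothetical placement: each GPU holds a contiguous block of experts."""
    if num_experts % num_gpus != 0:
        raise ValueError("num_experts must be divisible by num_gpus")
    experts_per_gpu = num_experts // num_gpus
    return expert_id // experts_per_gpu


def dispatch(router_scores, num_gpus):
    """Group (token index, expert id) pairs by the GPU that holds the expert.

    router_scores[t][e] is the router's score for token t and expert e.
    Top-1 routing is assumed: each token goes to its highest-scoring expert.
    """
    num_experts = len(router_scores[0])
    per_gpu = defaultdict(list)
    for token_idx, scores in enumerate(router_scores):
        expert_id = max(range(num_experts), key=scores.__getitem__)
        gpu = expert_to_gpu(expert_id, num_experts, num_gpus)
        per_gpu[gpu].append((token_idx, expert_id))
    return dict(per_gpu)


# Made-up scores for 3 tokens over 4 experts, spread across 2 GPUs.
scores = [
    [0.1, 0.7, 0.1, 0.1],  # token 0 picks expert 1 (on GPU 0)
    [0.2, 0.1, 0.1, 0.6],  # token 1 picks expert 3 (on GPU 1)
    [0.5, 0.2, 0.2, 0.1],  # token 2 picks expert 0 (on GPU 0)
]
plan = dispatch(scores, num_gpus=2)
print(plan)  # {0: [(0, 1), (2, 0)], 1: [(1, 3)]}
```

Each entry in `plan` lists the tokens a GPU would receive and which of its local experts should process them. After the experts run, the outputs would be sent back to the GPU each token came from.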
