A collective operation that sums gradients across all GPUs and shares the result, the dominant traffic in training.
← All terms