Infrastructure & Agents
What Is Pipeline Parallelism?
Pipeline parallelism places consecutive layers of a model on separate devices, passing activations from one stage to the next. Batches are split into micro-batches so devices can work concurrently and reduce idle time.
Further reading
Read more about pipeline parallelism — articles and blogs from around the web: