Kushagra Rastogi
Kushagra Rastogi's contributions
Article
MPI-powered gradient synchronization in PyTorch distributed training
Kushagra Rastogi
Explore the mechanics of gradient synchronization in PyTorch distributed training, focusing on MPI primitives like All-Reduce and core techniques like pipeline parallelism, tensor parallelism, and sharded data parallelism.
Article
MPI-powered gradient synchronization in PyTorch distributed training
Kushagra Rastogi
Explore the mechanics of gradient synchronization in PyTorch distributed training, focusing on MPI primitives like All-Reduce and core techniques like pipeline parallelism, tensor parallelism, and sharded data parallelism.