AI-systems

A Unified Architecture for Accelerating Distributed DNN Training in Heteogeneous GPU/CPU Clusters

Nccl Allreduce