Keynote and Invited Speakers

IMPORTANT DATES

Submission: August 20th, 2022
Notification: September 6th, 2022
Final Pre-Workshop papers: October 1st, 2022

ABOUT LCPC 2022

Keynotes

Invited Speakers

Saeed Maleki
GPU Collectives with MSCCL: Man vs. Dragons
Abstract: Collective communication primitives on GPUs are the primary bottleneck on large neural network models. Although there have been decades of research on optimizing computation kernels, there has been very little done for collective communication kernels on GPUs. There are many challenges in area including unique GPU interconnection topologies, high P2P transfer latency, wide range of use cases for neural networks, and software complexities. In this talk, I will present program synthesis as a primary solution for communication algorithms for these topologies and show how a bespoke algorithm can significantly improve the overall performance of a model. Lastly, I will present a high-level DSL along with a compiler for mapping from an abstract synthesized algorithm to a low-level CUDA code for collective communications.
BIO