Jae's Tech Blog

Posts tagged "infiniband"

January 24, 2026

Distributed LLM Training 07 - NCCL and Topology: Why the Same GPU Count Can Behave Very Differently

In distributed training, performance is often shaped more by how GPUs are connected than by the raw number of GPUs.

Lectures

© 2025 Jae · Notes on systems, software, and building things carefully.
