Jae's Tech Blog

Posts tagged "systems"

January 6, 2026

Distributed LLM Training 01 - Why LLM Training Becomes a Distributed Systems Problem

Once LLM training leaves a single GPU, it stops being only a modeling problem and becomes a systems problem of memory, communication, and recovery.

Lectures
January 28, 2026

GPU Systems 00 - What You Should Know Before Starting This Series

The background knowledge that makes the GPU Systems series much easier to study properly

Lectures

© 2025 Jae · Notes on systems, software, and building things carefully.
