Jae's Tech Blog

Posts tagged "llm"

January 6, 2026

Distributed LLM Training 01 - Why LLM Training Becomes a Distributed Systems Problem

Once LLM training leaves a single GPU, it stops being only a modeling problem and becomes a systems problem around memory, communication, and recovery.

Lectures

March 4, 2026

Distributed LLM Training 20 - A Practical Order for Designing an LLM Training Stack

Distributed training architecture is not about collecting fashionable techniques, but about choosing the smallest structure that matches the current bottleneck.

Lectures

January 30, 2026

GPU Systems 01 - Roadmap to GPU Kernel Engineering

A practical study order from GPU architecture to CUDA, Triton, and kernel optimization

Lectures

February 9, 2026

GPU Systems 06 - Triton and the Practical Shape of Kernel Optimization

How Triton fits into real kernel optimization work, especially for LLM-style workloads

Lectures

© 2025 Jae · Notes on systems, software, and building things carefully.
