Jae's Tech Blog
Home Archive About Game

Posts tagged "data-parallel"

January 9, 2026 undefined min read

Distributed LLM Training 02 - The Real Cost of Synchronous SGD and Data Parallelism

Data parallelism looks simple, but it carries both gradient synchronization cost and full model-state replication cost

Lectures
Read more

© 2025 Jae ยท Notes on systems, software, and building things carefully.

RSS