Jae's Tech Blog
Home Archive About Game

Posts tagged "cuda"

January 26, 2026 undefined min read

PyTorch Internals 08 - CUDA Streams, Events, and Asynchronous Execution

Many PyTorch CUDA operations are asynchronous, so timing, synchronization, and dependency need to be reasoned about explicitly

Lectures
Read more
February 16, 2026 undefined min read

PyTorch Internals 15 - Reading Operator Bottlenecks with PyTorch Profiling

The purpose of internals knowledge is to make a performance trace interpretable enough that you can actually change it

Lectures
Read more
โ† Previous
1 2 3
Next โ†’

© 2025 Jae ยท Notes on systems, software, and building things carefully.

RSS