Jae's Tech Blog

February 3, 2026 undefined min read

GPU Systems 03 - Memory Hierarchy and Bandwidth

How to think about the GPU memory hierarchy and bandwidth bottlenecks

Lectures

February 7, 2026 undefined min read

The optimization patterns that keep showing up in CUDA kernels

Lectures

February 17, 2026 undefined min read

Why tiled matrix multiplication and shared memory create such a big performance difference

Lectures

February 19, 2026 undefined min read

Why shared memory is not automatically fast and how bank conflicts appear

Lectures