Nikhil Paleti

Blog

Notes on machine learning systems, GPU performance, and things I had to learn the hard way.

Positional Encoding Explained

A rebuilt and expanded version of my transformer positional encoding essay, now with cleaner math, figures, and implementation notes.