Skip to content

Posts

every agent framework, ranked

May 2, 2026

Every agent framework is a disagreement about how much to trust the model.

inference engineering

May 1, 2026

Your GPU is mostly idle during text generation. The entire inference stack exists to fix that.

the annotated microgpt

Apr 2, 2026

Karpathy's nanoGPT, annotated line by line.

what happens when you press 'submit' on chatgpt

Mar 26, 2026

From raw internet text to a streaming response: pretraining, alignment, inference.

attention residuals

Mar 18, 2026

The residual connection solved deep learning in 2015. Ten years later, Moonshot AI noticed it's been sabotaging deep networks the whole time. The fix is 30 lines of PyTorch.

building pytorch from scratch

Mar 11, 2026

Building PyTorch from scratch in ~1300 lines - reverse-mode autograd, conv2d, optimizers. Trains MNIST. No C++, no CUDA, just closures and NumPy.

my favourite interview questions

Feb 13, 2026

Five questions I actually ask ML candidates.

building rag for my website

Jan 27, 2026

My blogs are too long so I built a RAG over them. Chunking, FAISS, reranking, streaming via Groq - and an eval pipeline because vibes aren't metrics.

the math behind deep learning

Jan 21, 2026

The math behind deep learning, with code for every concept.

numpy + pytorch from scratch

Jan 7, 2026

Ground-up NumPy and PyTorch - we build MNIST classifiers two ways to see how the pieces fit together.

how to read a paper (deepseekv3)

Jan 6, 2026

Taking DeepSeekV3 apart: MLA, MoE routing, multi-token prediction. And how to read a 53-page paper.

scaling laws

Jan 6, 2026

Maybe intelligence is just a lot of compute applied to a lot of data having a lot of parameters. OpenAI wrote a paper on this.

the yolo story (v1 → v10)

Nov 21, 2025

The full YOLO story from v1 to v10 - grid system, loss function, NMS, mAP, and a PyTorch implementation.

i gave claude a spotify remote

Oct 25, 2025

I built an MCP server so Claude can control my Spotify.

hello world of neural networks

Oct 25, 2025

Neural nets from scratch. NumPy only. Build a digit classifier.

some python tricks

Oct 25, 2025

Random Python tricks I've picked up.

i forked cpython and added a 'nuke' statement

Apr 27, 2025

Forking CPython, modifying the grammar, adding a 'nuke' statement, and other things I found digging through Python's C source.

things i like

Jan 1, 2025