LLM Inference Illustrated

A book about autoregressive LLM inference and the techniques that make it fast, efficient, and scalable
Author

Ted Kyi

Published

Draft dated 2026-03-28

Cover

LLM Inference Illustrated

Copyright © 2026 Ted Kyi

This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0).

You are free to share and adapt this material for non-commercial purposes, provided you give appropriate credit and distribute your contributions under the same license.

Typeset with Quarto.