Topic · 32 concepts

Language Models

The foundational substrate of modern AI.

Large language models are transformer-based neural networks trained on vast text corpora to predict the next token. The concepts below cover the building blocks (transformer, attention, tokenization, context window), the failure modes (hallucination), and the production lens — when to call an LLM, when to specialize a small model instead, and why almost every serious AI stack ends up combining both. Foundational reading for everything else on this site.

Other topics
ZeroEntropy
The best AI teams build with ZeroEntropy models
Follow us on
GitHubTwitterSlackLinkedInDiscord