Topic · 16 concepts

Embeddings

The dense-vector layer of modern retrieval.

Embedding models compress a piece of text into a fixed-size vector whose position encodes meaning. Two queries about the same thing land near each other in the space; two unrelated queries land far apart. That spatial property is the foundation of dense retrieval, semantic search, and most modern RAG. The concepts below cover how embeddings are produced, how to compare them (cosine similarity, dot product, magnitudes), and the inference-time levers — dimension truncation, quantization, cross-lingual support — that determine whether your billion-document index costs cents or thousands of dollars per month.

Other topics
ZeroEntropy
The best AI teams build with ZeroEntropy models
Follow us on
GitHubTwitterSlackLinkedInDiscord