Topic · 9 concepts

Rerankers

The second stage that puts the right answer at the top.

First-pass retrieval is fast and approximate; rerankers are slow and precise. A reranker — typically a cross-encoder — takes a small candidate set from first-pass and reorders it by actual relevance, paying close attention to each (query, document) pair. The concepts below cover the architectural variants (cross-encoder vs bi-encoder, pointwise vs pairwise vs listwise), the production properties that matter (calibration, instruction-following, confidence), and the tradeoffs that make rerankers indispensable in any RAG pipeline aiming for high-quality answers.

Other topics
ZeroEntropy
The best AI teams build with ZeroEntropy models
Follow us on
GitHubTwitterSlackLinkedInDiscord