Topic · 21 concepts

Training Methodology

How modern retrieval models get their relevance signal.

The supervision signal a retrieval model is trained on determines what it can learn. The concepts below cover how modern rerankers and embeddings get their relevance targets — pairwise preferences from frontier-LLM ensembles, Thurstone fits that recover continuous Elo-style scores, and the distillation pipelines that compress a giant teacher into a fast specialized student. This is the methodology family behind zerank-1, zerank-2, and zembed-1, and the same shape generalizes to any narrow task where pairwise judgments are cheap and absolute scores are noisy.

Other topics
ZeroEntropy
The best AI teams build with ZeroEntropy models
Follow us on
GitHubTwitterSlackLinkedInDiscord