What is Reranking? - Definition & Meaning
Learn what reranking is, how retrieved documents are reordered for better RAG results, and which models and tools to use.
Definition
Reranking is the reordering of retrieved documents with a more accurate model (often a cross-encoder) to place the most relevant results at the top. It significantly improves RAG quality compared to vector search alone.
Technical explanation
First, many candidates (e.g., 100) are fetched via fast vector or keyword search. Then a cross-encoder scores query-document pairs and reorders by relevance. Cross-encoders are more accurate than bi-encoders (embedding similarity) but slower. Cohere Rerank, Jina Reranker, and open source (ms-marco, BGE-reranker) are common. Trade-off: more candidates = better recall but higher latency.
How AVARC Solutions applies this
AVARC Solutions adds reranking to retrieval pipelines where quality is critical. We use Cohere Rerank or open source cross-encoders. For high-traffic systems we limit reranking to the top 20–30 candidates.
Practical examples
- A RAG fetching 50 chunks via vector search, then selecting the top 5 with Cohere Rerank for the LLM.
- An enterprise search combining hybrid retrieval with a reranker for the most accurate results.
- A support chatbot where reranking ensures the right FAQ chunks appear at the top.
Related terms
Frequently asked questions
Related articles
What are Chunking Strategies? - Definition & Meaning
Learn what chunking strategies are, how to optimally split documents for RAG, and which methods fit your use case best.
What is Contextual Compression? - Definition & Meaning
Learn what contextual compression is, how retrieved documents are compressed based on the query, and why it makes RAG more efficient and effective.
What is RAG (Retrieval Augmented Generation)? - Definition & Meaning
Learn what RAG is, how it combines LLMs with external knowledge sources for accurate and up-to-date answers, and why it is essential for enterprise AI.
LangChain vs LlamaIndex: Which AI Framework for RAG Should You Choose?
Compare LangChain and LlamaIndex on RAG, document processing, and developer experience. Discover which framework fits your LLM application.