// RAG
arXiv Self-Correcting RAG
Multi-agent research synthesis with LangGraph
RAGMulti-AgentLangGraphPythonLLMs
Overview
Standard RAG breaks down on academic search — papers are long, citations matter, and the question often requires synthesis across multiple papers. This system adds two missing pieces: adaptive retrieval and self-correction.
How it works
- Query routing — the orchestrator decides whether to retrieve, search the web, ask for clarification, or answer from prior context
- Adaptive retrieval — initial vector search; if relevance is low, the agent reformulates the query and tries again
- Document scoring — each retrieved chunk is scored for relevance to the question; low-scoring chunks are discarded before generation
- Self-correction loop — after generating an answer, an evaluator agent checks for hallucinations and missing citations; if the check fails, it goes back to retrieval with feedback
- Citation enforcement — every claim in the final answer must point to a specific arXiv ID and section
Built on LangGraph
The state machine is the point. LangGraph lets you express "if eval fails, go back to retrieve" cleanly without manual loop management — and the graph is inspectable for debugging.
Result
Substantially fewer hallucinations on academic queries compared to a single-pass RAG baseline, at the cost of higher latency. Worth it for a research assistant.