From memory
to evidence.
The single most effective way to make AI reliable: stop asking what it remembers and start showing it the documents. Six stages.
The problem with memory.
A language model’s knowledge was frozen on the day its training ended. And even within that, recall is reconstruction — patterns, not records. Ask it about your specific contract, your specific witness, your specific deadline, and it has nothing to recall — so it may produce something plausible instead.
Show, don’t ask.
Grounding means putting the actual documents into the conversation and instructing the model to answer only from them. The question changes from “what do you remember about limitation periods?” to “what does this contract say about limitation?” — a task language models are genuinely excellent at.
The model answers from pattern — with no document to check it against.
[the clause text, pasted in full]
</covenant>
<task>
Set out what this clause restricts, quoting the words you rely on.
</task>
The same question, twice.
RAG, in one diagram.
Retrieval-Augmented Generation is grounding at scale. When the documents are too many to paste — a disclosure exercise, a precedent bank, an entire bundle — a search step finds the relevant passages and inserts them into the context automatically.
What it fixes. What it doesn’t.
- Recall. Answers come from the page, not the model’s memory.
- Currency. Your documents can be newer than the model’s training.
- Checkability. Quotes give you something to verify against the source.
- Retrieval misses. The search can miss the key passage — the model answers from what was found, not what exists.
- Reasoning errors. Correct sources can still be misapplied on top.
- Overstatement. A model can still overstate what a passage supports.
Source-locked, by instruction.
The practical pattern: tell the model it may use only the supplied documents, must quote what it relies on, and must say “not in the materials” rather than fill gaps.
Evidence in, evidence out.
Grounding is the bridge between a model that sounds right and one you can check. Next in the series: how to brief the model precisely once it has the documents.
Next: Prompting Opus 4.8 for Legal Work →