Retrieval-augmented generation reduces hallucinations only if retrieval quality is high: sensible chunk boundaries, metadata filters (space, product, locale), and periodic re-index when upstream docs change.
- Tag chunks with source URL and `last_updated` so answers can cite freshness.
- Separate internal-only from customer-safe corpora at index time, not in the prompt only.
- Measure answer grounding with offline eval sets before widening to customers.
Pair RAG with APIs like Gemini for multimodal questions when screenshots or diagrams matter.
