Retrieval-augmented generation reduces hallucinations only if retrieval quality is high: sensible chunk boundaries, metadata filters (space, product, locale), and periodic re-index when upstream docs change.

  • Tag chunks with source URL and `last_updated` so answers can cite freshness.
  • Separate internal-only from customer-safe corpora at index time, not in the prompt only.
  • Measure answer grounding with offline eval sets before widening to customers.

Pair RAG with APIs like Gemini for multimodal questions when screenshots or diagrams matter.