RAG (Retrieval Augmented Generation)

« Back to Glossary Index

Retrieval-Augmented Generation (RAG) is an approach in natural language processing (NLP) that combines retrieval mechanisms with text generation models to produce more accurate, relevant, and up-to-date responses.

How it works: RAG involves retrieving information from an external knowledge base (like documents or databases) and feeding that context into a generative model (e.g., GPT) to create responses that are grounded in the retrieved data.
Key Applications:
- Customer support: Crafting answers to queries by referencing company knowledge bases.
- Education: Generating answers to questions using trusted academic sources.
- Search engines: Providing concise, sourced, and context-aware responses.

RAG ensures that the generative outputs remain factually accurate and grounded in external, up-to-date information, bridging the gap between static generative models and real-world applications.

« Back to Glossary Index