RAG (Retrieval-Augmented Generation)
Retrieval-augmented generation (RAG) is an architecture pattern that connects a large language model to external knowledge sources, such as documents, databases, and APIs, so that its responses draw on real, current information rather than only on what it memorized during training. At query time, the system retrieves relevant context and passes it to the model, which then generates an answer grounded in that evidence. RAG is one way enterprises make general-purpose AI useful for their specific business. The approach has limitations, however, and in many cases it has been surpassed by simpler read/write/grep tools.
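The retrieve-then-generate flow can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the sample DOCUMENTS, the token-overlap scoring (real systems typically use embedding or keyword search indexes), and the `llm` callable, which stands in for whatever model API is actually used.

```python
import re
from collections import Counter

# Hypothetical in-memory knowledge source; a real system would query a
# document store, database, or API instead.
DOCUMENTS = [
    "Refunds are issued within 14 days of a return request.",
    "Support is available by email on weekdays from 9 to 17.",
    "Premium plans include priority support and a dedicated manager.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def retrieve(query, documents, k=2):
    """Return up to k documents ranked by token overlap with the query.

    Toy scoring for illustration only; documents with no overlap are dropped.
    """
    query_tokens = Counter(tokenize(query))
    scored = []
    for doc in documents:
        overlap = sum((query_tokens & Counter(tokenize(doc))).values())
        if overlap > 0:
            scored.append((overlap, doc))
    # Stable sort: ties keep their original document order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:k]]


def build_prompt(query, context):
    """Place the retrieved passages ahead of the question."""
    context_block = "\n".join(f"- {passage}" for passage in context)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context_block}\n"
        f"Question: {query}"
    )


def answer(query, documents, llm):
    """Retrieve at query time, then generate from the grounded prompt.

    `llm` is a hypothetical callable that maps a prompt string to a reply.
    """
    context = retrieve(query, documents)
    return llm(build_prompt(query, context))
```

Calling `answer("How long do refunds take?", DOCUMENTS, llm=my_model)` would send the refund passage along with the question, so the model answers from that evidence rather than from memory alone. Here `my_model` is a placeholder for any function that accepts a prompt string.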
Related terms:
Hallucination
Hallucination occurs when a language model generates text that sounds confident and plausible but is factually incorrect, such as invented citations or...
AI Evaluation
AI evaluation is the practice of systematically measuring an AI system’s performance against defined criteria—accuracy, latency, cost, safety, and user...
AI Agent
An AI agent is a system that autonomously breaks a goal into steps—calling tools, reading results, and adjusting course—without waiting for a human prompt.