RAG (Retrieval-Augmented Generation)
Retrieval-augmented generation (RAG) is an architecture pattern that connects a large language model to external knowledge sources—documents, databases, APIs—so its responses draw on real, current information rather than relying solely on what it memorized during training. The model retrieves relevant context at query time, then generates an answer grounded in that evidence. RAG is one way enterprises make general-purpose AI useful for their specific business, though the approach is not without its limitations and has been surpassed by simpler read/write/grep tools in many instances.
Related terms:
AI for Marketing
AI for marketing leverages language models, predictive analytics, and automation to accelerate traditional workflows like content creation, audience...
Temperature
Temperature is a parameter controlling a language model’s randomness: at 0 it always picks the most probable next token for deterministic, reliable output,...
Context Window
A context window is the maximum amount of text a language model can process in a single call—input and output combined—measured in tokens.