LLM (Large Language Model)
A large language model is a neural network trained on massive text corpora—often trillions of tokens—to predict the next word in a sequence. That simple objective, scaled up, produces systems that can write code, summarize legal briefs, translate languages, and hold surprisingly coherent conversations. GPT-4, Claude, Gemini, and Llama are all LLMs. The "large" refers to parameter count, typically in the billions, which correlates loosely with capability but tightly with compute cost. LLMs are general-purpose by default and useless for specific tasks until you shape their behavior through prompting, fine-tuning, or connecting them to your data. The model is the engine. Everything else—retrieval, guardrails, UI, evaluation—is what makes it drive straight.
Related terms:
WWGPTD
WWGPTD began as internal Slack shorthand to remind teams that using AI isn’t cheating but the essential first step. It reframes strategy by asking how AI.
Transformer
The transformer is the neural network architecture introduced in Vaswani et al.’s “Attention Is All You Need” that replaces recurrence with parallel...
Inference
Inference is the process of running a trained model on new input to generate a prediction or output—such as sending a prompt to GPT-4 and receiving a...