Architecture · intermediate
Reasoning Model (thinking model)
A reasoning model spends extra compute thinking step-by-step before answering. OpenAI o1/o3, DeepSeek R1, and Anthropic's extended thinking are reasoning models.
Explanation
Standard LLMs generate output as fast as you can read it. Reasoning models pause first — internally producing a long scratchpad of chain-of-thought that the user typically does not see — then commit to a final answer informed by that reasoning.
The training recipe combines large-scale chain-of-thought data with reinforcement learning that rewards correctness on verifiable tasks (math, code, logic). The result: large gains on math/coding/science benchmarks at the cost of higher latency and per-call price.
Reasoning models opened a new scaling axis — test-time compute — that runs alongside parameter and data scaling.
Examples
- OpenAI o1 solving a competition math problem with hidden CoT.
- DeepSeek R1 open-weights reasoning model.
- Claude extended thinking mode.
When to use reasoning model
When the task is hard, verifiable, and quality dominates latency cost — math, code, scientific analysis, multi-step planning.
Frequently asked
What is Reasoning Model?
A reasoning model spends extra compute thinking step-by-step before answering. OpenAI o1/o3, DeepSeek R1, and Anthropic's extended thinking are reasoning models.
What is an example of reasoning model?
OpenAI o1 solving a competition math problem with hidden CoT.
How is Reasoning Model related to Chain-of-Thought?
Reasoning Model and Chain-of-Thought are both architecture concepts. Chain-of-thought prompting asks the model to show its reasoning step by step before giving a final answer. It dramatically improves performance on multi-step problems.
When should I use reasoning model?
When the task is hard, verifiable, and quality dominates latency cost — math, code, scientific analysis, multi-step planning.
Is Reasoning Model considered intermediate?
Reasoning Model is generally considered intermediate-level material in the AI and LLM space.