Training · intermediate

Instruction Tuning

Instruction tuning is fine-tuning on examples of (instruction, desired response) pairs so a base model learns to follow natural-language directions.

Published May 29, 2026

Explanation

A base LLM trained only on next-token prediction tends to complete text rather than follow instructions. If you prompt "Write a poem about the sea," a base model might continue "...and other writing tips for beginners." Instruction tuning explicitly teaches the model that "Write a poem about the sea" is a request to produce a poem.
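To make the idea of an (instruction, desired response) pair concrete, here is a minimal sketch of how one pair might be turned into a training example. The prompt template, the whitespace tokenizer, and the helper names are hypothetical and chosen only for illustration. Real pipelines use the model's own tokenizer and chat template. Masking the prompt positions so that only the response is scored is a common convention, not something every recipe does.

```python
# Label value for positions that should not contribute to the loss.
# -100 matches the default ignore_index of PyTorch's CrossEntropyLoss;
# using it here is an illustrative assumption.
IGNORE_INDEX = -100

# Hypothetical prompt template; real models each define their own format.
PROMPT_TEMPLATE = "### Instruction:\n{instruction}\n\n### Response:\n"


def encode(text, vocab):
    """Toy tokenizer: split on whitespace and assign ids on first sight."""
    return [vocab.setdefault(token, len(vocab)) for token in text.split()]


def build_example(instruction, response, vocab):
    """Turn one (instruction, response) pair into input ids and labels."""
    prompt_ids = encode(PROMPT_TEMPLATE.format(instruction=instruction), vocab)
    response_ids = encode(response + " <eos>", vocab)
    input_ids = prompt_ids + response_ids
    # Prompt positions are masked, so the model is only trained to
    # produce the response given the instruction.
    labels = [IGNORE_INDEX] * len(prompt_ids) + response_ids
    return {"input_ids": input_ids, "labels": labels}


vocab = {}
example = build_example(
    "Write a poem about the sea.",
    "Waves fold over waves, salt on the wind.",
    vocab,
)
print(example["input_ids"])
print(example["labels"])
```

The labels here are aligned position by position with the inputs. Many training libraries shift them by one token internally so that each position predicts the next token.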

The training data is usually a mix of existing NLP datasets recast as instructions through templates (FLAN), human-written conversations (OpenAssistant), and instruction-response pairs distilled from a larger model. Most public chat models, such as Llama-3-Instruct and Mistral-Instruct, are instruction-tuned versions of corresponding base models.

Instruction tuning typically comes before RLHF in the post-training pipeline.

Examples

Flan-T5: Google's T5 fine-tuned on the FLAN instruction collection.
Llama-3-8B-Instruct: the instruction-tuned variant of Llama-3-8B.

Frequently asked

How is Instruction Tuning related to Fine-tuning?

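Instruction tuning is a specific kind of fine-tuning. Fine-tuning continues training a pretrained model on a smaller, task-specific dataset, adjusting its weights to specialize behavior or knowledge. Instruction tuning does this with a dataset of (instruction, desired response) pairs, so the behavior being specialized is instruction following itself.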

Fine-tuning · Training

Fine-tuning continues training a pretrained model on a smaller, task-specific dataset, adjusting its weights to specialize behavior or knowledge.

Supervised Fine-Tuning · Training

SFT is fine-tuning where each training example has an explicit input and a desired output, supervised by a loss that penalizes deviation from that output.
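The loss mentioned in this definition is typically token-level cross-entropy against the desired output. The following is a minimal pure-Python sketch under stated assumptions: the function names are hypothetical, the logits are made-up numbers over a three-token vocabulary, and the -100 mask value is borrowed from a common convention for skipping prompt positions. A real trainer would compute the same quantity with a tensor library.

```python
import math

# Assumed mask value for positions excluded from the loss (e.g. prompt tokens).
IGNORE_INDEX = -100


def softmax(logits):
    """Numerically stable softmax over one position's logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sft_loss(logits_per_position, labels):
    """Mean negative log-likelihood of the target tokens, skipping masked positions."""
    losses = []
    for logits, label in zip(logits_per_position, labels):
        if label == IGNORE_INDEX:
            continue  # masked position: not supervised
        probs = softmax(logits)
        losses.append(-math.log(probs[label]))
    if not losses:
        raise ValueError("no supervised positions in this example")
    return sum(losses) / len(losses)


# Three positions over a 3-token vocabulary; the first position is masked.
logits = [
    [0.0, 0.0, 0.0],
    [2.0, 0.0, 0.0],  # model favors token 0, which is the target
    [0.0, 0.0, 0.0],  # model is uniform, target is token 2
]
labels = [IGNORE_INDEX, 0, 2]
loss = sft_loss(logits, labels)
print(loss)
```

The loss falls as the model puts more probability on each target token, which is the sense in which deviation from the desired output is penalized.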

Reinforcement Learning from Human Feedback · Training

RLHF fine-tunes an LLM to maximize a reward model that was itself trained on human preference judgments between candidate responses.

Pretraining · Training

Pretraining is the initial training phase where an LLM learns to predict the next token on trillions of tokens of general text. It produces a base model that can be adapted later.

Sources

FLAN paper (arXiv)