Comparison

Alignment vs Constitutional AI

Alignment and Constitutional AI are both common AI/LLM terms but cover different ideas. Here is a quick side-by-side.

When you would reach for Alignment

Alignment comes up when the question is fundamentally about safety & alignment.

Tuning a model to refuse to help with bioweapon synthesis.

When you would reach for Constitutional AI

Constitutional AI comes up when the question is fundamentally about safety & alignment.

A constitutional principle: "Choose the response that is least harmful and most helpful."

Frequently asked

What is the difference between Alignment and Constitutional AI?

Alignment: Alignment is the problem of making an AI system pursue what humans actually want rather than the literal letter of its training objective. RLHF and Constitutional AI are alignment techniques. Constitutional AI: Constitutional AI is Anthropic's alignment technique that uses a written set of principles ("constitution") plus AI feedback to shape model behavior instead of relying entirely on human labels.

When should I use Alignment vs Constitutional AI?

Alignment is the right concept when you are focused on safety & alignment. Constitutional AI applies when you are focused on safety & alignment.

Are Alignment and Constitutional AI the same thing?

No. Alignment is safety & alignment; Constitutional AI is safety & alignment. They are related but address different parts of the AI stack.