Architecture · intermediate
Embedding (vector embedding)
An embedding is a list of numbers (a vector) that represents a piece of input — a word, a sentence, an image — in a space where similar things end up close together.
Explanation
Inside an LLM, every token is converted to a vector via an embedding lookup table at the very first layer. Each layer then transforms these vectors. The final vectors carry the model's "understanding" of the input.
Separately, embedding models are encoders trained specifically to produce useful sentence- or document-level vectors. These power semantic search and retrieval-augmented generation: encode your documents once, encode a query, return the documents whose vectors are closest.
Embedding dimensionality is typically 256-4096; closeness is usually measured by cosine similarity.
Examples
- OpenAI's text-embedding-3-large produces 3,072-dim vectors.
- "king" - "man" + "woman" approximately equals "queen" in classic word2vec embeddings.
- Semantic search: nearest neighbors in embedding space.
Frequently asked
What is Embedding?
An embedding is a list of numbers (a vector) that represents a piece of input — a word, a sentence, an image — in a space where similar things end up close together.
What is an example of embedding?
OpenAI's text-embedding-3-large produces 3,072-dim vectors.
How is Embedding related to Vector Database?
Embedding and Vector Database are both architecture concepts. A vector database stores high-dimensional embeddings and answers "find the K nearest vectors to this query" extremely fast. The retrieval engine behind most RAG systems.
Is Embedding considered intermediate?
Embedding is generally considered intermediate-level material in the AI and LLM space.