Embed models can be used to generate embeddings from text or classify it based on various parameters. Embeddings can be used for estimating semantic similarity between two texts, choosing a sentence which is most likely to follow another sentence, or categorizing user feedback. When used with the Classify endpoint, embeddings can be used for any classification or analysis task.

English Models

Latest ModelDescriptionDimensionsMax Tokens (Context Length)Similarity MetricEndpoints
embed-english-v3.0A model that allows for text to be classified or turned into embeddings. English only.1024512Cosine SimilarityEmbed,
Embed Jobs
embed-english-light-v3.0A smaller, faster version of embed-english-v3.0. Almost as capable, but a lot faster. English only.384512Cosine SimilarityEmbed,
Embed Jobs
embed-english-v2.0Our older embeddings model that allows for text to be classified or turned into embeddings. English only4096512Cosine SimilarityClassify, Embed
embed-english-light-v2.0A smaller, faster version of embed-english-v2.0. Almost as capable, but a lot faster. English only.1024512Cosine SimilarityClassify, Embed

Multi-Lingual Models

Latest ModelDescriptionDimensionsMax Tokens (Context Length)Similarity MetricEndpoints
embed-multilingual-v3.0Provides multilingual classification and embedding support. See supported languages here.1024512Cosine SimilarityEmbed, Embed Jobs
embed-multilingual-light-v3.0A smaller, faster version of embed-multilingual-v3.0. Almost as capable, but a lot faster. Supports multiple languages.384512Cosine SimilarityEmbed,
Embed Jobs
embed-multilingual-v2.0Provides multilingual classification and embedding support. See supported languages here.768256Dot Product SimilarityClassify, Embed

Frequently Asked Questions

What is the Context Length for Cohere Embeddings Models?

You can find the context length for various Cohere embeddings models in the tables above. It's in the "Max Tokens (Context Length)" column.