ai/mxbai-embed-large

Verified Publisher

By Docker

Updated about 1 year ago

mxbai-embed-large-v1 is a top English embedding model by Mixedbread AI, great for RAG and more.

mxbai-embed-large-v1

mxbai-embed-large-v1 is a state-of-the-art English language embedding model developed by Mixedbread AI. It converts text into dense vector representations that capture the meaning of the input. The model was trained on more than 700 million pairs using contrastive training and then fine-tuned on more than 30 million high-quality triplets with the AnglE loss function. As a result, it adapts to a wide range of topics and domains, which makes it suitable for real-world applications and Retrieval-Augmented Generation (RAG) use cases.

Intended uses

mxbai-embed-large-v1 is designed for generating sentence embeddings suitable for various NLP applications.

Semantic search and information retrieval: Specifically designed for RAG, this model enhances search systems by providing relevant document embeddings, improving the accuracy and relevance of search results.
Semantic textual similarity: Measures the similarity between sentences, aiding in tasks such as clustering, duplicate detection, and paraphrase identification.
Text classification: Serves as input features for classifiers in tasks like sentiment analysis, topic categorization, and intent detection.
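
The search and similarity uses above both come down to comparing embedding vectors. The following is a minimal sketch using cosine similarity, a common choice that this page does not itself prescribe. The helper names are illustrative, and the short vectors stand in for real model output, which is much longer.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)

def rank_by_similarity(query_vec, doc_vecs):
    """Return document indices ordered from most to least similar to the query."""
    scores = [cosine_similarity(query_vec, d) for d in doc_vecs]
    return sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)

# Toy vectors only; in practice these would be embeddings returned by the model.
toy_query = [1.0, 0.0]
toy_docs = [[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
ranking = rank_by_similarity(toy_query, toy_docs)
```

The same ranking helper can serve duplicate detection or clustering pre-steps by thresholding the scores instead of sorting them.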

Characteristics

Attribute	Details
Provider	Mixedbread AI
Architecture	BERT
Cutoff Date	September 2023
Languages	English
Tool Calling	❌
Input Modalities	Text
Output Modalities	Text embeddings
License	Apache 2.0

Available model variants

Model variant	Parameters	Quantization	Context window	VRAM¹	Size
`ai/mxbai-embed-large:latest` `ai/mxbai-embed-large:335M-F16`	334.09 M	F16	512 tokens	0.63 GiB	638.85 MB
`ai/mxbai-embed-large:335M-F16`	334.09 M	F16	512 tokens	0.63 GiB	638.85 MB

¹: VRAM estimated based on model characteristics.

latest → 335M-F16
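
As a rough sanity check on the table, the weight footprint of an F16 model can be estimated as two bytes per parameter. This is a back-of-the-envelope sketch, not the method used to produce the listed VRAM figure (the page does not state it); the function names are illustrative.

```python
BYTES_PER_F16 = 2  # a 16-bit float occupies two bytes

def f16_weight_bytes(parameter_count):
    """Approximate bytes needed to hold the weights in F16."""
    return int(parameter_count) * BYTES_PER_F16

def to_mib(num_bytes):
    return num_bytes / 1024**2

def to_gib(num_bytes):
    return num_bytes / 1024**3

weights = f16_weight_bytes(334_090_000)  # 334.09 M parameters from the table
print(f"{to_mib(weights):.2f} MiB, {to_gib(weights):.3f} GiB")
```

The estimate lands close to the listed size and VRAM values, which suggests both are dominated by the weights themselves; file metadata and runtime buffers would account for the remainder.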

Use this AI model with Docker Model Runner

First, pull the model:

docker model pull ai/mxbai-embed-large

Then run the model:

docker model run ai/mxbai-embed-large

For more information on Docker Model Runner, explore the documentation.
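
Once the model is pulled, embeddings are usually requested over HTTP rather than typed interactively. The sketch below assumes that Docker Model Runner's host-side TCP access is enabled on port 12434 and that it serves an OpenAI-compatible embeddings endpoint under `/engines/v1`. The base URL and the response shape are assumptions to verify against the Model Runner documentation, and the function names are illustrative.

```python
import json
import urllib.request

# Assumption: host-side TCP access is enabled and the API is OpenAI-compatible.
# Adjust the base URL to match your own setup.
BASE_URL = "http://localhost:12434/engines/v1"
MODEL = "ai/mxbai-embed-large"

def build_embedding_request(texts, base_url=BASE_URL, model=MODEL):
    """Build a POST request asking for embeddings of a list of strings."""
    payload = json.dumps({"model": model, "input": texts}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/embeddings",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def parse_embeddings(response_body):
    """Extract vectors from an OpenAI-style response, ordered by 'index'."""
    items = sorted(response_body["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]

def embed(texts):
    """Send the request and return one vector per input text (needs a running server)."""
    request = build_embedding_request(texts)
    with urllib.request.urlopen(request, timeout=30) as response:
        return parse_embeddings(json.load(response))
```

Calling `embed(["some text"])` only works while the model server is reachable; the request builder and parser can be exercised without it.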

Considerations

Prompt usage: For retrieval tasks, prepend the prompt "Represent this sentence for searching relevant passages:" to the query. This helps the model understand the context and improves performance. For other tasks, use the text as-is, without any additional prompt.
Language limitation: The model is trained exclusively on English text and is designed for English only.
Sequence length: The suggested maximum sequence length is 512 tokens. Longer sequences may be truncated, leading to a loss of information.
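
The prompt and length considerations above can be sketched as small preprocessing helpers. The separating space after the prompt's colon is an assumption, as are the helper names. The chunker counts whitespace-separated words, which is only a crude proxy for tokens; an exact 512-token limit would need the model's own tokenizer, and the default of 200 words is an arbitrary conservative value.

```python
# Retrieval prompt quoted in the considerations; the trailing space is an assumption.
QUERY_PROMPT = "Represent this sentence for searching relevant passages: "

def prepare_query(text):
    """Prefix a search query with the retrieval prompt."""
    return QUERY_PROMPT + text.strip()

def prepare_document(text):
    """Texts for non-retrieval tasks are used as-is, with no prompt."""
    return text.strip()

def chunk_words(text, max_words=200):
    """Split text into pieces of at most max_words words.

    Words are not tokens, so this only reduces the chance of truncation;
    it does not guarantee that each piece fits within 512 tokens.
    """
    if max_words < 1:
        raise ValueError("max_words must be at least 1")
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]

query_text = prepare_query("  what is retrieval-augmented generation? ")
pieces = chunk_words("a b c d e", max_words=2)
```

Embedding each chunk separately keeps content that would otherwise be cut off, at the cost of more vectors to store and search.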

Benchmark performance

Task Category	mxbai-embed-large-v1
Avg (56 datasets)	64.68
Classification	75.64
Clustering	46.71
Pair Classification	87.20
Reranking	60.11
Retrieval	54.39
STS	85.00
Summarization	32.71
