ai/mxbai-embed-large

Verified Publisher

By Docker

Updated about 1 year ago

mxbai-embed-large-v1 is a top English embed model by Mixedbread AI, great for RAG and more.

Model
3

10K+

ai/mxbai-embed-large repository overview

mxbai-embed-large-v1

logo

mxbai-embed-large-v1 is a state-of-the-art English language embedding model developed by Mixedbread AI. It converts text into dense vector representations, capturing the semantic essence of the input. Trained on a vast dataset exceeding 700 million pairs using contrastive training methods and fine-tuned on over 30 million high-quality triplets with the AnglE loss function, this model adapts to a wide range of topics and domains, making it suitable for various real-world applications and Retrieval-Augmented Generation (RAG) use cases.

Intended uses

mxbai-embed-large-v1 is designed for generating sentence embeddings suitable for various NLP applications.

  • SemanticsSearch and information retrieval: Specifically designed for RAG, this model enhances search systems by providing relevant document embeddings, improving the accuracy and relevance of search results.
  • Semantic textual similarity: Measures the similarity between sentences, aiding in tasks such as clustering, duplicate detection, and paraphrase identification.
  • Text classification: Serves as input features for classifiers in tasks like sentiment analysis, topic categorization, and intent detection.

Characteristics

AttributeDetails
ProviderMixedbread AI
ArchitectureBERT
Cutoff DateSeptember 2023
LanguagesEnglish
Tool Calling
Input ModalitiesText
Output ModalitiesText embeddings
LicenseApache 2.0

Available model variants

Model variantParametersQuantizationContext windowVRAM¹Size
ai/mxbai-embed-large:latest

ai/mxbai-embed-large:335M-F16
334.09 MF16512 tokens0.63 GiB638.85 MB
ai/mxbai-embed-large:335M-F16334.09 MF16512 tokens0.63 GiB638.85 MB

¹: VRAM estimated based on model characteristics.

latest335M-F16

Use this AI model with Docker Model Runner

First, pull the model:

docker model pull ai/mxbai-embed-large

Then run the model:

docker model run ai/mxbai-embed-large

For more information on Docker Model Runner, explore the documentation.

Considerations

  • Prompt usage: For retrieval tasks, prepend the query with the prompt. For example, "Represent this sentence for searching relevant passages:". This practice helps the model understand the context and improves performance. For other tasks, the text can be used as-is without any additional prompt.
  • Language limitation: The model is trained exclusively on English text and is specifically designed for the English language.
  • Sequence length: The suggested maximum sequence length is 512 tokens. Longer sequences may be truncated, leading to a loss of information.

Benchmark performance

Task Categorymxbai-embed-large-v1
Avg (56 datasets)64.68
Classification75.64
Clustering46.71
Pair Classification87.2
Reranking60.11
Retrieval54.39
STS85.00
Summarization32.71

Tag summary

Content type

Model

Digest

sha256:e5e025b14

Size

639.5 MB

Last updated

about 1 year ago

docker model pull ai/mxbai-embed-large

This week's pulls

Pulls:

101

Jun 1 to Jun 7