ai/nomic-embed-text-v1.5

Verified Publisher

By Docker

Updated about 1 month ago

ai/nomic-embed-text-v1.5 repository overview

Nomic Embed Text

Nomic Embed Text v1 is an open‑source, fully auditable text embedding model with an 8192‑token context window. It outperforms OpenAI Ada‑002 and text‑embedding‑3‑small on various embedding benchmarks while providing open weights, training code, and data under an Apache‑2 license.

Intended uses

Nomic Embed Text v1 is designed for applications requiring high‑quality embeddings over very long contexts:

  • Semantic search and retrieval: Well suited to retrieval‑augmented generation (RAG) and other information retrieval tasks over long documents.
  • Clustering and classification: Embeddings can be used downstream for clustering, classification, and data visualization.
  • Auditable, open embedding pipelines: Provides full transparency with open data, code, and model weights—ideal for enterprise and research use where auditability matters.
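To illustrate the semantic search use case, here is a minimal sketch of ranking documents by cosine similarity between embedding vectors. The vectors are toy three-dimensional stand-ins, not real model output, and the helper names `cosine_similarity` and `rank_documents` are hypothetical. Cosine similarity is a common choice for comparing embeddings; it is an assumption here, not something this page prescribes.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return document indices sorted from most to least similar to the query."""
    scores = [cosine_similarity(query_vec, d) for d in doc_vecs]
    return sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)


# Toy vectors standing in for real embeddings (assumption for illustration).
query = [1.0, 0.0, 0.0]
docs = [
    [0.0, 1.0, 0.0],  # orthogonal to the query
    [0.9, 0.1, 0.0],  # nearly parallel to the query
    [0.5, 0.5, 0.0],  # in between
]

print(rank_documents(query, docs))  # [1, 2, 0]
```

In a real pipeline the query and document vectors would come from the embeddings endpoint, and the top-ranked documents would be passed on to the downstream step (for example, the generator in a RAG setup).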

Characteristics

| Attribute | Details |
|---|---|
| Provider | Nomic AI |
| Architecture | Transformer-based encoder, initialized from a BERT-style model (Nomic-BERT‑2048) with rotary embeddings, SwiGLU activations, and long‑context adaptations |
| Cutoff date | - |
| Languages | English |
| Tool calling | |
| Input modalities | Text (tokens up to 8192 sequence length) |
| Output modalities | Embedding vectors |
| License | Apache 2.0 |

Available model variants

| Model variant | Parameters | Quantization | Context window | VRAM¹ | Size |
|---|---|---|---|---|---|
| ai/nomic-embed-text-v1.5:latest / ai/nomic-embed-text-v1.5:137M-F16 | 137M | MOSTLY_F16 | 2K tokens | 0.51 GiB | 260.87 MB |
| ai/nomic-embed-text-v1.5:137M-F16 | 137M | MOSTLY_F16 | 2K tokens | 0.51 GiB | 260.87 MB |

¹: VRAM estimated based on model characteristics.

Use this AI model with Docker Model Runner

First, pull the model:

docker model pull ai/nomic-embed-text-v1.5

Then send a request to the model's embeddings endpoint:

curl --location 'http://localhost:12434/engines/llama.cpp/v1/embeddings' \
--header 'Content-Type: application/json' \
--data '{
    "model": "ai/nomic-embed-text-v1.5",
    "input": "hello world!"
  }'
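The same request can be sketched in Python with only the standard library. The URL, model name, and payload are taken from the command above. The helper names (`build_request`, `extract_embedding`, `embed`) are hypothetical, and the response shape is assumed to follow the OpenAI-style embeddings format (`data[0].embedding`), so check it against the actual response from your Model Runner version.

```python
import json
import urllib.request

# Endpoint and model name as used in the command-line example.
EMBEDDINGS_URL = "http://localhost:12434/engines/llama.cpp/v1/embeddings"
MODEL = "ai/nomic-embed-text-v1.5"


def build_request(text, url=EMBEDDINGS_URL, model=MODEL):
    """Build (but do not send) the POST request for one input string."""
    payload = json.dumps({"model": model, "input": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_embedding(response_body):
    """Return the first embedding vector from a parsed response.

    Assumes an OpenAI-style body: {"data": [{"embedding": [...]}]}.
    """
    return response_body["data"][0]["embedding"]


def embed(text):
    """Send the request; needs Docker Model Runner listening on EMBEDDINGS_URL."""
    with urllib.request.urlopen(build_request(text)) as resp:
        return extract_embedding(json.load(resp))
```

With the model pulled and Docker Model Runner reachable on that port, `embed("hello world!")` should return a list of floats; nothing is sent over the network until `embed` is called.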

Considerations

  • While performance is strong on MTEB and LoCo benchmarks, on the Jina Long Context Benchmark it does not outperform closed-source models like Ada‑002 or text‑embedding‑3‑small.
  • Best suited for applications needing open-source, very long‑context embeddings with full reproducibility.

Tag summary

Content type: Model
Digest: sha256:653017dd0
Size: 261.6 MB
Last updated: about 1 month ago
