Nomic Embed Text v1 is an open‑source, fully auditable text embedding model
10K+
Nomic Embed Text v1 is an open‑source, fully auditable text embedding model with an 8192‑token context window. It outperforms OpenAI Ada‑002 and text‑embedding‑3‑small on various embedding benchmarks while providing open weights, training code, and data under an Apache‑2 license.
Nomic Embed Text v1 is designed for applications requiring high‑quality embeddings over very long contexts:
| Attribute | Details |
|---|---|
| Provider | Nomic AI |
| Architecture | Transformer-based encoder, initialized from a BERT-style model (Nomic-BERT‑2048) with rotary embeddings, SwiGLU activations, and long‑context adaptations |
| Cutoff date | - |
| Languages | English |
| Tool calling | ❌ |
| Input modalities | Text (tokens up to 8192 sequence length) |
| Output modalities | Embedding vectors |
| License | Apache 2.0 |
| Model variant | Parameters | Quantization | Context window | VRAM¹ | Size |
|---|---|---|---|---|---|
ai/nomic-embed-text-v1.5:latestai/nomic-embed-text-v1.5:137M-F16 | 137M | MOSTLY_F16 | 2K tokens | 0.51 GiB | 260.87 MB |
ai/nomic-embed-text-v1.5:137M-F16 | 137M | MOSTLY_F16 | 2K tokens | 0.51 GiB | 260.87 MB |
¹: VRAM estimated based on model characteristics.
latest→137M-F16
First, pull the model:
docker model pull ai/nomic-embed-text-v1.5
Then run the model:
url --location 'http://localhost:12434/engines/llama.cpp/v1/embeddings' \
--header 'Content-Type: application/json' \
--data '{
"model": "ai/nomic-embed-text-v1.5",
"input": "hello world!"
}'
Content type
Model
Digest
sha256:653017dd0…
Size
261.6 MB
Last updated
about 1 month ago
docker model pull ai/nomic-embed-text-v1.5Pulls:
1,149
Last week