Embedding Gemma is a state-of-the-art text embedding model from Google DeepMind
10K+
Embedding Gemma is a state-of-the-art text embedding model from Google DeepMind, designed to create high-quality vector representations of text. Built on the Gemma architecture, this model converts text into dense vector embeddings that capture semantic meaning, making it ideal for retrieval-augmented generation (RAG), semantic search, and similarity tasks. With open weights and efficient design, Embedding Gemma provides a powerful foundation for embedding-based applications. The GGUF format version is provided by Unsloth.
Embedding Gemma is designed for applications requiring high-quality text embeddings:
| Attribute | Details |
|---|---|
| Provider | Google DeepMind |
| Architecture | Gemma Embedding |
| Cutoff date | - |
| Languages | English |
| Tool calling | ❌ |
| Input modalities | Text |
| Output modalities | Embedding vectors |
| License | Gemma Terms |
| Model variant | Parameters | Quantization | Context window | VRAM¹ | Size |
|---|---|---|---|---|---|
ai/embeddinggemma:latestai/embeddinggemma:300M-Q8_0 | 300M | Q8_0 | 2K tokens | 1.43 GiB | 307.13 MB |
ai/embeddinggemma:300M-Q8_0 | 300M | Q8_0 | 2K tokens | 1.43 GiB | 307.13 MB |
¹: VRAM estimated based on model characteristics.
latest→300M-Q8_0
First, pull the model:
docker model pull ai/embeddinggemma
Then run the model:
docker model run ai/embeddinggemma
To generate embeddings using the API:
curl --location 'http://localhost:12434/engines/llama.cpp/v1/embeddings' \
--header 'Content-Type: application/json' \
--data '{
"model": "ai/embeddinggemma",
"input": "Your text to embed here"
}'
For more information on Docker Model Runner, explore the documentation.
| Task Category | Embedding Gemma |
|---|---|
| Retrieval | 54.87 |
| STS | 78.53 |
| Classification | 73.26 |
| Clustering | 44.72 |
| Pair Classification | 85.94 |
| Reranking | 59.36 |
Content type
Model
Digest
sha256:b6635ddcd…
Size
313.4 MB
Last updated
9 months ago
docker model pull ai/embeddinggemmaPulls:
338
Jun 1 to Jun 7