Jina Embeddings v3

Built on XLM-RoBERTa with 570M parameters supporting 8,192 token context length across 89 languages. Features five task-specific LoRA adapters for retrieval, clustering, and classification with Matryoshka learning for flexible sizing. If you want to compare the best embedding models for your data, try Agentset.

Leaderboard Rank

#17

of 18

ELO Rating

1414

#17

Win Rate

34.6%

#16

Accuracy (nDCG@10)

0.674

Latency

223ms

#12

Model Information

Provider: Jina AI
License: Open Source
Price per 1M tokens: $0.045
Dimensions: 1024
Release Date: 2024-09-18
Model Name: jina-embeddings-v3
Total Evaluations: 830

Performance Record

Wins287 (34.6%)

Losses487 (58.7%)

Ties56 (6.7%)

Wins

Losses

Ties

Embedding Models Are Just One Piece of RAG

Agentset gives you a managed RAG pipeline with the top-ranked models and best practices baked in. No infrastructure to maintain, no embeddings to manage.

Schedule Demo Login

Trusted by teams building production RAG applications

5M+

Documents

1,500+

Teams

99.9%

Uptime

Performance Overview

ELO ratings by dataset

Jina Embeddings v3's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.

Jina Embeddings v3 - ELO by Dataset

Detailed Metrics

Dataset breakdown

Performance metrics across different benchmark datasets, including accuracy and latency percentiles.

business reports

ELO 150022.5% WR36W-123L-1T

Accuracy Metrics

nDCG@5: 0.000
nDCG@10: 0.000
Recall@5: 0.000
Recall@10: 0.000

Latency Distribution

Mean: 26ms
P50 (Median): 26ms
P90: 26ms

DBPedia

ELO 150028.7% WR46W-101L-13T

Accuracy Metrics

nDCG@5: 0.835
nDCG@10: 0.789
Recall@5: 0.062
Recall@10: 0.121

Latency Distribution

Mean: 107ms
P50 (Median): 107ms
P90: 107ms

FiQa

ELO 150036.7% WR55W-93L-2T

Accuracy Metrics

nDCG@5: 0.764
nDCG@10: 0.775
Recall@5: 0.635
Recall@10: 0.745

Latency Distribution

Mean: 273ms
P50 (Median): 273ms
P90: 273ms

SciFact

ELO 150043.1% WR69W-87L-4T

Accuracy Metrics

nDCG@5: 0.600
nDCG@10: 0.636
Recall@5: 0.709
Recall@10: 0.816

Latency Distribution

Mean: 75ms
P50 (Median): 75ms
P90: 75ms

MSMARCO

ELO 150038.8% WR62W-70L-28T

Accuracy Metrics

nDCG@5: 0.958
nDCG@10: 0.944
Recall@5: 0.124
Recall@10: 0.219

Latency Distribution

Mean: 346ms
P50 (Median): 346ms
P90: 346ms

ARCD

ELO 150047.5% WR19W-13L-8T

Accuracy Metrics

nDCG@5: 0.797
nDCG@10: 0.809
Recall@5: 0.920
Recall@10: 0.960

Latency Distribution

Mean: 513ms
P50 (Median): 513ms
P90: 513ms

Build RAG in Minutes, Not Months

Agentset gives you a complete RAG API with top-ranked embedding models and smart retrieval built in. Upload your data, call the API, and get accurate results from day one.

Schedule Demo Read the docs

import { Agentset } from "agentset";

const agentset = new Agentset();
const ns = agentset.namespace("ns_1234");

const results = await ns.search(
  "What is multi-head attention?"
);

for (const result of results) {
  console.log(result.text);
}