Qwen3 Embedding 8B

Qwen3 Embedding 8B is an 8-billion-parameter text embedding model that supports 119 languages, including multilingual and cross-lingual retrieval. It is optimized for text retrieval, clustering, and classification tasks.

Leaderboard Rank: #3 of 13
ELO Rating: 1516 (rank #3)
Win Rate: 47.9% (rank #4)
Accuracy (nDCG@10): 0.818 (rank #2)
Latency: 130758ms (rank #12)

Model Information

Provider: Qwen
License: Open Source
Price per 1M tokens: $0.050
Release Date: 2025-06-06
Model Name: qwen3-embedding-8b
Total Evaluations: 1917

Performance Record

Wins: 919 (47.9%)
Losses: 730 (38.1%)
Ties: 268 (14.0%)
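
The percentages in this record follow from the counts: each is that outcome's share of the 1917 total evaluations. A minimal Python sketch of the arithmetic, assuming (as the numbers suggest) that ties count toward the total but not as wins; `record_shares` is a hypothetical helper name, not part of any leaderboard API.

```python
def record_shares(wins: int, losses: int, ties: int) -> dict[str, float]:
    """Return the total number of evaluations and each outcome's share in percent."""
    total = wins + losses + ties
    if total == 0:
        raise ValueError("record has no evaluations")
    return {
        "total": total,
        # Assumption: ties stay in the denominator, so win rate = wins / total.
        "win_pct": round(100 * wins / total, 1),
        "loss_pct": round(100 * losses / total, 1),
        "tie_pct": round(100 * ties / total, 1),
    }


# Counts taken from the performance record on this page.
overall = record_shares(wins=919, losses=730, ties=268)
print(overall)
```

The same function applied to a single dataset's record (for example 120W-45L-75T) gives that dataset's win rate.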

Performance Overview

ELO ratings by dataset

Qwen3 Embedding 8B's ELO rating varies by benchmark dataset, from 1550 on ARCD to 1491 on PG.
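
The page does not describe how its ELO ratings are computed. For orientation only, the sketch below shows the standard Elo expected-score and update formulas in Python. The K-factor of 32, the scoring of a tie as 0.5, and the function names are all assumptions for illustration, not the leaderboard's documented method.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Standard Elo expected score of A against B (between 0 and 1)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_ratings(
    rating_a: float, rating_b: float, score_a: float, k: float = 32.0
) -> tuple[float, float]:
    """Return updated (A, B) ratings after one comparison.

    score_a is 1.0 for a win by A, 0.0 for a loss, and 0.5 for a tie
    (assumed convention). k is the assumed K-factor.
    """
    exp_a = expected_score(rating_a, rating_b)
    delta = k * (score_a - exp_a)
    # The update is zero-sum: whatever A gains, B loses.
    return rating_a + delta, rating_b - delta


# Two equally rated models: the winner gains k/2 points, the loser drops k/2.
new_a, new_b = update_ratings(1500.0, 1500.0, score_a=1.0)
print(new_a, new_b)
```

Under these formulas a tie between equally rated models leaves both ratings unchanged, so a record with many ties moves a rating less than the same number of decisive results.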

[Chart: Qwen3 Embedding 8B, ELO by Dataset]

Detailed Metrics

Dataset breakdown

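Per-dataset metrics: ELO, win rate (WR), and win-loss-tie record for every dataset, plus latency statistics (mean, P50, P90). Accuracy metrics (nDCG and Recall at 5 and 10) are listed for ARCD, SciFact, DBPedia, FiQa, and MSMARCO only.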
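
For readers unfamiliar with the accuracy metrics, here is a minimal Python sketch of nDCG@k and Recall@k. It assumes binary relevance (a document is either relevant or not); the page does not state which relevance scheme or evaluation library the leaderboard uses, and the function names and document ids are hypothetical. The example also shows that nDCG@k can be perfect while Recall@k is low when a query has far more relevant documents than k, which is one way a combination like MSMARCO's figures below can arise.

```python
import math


def ndcg_at_k(ranked_ids: list[str], relevant_ids: set[str], k: int) -> float:
    """nDCG@k with binary gains: 1 for a relevant document, 0 otherwise."""
    gains = [1.0 if doc_id in relevant_ids else 0.0 for doc_id in ranked_ids[:k]]
    dcg = sum(gain / math.log2(rank + 2) for rank, gain in enumerate(gains))
    # Ideal ranking: as many relevant documents as fit in the top k.
    ideal_hits = min(k, len(relevant_ids))
    idcg = sum(1.0 / math.log2(rank + 2) for rank in range(ideal_hits))
    return dcg / idcg if idcg > 0 else 0.0


def recall_at_k(ranked_ids: list[str], relevant_ids: set[str], k: int) -> float:
    """Fraction of all relevant documents that appear in the top k."""
    if not relevant_ids:
        return 0.0
    hits = len(set(ranked_ids[:k]) & relevant_ids)
    return hits / len(relevant_ids)


# Hypothetical query with 40 relevant documents; the top 5 results are all relevant.
relevant = {f"d{i}" for i in range(40)}
ranking = [f"d{i}" for i in range(5)] + ["x1", "x2", "x3", "x4", "x5"]

print(ndcg_at_k(ranking, relevant, k=5))    # every top-5 slot holds a relevant doc
print(recall_at_k(ranking, relevant, k=5))  # only 5 of the 40 relevant docs found
```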

ARCD

ELO 1550, 50.0% WR, 120W-45L-75T

Accuracy Metrics

nDCG@5: 0.913
nDCG@10: 0.919
Recall@5: 0.920
Recall@10: 0.940

Latency Distribution

Mean: 12156ms
P50 (Median): 11913ms
P90: 13979ms
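
Mean, P50, and P90 are standard summaries of a set of latency samples. The page does not say how the samples were collected or which percentile interpolation was used, so the Python sketch below is only an illustration on made-up values. `latency_summary` is a hypothetical helper, and the `inclusive` interpolation method is an assumed choice.

```python
from statistics import mean, median, quantiles


def latency_summary(samples_ms: list[float]) -> dict[str, float]:
    """Summarize latency samples (in milliseconds) as mean, P50, and P90."""
    if len(samples_ms) < 2:
        raise ValueError("need at least two samples")
    # quantiles(n=10) returns the nine decile cut points; the last one is P90.
    deciles = quantiles(samples_ms, n=10, method="inclusive")
    return {
        "mean": mean(samples_ms),
        "p50": median(samples_ms),
        "p90": deciles[-1],
    }


# Made-up samples, not taken from the leaderboard.
summary = latency_summary([100, 200, 300, 400, 500, 600, 700, 800, 900, 1000])
print(summary)
```

A P90 well above the median indicates a slow tail; in the figures on this page, P90 sits roughly 15% to 17% above P50 for each dataset.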

SciFact

ELO 1528, 58.3% WR, 140W-95L-5T

Accuracy Metrics

nDCG@5: 0.750
nDCG@10: 0.765
Recall@5: 0.843
Recall@10: 0.883

Latency Distribution

Mean: 200031ms
P50 (Median): 196030ms
P90: 230036ms

DBPedia

ELO 1525, 54.9% WR, 130W-89L-18T

Accuracy Metrics

nDCG@5: 0.633
nDCG@10: 0.625
Recall@5: 0.235
Recall@10: 0.381

Latency Distribution

Mean: 200373ms
P50 (Median): 196366ms
P90: 230429ms

FiQa

ELO 1521, 54.2% WR, 130W-103L-7T

Accuracy Metrics

nDCG@5: 0.760
nDCG@10: 0.781
Recall@5: 0.732
Recall@10: 0.814

Latency Distribution

Mean: 189471ms
P50 (Median): 185682ms
P90: 217892ms

MSMARCO

ELO 1506, 47.1% WR, 113W-98L-29T

Accuracy Metrics

nDCG@5: 1.000
nDCG@10: 1.000
Recall@5: 0.123
Recall@10: 0.224

Latency Distribution

Mean: 180144ms
P50 (Median): 176541ms
P90: 207166ms

NorQuAD

ELO 1503, 26.7% WR, 64W-58L-118T

Latency Distribution

Mean: 18544ms
P50 (Median): 18173ms
P90: 21326ms

business reports

ELO 1502, 47.5% WR, 114W-115L-11T

Latency Distribution

Mean: 93874ms
P50 (Median): 91997ms
P90: 107955ms

PG

ELO 1491, 45.0% WR, 108W-127L-5T

Latency Distribution

Mean: 151470ms
P50 (Median): 148441ms
P90: 174191ms
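
The headline figures at the top of the page appear consistent with unweighted means of the per-dataset values listed above: the eight dataset ELOs, the eight mean latencies, and the five reported nDCG@10 values. This is an observation from the numbers, not a documented aggregation method. A short Python check, with the values copied from this page and variable names chosen for illustration:

```python
from statistics import mean

# Per-dataset values copied from the dataset breakdown on this page.
elo = {
    "ARCD": 1550, "SciFact": 1528, "DBPedia": 1525, "FiQa": 1521,
    "MSMARCO": 1506, "NorQuAD": 1503, "business reports": 1502, "PG": 1491,
}
mean_latency_ms = {
    "ARCD": 12156, "SciFact": 200031, "DBPedia": 200373, "FiQa": 189471,
    "MSMARCO": 180144, "NorQuAD": 18544, "business reports": 93874, "PG": 151470,
}
# nDCG@10 is only reported for five of the eight datasets.
ndcg_at_10 = {
    "ARCD": 0.919, "SciFact": 0.765, "DBPedia": 0.625, "FiQa": 0.781, "MSMARCO": 1.000,
}

# Assumption: a plain unweighted mean across datasets, rounded as displayed.
overall_elo = round(mean(elo.values()))
overall_latency_ms = round(mean(mean_latency_ms.values()))
overall_ndcg_at_10 = round(mean(ndcg_at_10.values()), 3)

print(overall_elo, overall_latency_ms, overall_ndcg_at_10)
```

If the aggregation really is an unweighted mean, the overall latency figure is dominated by the slower datasets (SciFact, DBPedia, FiQa, MSMARCO), whose mean latencies are roughly ten or more times those of ARCD and NorQuAD.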
