Back to all embeddings

Qwen3 Embedding 4B

Mid-size variant supporting 119 languages optimized for semantic retrieval, classification, and RAG applications. Open-sourced under Apache 2.0 license on Hugging Face, GitHub, and ModelScope.

Leaderboard Rank
#8
of 13
ELO Rating
1496
#8
Win Rate
41.1%
#8
Accuracy (nDCG@10)
0.802
#6
Latency
80021ms
#10

Model Information

Provider
Qwen
License
Open Source
Price per 1M tokens
$0.020
Release Date
2025-06-06
Model Name
qwen3-embedding-4b
Total Evaluations
1918

Performance Record

Wins789 (41.1%)
Losses854 (44.5%)
Ties275 (14.3%)
Wins
Losses
Ties

Performance Overview

ELO ratings by dataset

Qwen3 Embedding 4B's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.

Qwen3 Embedding 4B - ELO by Dataset

Detailed Metrics

Dataset breakdown

Performance metrics across different benchmark datasets, including accuracy and latency percentiles.

ARCD

ELO 152142.5% WR102W-72L-66T

Accuracy Metrics

nDCG@5
0.915
nDCG@10
0.922
Recall@5
0.940
Recall@10
0.960

Latency Distribution

Mean
9581ms
P50 (Median)
9389ms
P90
11018ms

MSMARCO

ELO 151145.0% WR108W-90L-42T

Accuracy Metrics

nDCG@5
1.000
nDCG@10
1.000
Recall@5
0.123
Recall@10
0.224

Latency Distribution

Mean
112945ms
P50 (Median)
110686ms
P90
129887ms

FiQa

ELO 149747.5% WR114W-123L-3T

Accuracy Metrics

nDCG@5
0.724
nDCG@10
0.759
Recall@5
0.715
Recall@10
0.835

Latency Distribution

Mean
120137ms
P50 (Median)
117734ms
P90
138158ms

SciFact

ELO 149546.3% WR111W-124L-5T

Accuracy Metrics

nDCG@5
0.695
nDCG@10
0.733
Recall@5
0.787
Recall@10
0.893

Latency Distribution

Mean
143712ms
P50 (Median)
140838ms
P90
165269ms

PG

ELO 149447.1% WR113W-120L-7T

Latency Distribution

Mean
107349ms
P50 (Median)
105202ms
P90
123451ms

NorQuAD

ELO 149319.2% WR46W-57L-137T

Latency Distribution

Mean
13597ms
P50 (Median)
13325ms
P90
15637ms

DBPedia

ELO 147940.3% WR96W-129L-13T

Accuracy Metrics

nDCG@5
0.603
nDCG@10
0.595
Recall@5
0.234
Recall@10
0.375

Latency Distribution

Mean
106747ms
P50 (Median)
104612ms
P90
122759ms

business reports

ELO 147541.3% WR99W-139L-2T

Latency Distribution

Mean
26100ms
P50 (Median)
25578ms
P90
30015ms

Compare Models

See how it stacks up

Compare Qwen3 Embedding 4B with other top embeddings to understand the differences in performance, accuracy, and latency.