Back to all embeddings

Qwen3 Embedding 4B

Mid-size 4 billion parameter model with strong multilingual capabilities across 100+ languages. Supports user-defined instructions for task-specific optimization in text retrieval, classification, and clustering applications.

Leaderboard Rank
#8
of 13
ELO Rating
1496
#8
Win Rate
41.1%
#8
Accuracy (nDCG@10)
0.802
#6
Latency
27ms
#9

Model Information

Provider
Qwen
License
Open Source
Price per 1M tokens
$0.020
Dimensions
2560
Release Date
2025-06-06
Model Name
qwen3-embedding-4b
Total Evaluations
1918

Performance Record

Wins789 (41.1%)
Losses854 (44.5%)
Ties275 (14.3%)
Wins
Losses
Ties

Performance Overview

ELO ratings by dataset

Qwen3 Embedding 4B's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.

Qwen3 Embedding 4B - ELO by Dataset

Detailed Metrics

Dataset breakdown

Performance metrics across different benchmark datasets, including accuracy and latency percentiles.

ARCD

ELO 152142.5% WR102W-72L-66T

Accuracy Metrics

nDCG@5
0.915
nDCG@10
0.922
Recall@5
0.940
Recall@10
0.960

Latency Distribution

Mean
33ms
P50 (Median)
32ms
P90
38ms

MSMARCO

ELO 151145.0% WR108W-90L-42T

Accuracy Metrics

nDCG@5
1.000
nDCG@10
1.000
Recall@5
0.123
Recall@10
0.224

Latency Distribution

Mean
22ms
P50 (Median)
22ms
P90
26ms

FiQa

ELO 149747.5% WR114W-123L-3T

Accuracy Metrics

nDCG@5
0.724
nDCG@10
0.759
Recall@5
0.715
Recall@10
0.835

Latency Distribution

Mean
24ms
P50 (Median)
23ms
P90
27ms

SciFact

ELO 149546.3% WR111W-124L-5T

Accuracy Metrics

nDCG@5
0.695
nDCG@10
0.733
Recall@5
0.787
Recall@10
0.893

Latency Distribution

Mean
28ms
P50 (Median)
28ms
P90
33ms

PG

ELO 149447.1% WR113W-120L-7T

Latency Distribution

Mean
18ms
P50 (Median)
18ms
P90
21ms

NorQuAD

ELO 149319.2% WR46W-57L-137T

Latency Distribution

Mean
33ms
P50 (Median)
33ms
P90
38ms

DBPedia

ELO 147940.3% WR96W-129L-13T

Accuracy Metrics

nDCG@5
0.603
nDCG@10
0.595
Recall@5
0.234
Recall@10
0.375

Latency Distribution

Mean
21ms
P50 (Median)
21ms
P90
24ms

business reports

ELO 147541.3% WR99W-139L-2T

Latency Distribution

Mean
37ms
P50 (Median)
36ms
P90
42ms

Compare Models

See how it stacks up

Compare Qwen3 Embedding 4B with other top embeddings to understand the differences in performance, accuracy, and latency.