Qwen3 Embedding 4B
Mid-size variant supporting 119 languages optimized for semantic retrieval, classification, and RAG applications. Open-sourced under Apache 2.0 license on Hugging Face, GitHub, and ModelScope.
Model Information
- Provider
- Qwen
- License
- Open Source
- Price per 1M tokens
- $0.020
- Release Date
- 2025-06-06
- Model Name
- qwen3-embedding-4b
- Total Evaluations
- 1918
Performance Record
Wins789 (41.1%)
Losses854 (44.5%)
Ties275 (14.3%)
Wins
Losses
Ties
Performance Overview
ELO ratings by dataset
Qwen3 Embedding 4B's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.
Qwen3 Embedding 4B - ELO by Dataset
Detailed Metrics
Dataset breakdown
Performance metrics across different benchmark datasets, including accuracy and latency percentiles.
ARCD
ELO 152142.5% WR102W-72L-66T
Accuracy Metrics
- nDCG@5
- 0.915
- nDCG@10
- 0.922
- Recall@5
- 0.940
- Recall@10
- 0.960
Latency Distribution
- Mean
- 9581ms
- P50 (Median)
- 9389ms
- P90
- 11018ms
MSMARCO
ELO 151145.0% WR108W-90L-42T
Accuracy Metrics
- nDCG@5
- 1.000
- nDCG@10
- 1.000
- Recall@5
- 0.123
- Recall@10
- 0.224
Latency Distribution
- Mean
- 112945ms
- P50 (Median)
- 110686ms
- P90
- 129887ms
FiQa
ELO 149747.5% WR114W-123L-3T
Accuracy Metrics
- nDCG@5
- 0.724
- nDCG@10
- 0.759
- Recall@5
- 0.715
- Recall@10
- 0.835
Latency Distribution
- Mean
- 120137ms
- P50 (Median)
- 117734ms
- P90
- 138158ms
SciFact
ELO 149546.3% WR111W-124L-5T
Accuracy Metrics
- nDCG@5
- 0.695
- nDCG@10
- 0.733
- Recall@5
- 0.787
- Recall@10
- 0.893
Latency Distribution
- Mean
- 143712ms
- P50 (Median)
- 140838ms
- P90
- 165269ms
PG
ELO 149447.1% WR113W-120L-7T
Latency Distribution
- Mean
- 107349ms
- P50 (Median)
- 105202ms
- P90
- 123451ms
NorQuAD
ELO 149319.2% WR46W-57L-137T
Latency Distribution
- Mean
- 13597ms
- P50 (Median)
- 13325ms
- P90
- 15637ms
DBPedia
ELO 147940.3% WR96W-129L-13T
Accuracy Metrics
- nDCG@5
- 0.603
- nDCG@10
- 0.595
- Recall@5
- 0.234
- Recall@10
- 0.375
Latency Distribution
- Mean
- 106747ms
- P50 (Median)
- 104612ms
- P90
- 122759ms
business reports
ELO 147541.3% WR99W-139L-2T
Latency Distribution
- Mean
- 26100ms
- P50 (Median)
- 25578ms
- P90
- 30015ms
Compare Models
See how it stacks up
Compare Qwen3 Embedding 4B with other top embeddings to understand the differences in performance, accuracy, and latency.