BAAI/bge-m3
First embedding model supporting three retrieval methods: dense, multi-vector, and sparse retrieval simultaneously. Supports 100+ languages with up to 8,192 token inputs and achieves SOTA on multilingual MIRACL and cross-lingual MKQA benchmarks.
Leaderboard Rank
#10
of 13
ELO Rating
1491
#10
Win Rate
40.9%
#10
Accuracy (nDCG@10)
0.753
#10
Latency
80874ms
#11
Model Information
- Provider
- BAAI
- License
- Open Source
- Price per 1M tokens
- $0.010
- Release Date
- 2024-01-27
- Model Name
- bge-m3
- Total Evaluations
- 1920
Performance Record
Wins785 (40.9%)
Losses885 (46.1%)
Ties250 (13.0%)
Wins
Losses
Ties
Performance Overview
ELO ratings by dataset
BAAI/bge-m3's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.
BAAI/bge-m3 - ELO by Dataset
Detailed Metrics
Dataset breakdown
Performance metrics across different benchmark datasets, including accuracy and latency percentiles.
ARCD
ELO 151542.9% WR103W-70L-67T
Accuracy Metrics
- nDCG@5
- 0.941
- nDCG@10
- 0.941
- Recall@5
- 0.960
- Recall@10
- 0.960
Latency Distribution
- Mean
- 12723ms
- P50 (Median)
- 12469ms
- P90
- 14631ms
MSMARCO
ELO 150744.6% WR107W-99L-34T
Accuracy Metrics
- nDCG@5
- 0.997
- nDCG@10
- 0.997
- Recall@5
- 0.122
- Recall@10
- 0.220
Latency Distribution
- Mean
- 96153ms
- P50 (Median)
- 94230ms
- P90
- 110576ms
PG
ELO 149848.8% WR117W-118L-5T
Latency Distribution
- Mean
- 96375ms
- P50 (Median)
- 94448ms
- P90
- 110831ms
business reports
ELO 149546.7% WR112W-118L-10T
Latency Distribution
- Mean
- 24163ms
- P50 (Median)
- 23680ms
- P90
- 27787ms
DBPedia
ELO 149545.8% WR110W-113L-17T
Accuracy Metrics
- nDCG@5
- 0.625
- nDCG@10
- 0.603
- Recall@5
- 0.236
- Recall@10
- 0.341
Latency Distribution
- Mean
- 86619ms
- P50 (Median)
- 84887ms
- P90
- 99612ms
FiQa
ELO 149047.1% WR113W-126L-1T
Accuracy Metrics
- nDCG@5
- 0.597
- nDCG@10
- 0.609
- Recall@5
- 0.607
- Recall@10
- 0.666
Latency Distribution
- Mean
- 159871ms
- P50 (Median)
- 156674ms
- P90
- 183852ms
NorQuAD
ELO 148621.7% WR52W-73L-115T
Latency Distribution
- Mean
- 16285ms
- P50 (Median)
- 15959ms
- P90
- 18728ms
SciFact
ELO 144029.6% WR71W-168L-1T
Accuracy Metrics
- nDCG@5
- 0.578
- nDCG@10
- 0.617
- Recall@5
- 0.652
- Recall@10
- 0.763
Latency Distribution
- Mean
- 154804ms
- P50 (Median)
- 151708ms
- P90
- 178025ms
Compare Models
See how it stacks up
Compare BAAI/bge-m3 with other top embeddings to understand the differences in performance, accuracy, and latency.