BAAI/bge-m3

The first embedding model to support three retrieval methods simultaneously: dense, multi-vector, and sparse. It supports 100+ languages and inputs of up to 8,192 tokens, and achieves state-of-the-art results on the multilingual MIRACL and cross-lingual MKQA benchmarks.
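To make the three retrieval styles concrete, here is a minimal sketch of how each one typically scores a query against a document. The vectors, token weights, and function names are invented for illustration; they are not bge-m3 outputs or its API, and the multi-vector scorer assumes a ColBERT-style MaxSim averaged over query tokens.

```python
# Toy illustration of dense, sparse, and multi-vector scoring.
# All inputs below are made-up values, not real model outputs.

def dense_score(q_vec, d_vec):
    """Dense retrieval: one vector per text, scored by dot product."""
    return sum(q * d for q, d in zip(q_vec, d_vec))

def sparse_score(q_weights, d_weights):
    """Sparse retrieval: sum the weight products over tokens both texts share."""
    return sum(w * d_weights[tok] for tok, w in q_weights.items() if tok in d_weights)

def multi_vector_score(q_vecs, d_vecs):
    """Multi-vector retrieval (assumed MaxSim): for each query token vector,
    take its best match among the document token vectors, then average."""
    best = [max(dense_score(q, d) for d in d_vecs) for q in q_vecs]
    return sum(best) / len(best)

# Hypothetical query/document representations.
q_dense, d_dense = [0.6, 0.8], [0.8, 0.6]
q_sparse = {"bge": 0.5, "model": 0.2}
d_sparse = {"bge": 0.4, "embedding": 0.3}
q_multi = [[1.0, 0.0], [0.0, 1.0]]
d_multi = [[0.9, 0.1], [0.2, 0.7], [0.5, 0.5]]

scores = {
    "dense": dense_score(q_dense, d_dense),
    "sparse": sparse_score(q_sparse, d_sparse),
    "multi_vector": multi_vector_score(q_multi, d_multi),
}
print(scores)
```

In a hybrid setup the three scores are often combined with a weighted sum; the weights would need tuning per dataset.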

Leaderboard Rank: #10 of 13
ELO Rating: 1491 (#10)
Win Rate: 40.9% (#10)
Accuracy (nDCG@10): 0.753 (#10)
Latency: 80874ms (#11)

Model Information

Provider: BAAI
License: Open Source
Price per 1M tokens: $0.010
Release Date: 2024-01-27
Model Name: bge-m3
Total Evaluations: 1920

Performance Record

Wins: 785 (40.9%)
Losses: 885 (46.1%)
Ties: 250 (13.0%)
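The percentages in the performance record follow directly from the raw counts. As a quick check, the sketch below recomputes them; the helper name `share` is ours, and rounding to one decimal place is an assumption that happens to match the figures shown.

```python
# Recompute the win/loss/tie percentages from the counts on this page.
wins, losses, ties = 785, 885, 250
total = wins + losses + ties  # should equal Total Evaluations

def share(count, total):
    """Percentage of all evaluations, rounded to one decimal place."""
    return round(100 * count / total, 1)

record = {
    "wins": share(wins, total),
    "losses": share(losses, total),
    "ties": share(ties, total),
}
print(total, record)
```

Note that the win rate counts ties in the denominator, so wins and losses alone do not sum to 100%.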

Performance Overview

ELO ratings by dataset

BAAI/bge-m3's ELO rating varies by benchmark dataset, from 1440 on SciFact to 1515 on ARCD.

[Chart: BAAI/bge-m3, ELO by dataset]
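The page does not say how its ELO ratings are computed. Assuming the standard Elo formulation with a 400-point scale, the sketch below shows how a rating translates into an expected score against an opponent and how a single result moves the rating. The opponent rating of 1500 and the K-factor of 32 are hypothetical values chosen only for illustration.

```python
# Standard Elo formulas (an assumption; the leaderboard's exact method is not stated).

def expected_score(rating_a, rating_b):
    """Expected score of A against B: 1 = certain win, 0.5 = even match."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))

def updated_rating(rating_a, rating_b, actual, k=32):
    """New rating for A after one comparison.
    actual is 1.0 for a win, 0.5 for a tie, 0.0 for a loss; k is hypothetical."""
    return rating_a + k * (actual - expected_score(rating_a, rating_b))

model_rating = 1491      # overall ELO shown on this page
opponent_rating = 1500   # hypothetical opponent

expected = expected_score(model_rating, opponent_rating)
after_win = updated_rating(model_rating, opponent_rating, actual=1.0)
after_tie = updated_rating(model_rating, opponent_rating, actual=0.5)
print(round(expected, 3), round(after_win, 1), round(after_tie, 1))
```

Under these assumptions, a tie against a slightly higher-rated opponent nudges the rating up, because the expected score was just under 0.5.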

Detailed Metrics

Dataset breakdown

Per-dataset results: ELO, win rate, and win-loss-tie record, plus accuracy metrics (where reported) and latency percentiles.
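For readers unfamiliar with the accuracy metrics, here is a minimal sketch of nDCG@k and Recall@k using their common textbook definitions. The page does not state which variant it uses (for example, binary versus graded relevance), so treat the function names, the log2 discount, and the toy rankings as assumptions.

```python
import math

def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the first k results (log2 discount)."""
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances[:k]))

def ndcg_at_k(ranked_relevances, all_relevances, k):
    """DCG of the actual ranking divided by the DCG of the ideal ranking."""
    ideal = sorted(all_relevances, reverse=True)
    idcg = dcg_at_k(ideal, k)
    return dcg_at_k(ranked_relevances, k) / idcg if idcg > 0 else 0.0

def recall_at_k(ranked_ids, relevant_ids, k):
    """Fraction of all relevant documents that appear in the top k."""
    if not relevant_ids:
        return 0.0
    return len(set(ranked_ids[:k]) & set(relevant_ids)) / len(relevant_ids)

# Toy query 1: 4 relevant docs, 2 of them retrieved in the top 5.
relevant = {"d1", "d2", "d3", "d4"}
ranked = ["d1", "x", "d2", "y", "z"]
gains = [1 if doc in relevant else 0 for doc in ranked]
ndcg_small = ndcg_at_k(gains, [1] * len(relevant), k=5)
recall_small = recall_at_k(ranked, relevant, k=5)

# Toy query 2: 40 relevant docs, and every one of the top 10 is relevant.
many_relevant = {f"r{i}" for i in range(40)}
ranked_many = [f"r{i}" for i in range(10)]
ndcg_many = ndcg_at_k([1] * 10, [1] * 40, k=10)
recall_many = recall_at_k(ranked_many, many_relevant, k=10)

print(round(ndcg_small, 3), recall_small, ndcg_many, recall_many)
```

The second toy query shows that a perfect nDCG@10 can coexist with low Recall@10 when a query has many more relevant documents than k. That may be one reason a dataset can report near-perfect nDCG alongside low recall, though the page does not explain its own figures.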

ARCD

ELO 1515, 42.9% WR, 103W-70L-67T

Accuracy Metrics

nDCG@5: 0.941
nDCG@10: 0.941
Recall@5: 0.960
Recall@10: 0.960

Latency Distribution

Mean: 12723ms
P50 (Median): 12469ms
P90: 14631ms
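The latency figures in each dataset section summarize a distribution of per-run timings. As a rough sketch of how mean, P50, and P90 relate, the code below uses the nearest-rank percentile method on invented sample values; the page does not say which percentile method or what sample set it uses.

```python
import math
import statistics

def percentile(samples, p):
    """Nearest-rank percentile: the smallest value with at least p% of samples at or below it."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]

# Hypothetical latencies in milliseconds, with one slow outlier.
latencies_ms = [110, 120, 125, 130, 135, 140, 150, 160, 190, 400]

mean_ms = statistics.mean(latencies_ms)
p50_ms = percentile(latencies_ms, 50)
p90_ms = percentile(latencies_ms, 90)
print(mean_ms, p50_ms, p90_ms)
```

A mean above the median, as in this toy sample, usually indicates a tail of slow requests pulling the average up.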

MSMARCO

ELO 1507, 44.6% WR, 107W-99L-34T

Accuracy Metrics

nDCG@5: 0.997
nDCG@10: 0.997
Recall@5: 0.122
Recall@10: 0.220

Latency Distribution

Mean: 96153ms
P50 (Median): 94230ms
P90: 110576ms

PG

ELO 1498, 48.8% WR, 117W-118L-5T

Latency Distribution

Mean: 96375ms
P50 (Median): 94448ms
P90: 110831ms

business reports

ELO 1495, 46.7% WR, 112W-118L-10T

Latency Distribution

Mean: 24163ms
P50 (Median): 23680ms
P90: 27787ms

DBPedia

ELO 1495, 45.8% WR, 110W-113L-17T

Accuracy Metrics

nDCG@5: 0.625
nDCG@10: 0.603
Recall@5: 0.236
Recall@10: 0.341

Latency Distribution

Mean: 86619ms
P50 (Median): 84887ms
P90: 99612ms

FiQa

ELO 1490, 47.1% WR, 113W-126L-1T

Accuracy Metrics

nDCG@5: 0.597
nDCG@10: 0.609
Recall@5: 0.607
Recall@10: 0.666

Latency Distribution

Mean: 159871ms
P50 (Median): 156674ms
P90: 183852ms

NorQuAD

ELO 1486, 21.7% WR, 52W-73L-115T

Latency Distribution

Mean: 16285ms
P50 (Median): 15959ms
P90: 18728ms

SciFact

ELO 1440, 29.6% WR, 71W-168L-1T

Accuracy Metrics

nDCG@5: 0.578
nDCG@10: 0.617
Recall@5: 0.652
Recall@10: 0.763

Latency Distribution

Mean: 154804ms
P50 (Median): 151708ms
P90: 178025ms
