Back to all rerankers

Contextual AI Rerank v2 Instruct

Available in 1B, 2B, and 6B parameter sizes with unique recency-awareness capabilities for time-sensitive ranking. Only reranker family capable of ranking recent information higher with ~35% performance improvement on recency tasks.

Leaderboard Rank
#8
of 11
ELO Rating
1461
#8
Win Rate
42.3%
#8
Accuracy (nDCG@10)
0.230
#2
Latency
3333ms
#11

Model Information

Provider
Contextual AI
License
cc-by-nc-4.0
Price per 1M tokens
$0.050
Release Date
2025-09-12
Model Name
ctxl-rerank-v2-instruct-multilingual
Total Evaluations
3000

Performance Record

Wins1270 (42.3%)
Losses1644 (54.8%)
Ties86 (2.9%)
Wins
Losses
Ties

Performance Overview

ELO ratings by dataset

Contextual AI Rerank v2 Instruct's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.

Contextual AI Rerank v2 Instruct - ELO by Dataset

Detailed Metrics

Dataset breakdown

Performance metrics across different benchmark datasets, including accuracy and latency percentiles.

PG

ELO 152657.4% WR287W-213L-0T

Accuracy Metrics

nDCG@5
0.000
nDCG@10
0.000
Recall@5
0.000
Recall@10
0.000

Latency Distribution

Mean
3566ms
P50 (Median)
3475ms
P90
4148ms

business reports

ELO 152346.0% WR230W-268L-2T

Accuracy Metrics

nDCG@5
0.000
nDCG@10
0.000
Recall@5
0.000
Recall@10
0.000

Latency Distribution

Mean
3231ms
P50 (Median)
3129ms
P90
3651ms

FiQa

ELO 146832.0% WR160W-327L-13T

Accuracy Metrics

nDCG@5
0.119
nDCG@10
0.125
Recall@5
0.123
Recall@10
0.135

Latency Distribution

Mean
3283ms
P50 (Median)
3209ms
P90
3891ms

MSMARCO

ELO 144745.6% WR228W-222L-50T

Accuracy Metrics

nDCG@5
0.510
nDCG@10
0.538
Recall@5
0.720
Recall@10
0.800

Latency Distribution

Mean
3283ms
P50 (Median)
3260ms
P90
3885ms

DBPedia

ELO 141435.4% WR177W-303L-20T

Accuracy Metrics

nDCG@5
0.158
nDCG@10
0.159
Recall@5
0.004
Recall@10
0.005

Latency Distribution

Mean
3010ms
P50 (Median)
3042ms
P90
3283ms

arguana

ELO 138637.6% WR188W-311L-1T

Accuracy Metrics

nDCG@5
0.525
nDCG@10
0.560
Recall@5
0.860
Recall@10
0.960

Latency Distribution

Mean
3627ms
P50 (Median)
3601ms
P90
4037ms

Compare Models

See how it stacks up

Compare Contextual AI Rerank v2 Instruct with other top rerankers to understand the differences in performance, accuracy, and latency.