Back to all rerankers

Contextual AI Rerank v2 Instruct

Available in 1B, 2B, and 6B parameter sizes with unique recency-awareness capabilities for time-sensitive ranking. Only reranker family capable of ranking recent information higher with ~35% performance improvement on recency tasks.

Leaderboard Rank
#6
of 9
ELO Rating
1481
#6
Win Rate
45.1%
#6
Accuracy (nDCG@10)
0.114
#1
Latency
3333ms
#9

Model Information

Provider
Contextual AI
License
cc-by-nc-4.0
Price per 1M tokens
$0.050
Release Date
2025-09-12
Model Name
ctxl-rerank-v2-instruct-multilingual
Total Evaluations
2335

Performance Record

Wins1053 (45.1%)
Losses1282 (54.9%)
Ties0 (0.0%)
Wins
Losses
Ties

Performance Overview

ELO ratings by dataset

Contextual AI Rerank v2 Instruct's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.

Contextual AI Rerank v2 Instruct - ELO by Dataset

Detailed Metrics

Dataset breakdown

Performance metrics across different benchmark datasets, including accuracy and latency percentiles.

PG

ELO 156157.8% WR231W-169L-0T

Accuracy Metrics

nDCG@5
0.000
nDCG@10
0.000
Recall@5
0.000
Recall@10
0.000

Latency Distribution

Mean
3566ms
P50 (Median)
3475ms
P90
4148ms

business reports

ELO 155649.1% WR196W-203L-0T

Accuracy Metrics

nDCG@5
0.000
nDCG@10
0.000
Recall@5
0.000
Recall@10
0.000

Latency Distribution

Mean
3231ms
P50 (Median)
3129ms
P90
3651ms

FiQa

ELO 149135.0% WR136W-252L-0T

Accuracy Metrics

nDCG@5
0.119
nDCG@10
0.125
Recall@5
0.123
Recall@10
0.135

Latency Distribution

Mean
3283ms
P50 (Median)
3209ms
P90
3891ms

MSMARCO

ELO 144150.5% WR184W-180L-0T

Accuracy Metrics

nDCG@5
0.000
nDCG@10
0.000
Recall@5
0.000
Recall@10
0.000

Latency Distribution

Mean
3283ms
P50 (Median)
3260ms
P90
3885ms

DBPedia

ELO 143138.7% WR149W-236L-0T

Accuracy Metrics

nDCG@5
0.000
nDCG@10
0.000
Recall@5
0.000
Recall@10
0.000

Latency Distribution

Mean
3010ms
P50 (Median)
3042ms
P90
3283ms

arguana

ELO 140439.4% WR157W-242L-0T

Accuracy Metrics

nDCG@5
0.525
nDCG@10
0.560
Recall@5
0.860
Recall@10
0.960

Latency Distribution

Mean
3627ms
P50 (Median)
3601ms
P90
4037ms

Compare Models

See how it stacks up

Compare Contextual AI Rerank v2 Instruct with other top rerankers to understand the differences in performance, accuracy, and latency.