Contextual AI Rerank v2 Instruct
Available in 1B, 2B, and 6B parameter sizes, with recency-awareness capabilities for time-sensitive ranking. Described as the only reranker family capable of ranking recent information higher, with a reported ~35% performance improvement on recency tasks.
Model Information
- Provider: Contextual AI
- License: cc-by-nc-4.0
- Price per 1M tokens: $0.050
- Release Date: 2025-09-12
- Model Name: ctxl-rerank-v2-instruct-multilingual
- Total Evaluations: 3000
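As a rough illustration of what the listed price means in practice, the sketch below estimates spend for a reranking workload. The billing model (every query-document pair charged by its token count) and the function and parameter names are assumptions for illustration, not details confirmed by the provider.

```python
# Listed price from the model information above.
PRICE_PER_MILLION_TOKENS_USD = 0.050


def estimate_cost_usd(num_queries: int, docs_per_query: int, avg_tokens_per_pair: int) -> float:
    """Estimate spend in USD.

    Assumption (hypothetical): each (query, document) pair is billed
    by its token count at the flat per-million-token price.
    """
    total_tokens = num_queries * docs_per_query * avg_tokens_per_pair
    return total_tokens / 1_000_000 * PRICE_PER_MILLION_TOKENS_USD


# Example workload: 10,000 queries, 50 candidates each, ~400 tokens per pair.
print(f"${estimate_cost_usd(10_000, 50, 400):.2f}")
```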
Performance Record
Wins: 1270 (42.3%)
Losses: 1644 (54.8%)
Ties: 86 (2.9%)
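The percentages in the record are shares of all evaluations, ties included (1270 + 1644 + 86 = 3000). A minimal sketch of that arithmetic, with a hypothetical helper name:

```python
def record_shares(wins: int, losses: int, ties: int) -> dict[str, float]:
    """Return each outcome as a percentage of all evaluations, rounded to one decimal."""
    total = wins + losses + ties
    if total == 0:
        raise ValueError("record has no evaluations")
    counts = {"wins": wins, "losses": losses, "ties": ties}
    return {name: round(100 * count / total, 1) for name, count in counts.items()}


print(record_shares(1270, 1644, 86))
```

The same calculation reproduces the per-dataset win rates (WR), which also count ties in the denominator.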
Performance Overview
ELO ratings by dataset
Contextual AI Rerank v2 Instruct's ELO rating varies by benchmark dataset, ranging from 1386 to 1526 across the six datasets listed below.
[Chart: Contextual AI Rerank v2 Instruct, ELO by dataset]
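To give a feel for what an ELO gap means, the sketch below computes the expected score of one rating against another using the standard logistic Elo formula. Whether this leaderboard uses that exact formula, the 400-point scale, or any particular baseline rating is an assumption; the page does not state how its ratings are computed.

```python
def elo_expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected score (win probability, with a tie counted as half a win) of A against B.

    Assumption: standard logistic Elo with a 400-point scale.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))


# Hypothetical comparison: a 1526-rated model against a 1500-rated opponent.
print(round(elo_expected_score(1526, 1500), 3))
```

Under this formula, equal ratings give an expected score of 0.5, and the two sides' expected scores always sum to 1.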
Detailed Metrics
Dataset breakdown
Performance metrics across different benchmark datasets, including accuracy and latency percentiles.
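For readers unfamiliar with the metrics reported below, here is a minimal sketch of how nDCG@k, Recall@k, and the latency summary are commonly computed. It assumes the linear-gain form of DCG and nearest-rank percentiles; the leaderboard's exact variants are not stated, and all function names are hypothetical.

```python
import math
import statistics


def dcg_at_k(gains: list[float], k: int) -> float:
    """Discounted cumulative gain over the first k positions (linear gain)."""
    return sum(gain / math.log2(position + 2) for position, gain in enumerate(gains[:k]))


def ndcg_at_k(ranked_ids: list[str], relevance: dict[str, float], k: int) -> float:
    """DCG of the ranking divided by the DCG of an ideal ranking of the judged documents."""
    gains = [relevance.get(doc_id, 0.0) for doc_id in ranked_ids]
    ideal_dcg = dcg_at_k(sorted(relevance.values(), reverse=True), k)
    return dcg_at_k(gains, k) / ideal_dcg if ideal_dcg > 0 else 0.0


def recall_at_k(ranked_ids: list[str], relevance: dict[str, float], k: int) -> float:
    """Fraction of relevant documents that appear in the top k results."""
    relevant = {doc_id for doc_id, rel in relevance.items() if rel > 0}
    if not relevant:
        return 0.0
    return len(relevant & set(ranked_ids[:k])) / len(relevant)


def latency_summary(latencies_ms: list[float]) -> dict[str, float]:
    """Mean plus nearest-rank P50 and P90 of a non-empty list of latencies."""
    ordered = sorted(latencies_ms)

    def nearest_rank(percent: int) -> float:
        # Integer ceiling of percent * n / 100, converted to a 0-based index.
        rank = (percent * len(ordered) + 99) // 100
        return ordered[max(rank - 1, 0)]

    return {"mean": statistics.fmean(ordered), "p50": nearest_rank(50), "p90": nearest_rank(90)}


judgments = {"a": 1.0, "c": 1.0}   # hypothetical relevance labels
ranking = ["b", "a", "c"]          # hypothetical reranker output
print(ndcg_at_k(ranking, judgments, 5), recall_at_k(ranking, judgments, 2))
```

With these definitions, a query that has no positive relevance judgments scores 0.0 on both accuracy metrics regardless of the ranking.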
PG
ELO 1526 | 57.4% WR | 287W-213L-0T
Accuracy Metrics
- nDCG@5: 0.000
- nDCG@10: 0.000
- Recall@5: 0.000
- Recall@10: 0.000
Latency Distribution
- Mean: 3566 ms
- P50 (Median): 3475 ms
- P90: 4148 ms
business reports
ELO 1523 | 46.0% WR | 230W-268L-2T
Accuracy Metrics
- nDCG@5: 0.000
- nDCG@10: 0.000
- Recall@5: 0.000
- Recall@10: 0.000
Latency Distribution
- Mean: 3231 ms
- P50 (Median): 3129 ms
- P90: 3651 ms
FiQA
ELO 1468 | 32.0% WR | 160W-327L-13T
Accuracy Metrics
- nDCG@5: 0.119
- nDCG@10: 0.125
- Recall@5: 0.123
- Recall@10: 0.135
Latency Distribution
- Mean: 3283 ms
- P50 (Median): 3209 ms
- P90: 3891 ms
MSMARCO
ELO 1447 | 45.6% WR | 228W-222L-50T
Accuracy Metrics
- nDCG@5: 0.510
- nDCG@10: 0.538
- Recall@5: 0.720
- Recall@10: 0.800
Latency Distribution
- Mean: 3283 ms
- P50 (Median): 3260 ms
- P90: 3885 ms
DBPedia
ELO 1414 | 35.4% WR | 177W-303L-20T
Accuracy Metrics
- nDCG@5: 0.158
- nDCG@10: 0.159
- Recall@5: 0.004
- Recall@10: 0.005
Latency Distribution
- Mean: 3010 ms
- P50 (Median): 3042 ms
- P90: 3283 ms
ArguAna
ELO 1386 | 37.6% WR | 188W-311L-1T
Accuracy Metrics
- nDCG@5: 0.525
- nDCG@10: 0.560
- Recall@5: 0.860
- Recall@10: 0.960
Latency Distribution
- Mean: 3627 ms
- P50 (Median): 3601 ms
- P90: 4037 ms