Back to all rerankers

Cohere Rerank 4 Fast

Fast cross-encoder reranker for enterprise search and RAG, built for low-latency production workloads. Supports up to 32K context and strong multilingual retrieval across 100+ languages, with optional self-learning to adapt to your domain over time.

Leaderboard Rank
#7
of 11
ELO Rating
1506
#7
Win Rate
49.3%
#7
Latency
447ms
#5

Model Information

Provider
Cohere
License
Proprietary
Price per 1M tokens
$0.050
Release Date
2025-12-11
Model Name
rerank-v4.0-fast
Total Evaluations
3000

Performance Record

Wins1480 (49.3%)
Losses1415 (47.2%)
Ties105 (3.5%)
Wins
Losses
Ties

Performance Overview

ELO ratings by dataset

Cohere Rerank 4 Fast's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.

Cohere Rerank 4 Fast - ELO by Dataset

Detailed Metrics

Dataset breakdown

Performance metrics across different benchmark datasets, including accuracy and latency percentiles.

DBPedia

ELO 160342.8% WR214W-249L-37T

Latency Distribution

Mean
297ms
P50 (Median)
297ms
P90
309ms

business reports

ELO 157956.0% WR280W-210L-10T

Latency Distribution

Mean
428ms
P50 (Median)
408ms
P90
550ms

MSMARCO

ELO 151345.2% WR226W-227L-47T

Latency Distribution

Mean
403ms
P50 (Median)
382ms
P90
486ms

PG

ELO 145642.0% WR210W-290L-0T

Latency Distribution

Mean
492ms
P50 (Median)
439ms
P90
650ms

FiQa

ELO 144352.4% WR262W-229L-9T

Latency Distribution

Mean
485ms
P50 (Median)
459ms
P90
624ms

arguana

ELO 144357.6% WR288W-210L-2T

Latency Distribution

Mean
574ms
P50 (Median)
562ms
P90
728ms

Compare Models

See how it stacks up

Compare Cohere Rerank 4 Fast with other top rerankers to understand the differences in performance, accuracy, and latency.