Cohere Rerank 3.5 vs Voyage AI Rerank 2.5

Detailed comparison between Cohere Rerank 3.5 and Voyage AI Rerank 2.5. See which reranker best meets your accuracy and performance needs.

Model Comparison

Voyage AI Rerank 2.5 takes the lead.

Both Cohere Rerank 3.5 and Voyage AI Rerank 2.5 are powerful reranking models designed to improve retrieval quality in RAG applications. However, their performance characteristics differ in important ways.

Why Voyage AI Rerank 2.5:

  • Voyage AI Rerank 2.5 has 207 higher ELO rating
  • Voyage AI Rerank 2.5 has a 30.3% higher win rate

Overview

Key metrics

ELO Rating

Overall ranking quality

Cohere Rerank 3.5

1395

Voyage AI Rerank 2.5

1601

Win Rate

Head-to-head performance

Cohere Rerank 3.5

34.9%

Voyage AI Rerank 2.5

65.2%

Accuracy (nDCG@10)

Ranking quality metric

Cohere Rerank 3.5

0.497

Voyage AI Rerank 2.5

0.501

Average Latency

Response time

Cohere Rerank 3.5

603ms

Voyage AI Rerank 2.5

595ms

Visual Performance Analysis

Performance

ELO Rating Comparison

Win/Loss/Tie Breakdown

Accuracy Across Datasets (nDCG@10)

Latency Distribution (ms)

Breakdown

How the models stack up

MetricCohere Rerank 3.5Voyage AI Rerank 2.5Description
Overall Performance
ELO Rating
1395
1601
Overall ranking quality based on pairwise comparisons
Win Rate
34.9%
65.2%
Percentage of comparisons won against other models
Accuracy Metrics
Avg nDCG@10
0.497
0.501
Normalized discounted cumulative gain at position 10
Performance Metrics
Avg Latency
603ms
595ms
Average response time across all datasets

Dataset Performance

By field

Comprehensive comparison of accuracy metrics (nDCG, Recall) and latency percentiles for each benchmark dataset.

BEIR/fiqa

MetricCohere Rerank 3.5Voyage AI Rerank 2.5Description
Accuracy Metrics
nDCG@5
0.121
0.108
Ranking quality at top 5 results
nDCG@10
0.127
0.119
Ranking quality at top 10 results
Recall@5
0.118
0.098
% of relevant docs in top 5
Recall@10
0.130
0.128
% of relevant docs in top 10
Latency Metrics
Mean
510ms
530ms
Average response time
P50
569ms
482ms
50th percentile (median)
P90
617ms
722ms
90th percentile

BEIR/scifact

MetricCohere Rerank 3.5Voyage AI Rerank 2.5Description
Accuracy Metrics
nDCG@5
0.856
0.865
Ranking quality at top 5 results
nDCG@10
0.866
0.882
Ranking quality at top 10 results
Recall@5
0.891
0.892
% of relevant docs in top 5
Recall@10
0.920
0.940
% of relevant docs in top 10
Latency Metrics
Mean
638ms
667ms
Average response time
P50
612ms
621ms
50th percentile (median)
P90
819ms
819ms
90th percentile

PG

MetricCohere Rerank 3.5Voyage AI Rerank 2.5Description
Accuracy Metrics
Latency Metrics
Mean
661ms
588ms
Average response time
P50
613ms
611ms
50th percentile (median)
P90
792ms
746ms
90th percentile

Explore More

Compare more rerankers

See how all reranking models stack up. Compare Cohere, Jina AI, Voyage, ZeRank, and more. View comprehensive benchmarks, compare performance metrics, and find the perfect reranker for your RAG application.