Jina Reranker v2 Base Multilingual vs Voyage AI Rerank 2.5
Detailed comparison between Jina Reranker v2 Base Multilingual and Voyage AI Rerank 2.5. See which reranker best meets your accuracy and performance needs.
Model Comparison
Voyage AI Rerank 2.5 takes the lead.
Both Jina Reranker v2 Base Multilingual and Voyage AI Rerank 2.5 are reranking models designed to improve retrieval quality in RAG applications. On this benchmark, however, they differ in ranking quality, head-to-head win rate, and latency.
Why Voyage AI Rerank 2.5:
- Voyage AI Rerank 2.5 has a 143-point higher ELO rating (1601 vs 1458)
- Voyage AI Rerank 2.5 delivers better average accuracy (nDCG@10: 0.501 vs 0.477)
- Voyage AI Rerank 2.5 is 449ms faster on average (595ms vs 1044ms)
- Voyage AI Rerank 2.5 has a win rate 22.0 percentage points higher (65.2% vs 43.2%)
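The headline gaps above can be reproduced from the figures in the comparison table. The sketch below also converts the ELO gap into an expected head-to-head score, assuming the conventional Elo logistic curve with a 400-point scale. The page does not say which ELO variant the leaderboard uses, so `elo_expected_score` and its `scale` parameter are illustrative assumptions rather than the benchmark's own method.

```python
# Headline figures copied from the comparison table on this page.
jina = {"elo": 1458, "win_rate": 43.2, "ndcg10": 0.477, "latency_ms": 1044}
voyage = {"elo": 1601, "win_rate": 65.2, "ndcg10": 0.501, "latency_ms": 595}

elo_gap = voyage["elo"] - jina["elo"]
# Win rates are percentages, so their difference is in percentage points.
win_rate_gap_pp = round(voyage["win_rate"] - jina["win_rate"], 1)
ndcg_gap = round(voyage["ndcg10"] - jina["ndcg10"], 3)
latency_gap_ms = jina["latency_ms"] - voyage["latency_ms"]


def elo_expected_score(rating_gap: float, scale: float = 400.0) -> float:
    """Expected score of the higher-rated model under the standard Elo curve.

    ASSUMPTION: the 400-point scale is the chess convention, not something
    stated by the benchmark.
    """
    return 1.0 / (1.0 + 10 ** (-rating_gap / scale))


expected = elo_expected_score(elo_gap)

print(f"ELO gap: {elo_gap} points")
print(f"Win rate gap: {win_rate_gap_pp} percentage points")
print(f"nDCG@10 gap: {ndcg_gap}")
print(f"Latency gap: {latency_gap_ms} ms")
print(f"Expected head-to-head score (assumed Elo curve): {expected:.2f}")
```

Under that assumption, a 143-point gap corresponds to an expected score a little under 0.70 for the higher-rated model. The reported win rates are measured against all models on the leaderboard, so they are not directly comparable to this pairwise estimate.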
Overview
Key metrics
- ELO Rating (overall ranking quality): Jina Reranker v2 Base Multilingual 1458, Voyage AI Rerank 2.5 1601
- Win Rate (head-to-head performance): Jina Reranker v2 Base Multilingual 43.2%, Voyage AI Rerank 2.5 65.2%
- Accuracy (nDCG@10, ranking quality metric): Jina Reranker v2 Base Multilingual 0.477, Voyage AI Rerank 2.5 0.501
- Average Latency (response time): Jina Reranker v2 Base Multilingual 1044ms, Voyage AI Rerank 2.5 595ms
Visual Performance Analysis
Performance
[Charts: ELO Rating Comparison; Win/Loss/Tie Breakdown; Accuracy Across Datasets (nDCG@10); Latency Distribution (ms)]
Breakdown
How the models stack up
| Metric | Jina Reranker v2 Base Multilingual | Voyage AI Rerank 2.5 | Description |
|---|---|---|---|
| Overall Performance | | | |
| ELO Rating | 1458 | 1601 | Overall ranking quality based on pairwise comparisons |
| Win Rate | 43.2% | 65.2% | Percentage of comparisons won against other models |
| Accuracy Metrics | | | |
| Avg nDCG@10 | 0.477 | 0.501 | Normalized discounted cumulative gain at position 10 |
| Performance Metrics | | | |
| Avg Latency | 1044ms | 595ms | Average response time across all datasets |
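For readers unfamiliar with the accuracy metric in the table, here is a minimal sketch of how nDCG@k is typically computed. It assumes linear gain and a log2 rank discount, and it derives the ideal ordering from the same candidate list. The benchmark's exact variant is not stated on this page, and the function names and sample labels are hypothetical.

```python
import math


def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the first k relevance labels."""
    return sum(
        rel / math.log2(rank + 1)
        for rank, rel in enumerate(relevances[:k], start=1)
    )


def ndcg_at_k(ranked_relevances, k=10):
    """DCG of the given ordering divided by the DCG of the ideal ordering.

    ASSUMPTION: the ideal ordering is built from the same candidate labels,
    which is a simplification when relevant documents were never retrieved.
    """
    ideal_dcg = dcg_at_k(sorted(ranked_relevances, reverse=True), k)
    if ideal_dcg == 0:
        return 0.0
    return dcg_at_k(ranked_relevances, k) / ideal_dcg


# Hypothetical relevance labels in the order a reranker returned the documents.
perfect_order = [1, 1, 0, 0]
imperfect_order = [0, 1, 0, 1]

print(ndcg_at_k(perfect_order))    # 1.0, since the order is already ideal
print(ndcg_at_k(imperfect_order))  # below 1.0, relevant docs are ranked too low
```

A score of 1.0 means the reranker placed every relevant document ahead of every irrelevant one within the top k.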
Dataset Performance
By field
Comprehensive comparison of accuracy metrics (nDCG, Recall) and latency percentiles for each benchmark dataset.
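The other two metric families reported per dataset can be sketched briefly: Recall@k and latency percentiles. The helpers below use the common set-based recall definition and the nearest-rank percentile method. Both are assumptions, since the page does not describe how the benchmark computes them, and the document IDs and latency samples are hypothetical.

```python
import math


def recall_at_k(ranked_ids, relevant_ids, k):
    """Fraction of the relevant documents that appear in the top k results."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    return len(set(ranked_ids[:k]) & relevant) / len(relevant)


def percentile_nearest_rank(samples, pct):
    """Nearest-rank percentile: smallest sample with at least pct% at or below it."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


# Hypothetical reranker output and relevance judgments for one query.
ranked = ["d3", "d1", "d7", "d2", "d9"]
relevant = {"d1", "d2", "d4"}
print(recall_at_k(ranked, relevant, k=5))  # 2 of the 3 relevant docs are in the top 5

# Hypothetical per-request latencies in milliseconds.
latencies_ms = [480, 500, 520, 610, 900]
print(percentile_nearest_rank(latencies_ms, 50))  # 520
print(percentile_nearest_rank(latencies_ms, 90))  # 900
```

The gap between P50 and P90 shows how heavy the latency tail is, which the mean alone can hide.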
BEIR/fiqa
| Metric | Jina Reranker v2 Base Multilingual | Voyage AI Rerank 2.5 | Description |
|---|---|---|---|
| Accuracy Metrics | | | |
| nDCG@5 | 0.112 | 0.108 | Ranking quality at top 5 results |
| nDCG@10 | 0.121 | 0.119 | Ranking quality at top 10 results |
| Recall@5 | 0.105 | 0.098 | Share of relevant docs retrieved in top 5 |
| Recall@10 | 0.130 | 0.128 | Share of relevant docs retrieved in top 10 |
| Latency Metrics | | | |
| Mean | 1014ms | 530ms | Average response time |
| P50 | 827ms | 482ms | 50th percentile (median) |
| P90 | 1605ms | 722ms | 90th percentile |
BEIR/scifact
| Metric | Jina Reranker v2 Base Multilingual | Voyage AI Rerank 2.5 | Description |
|---|---|---|---|
| Accuracy Metrics | | | |
| nDCG@5 | 0.830 | 0.865 | Ranking quality at top 5 results |
| nDCG@10 | 0.832 | 0.882 | Ranking quality at top 10 results |
| Recall@5 | 0.871 | 0.892 | Share of relevant docs retrieved in top 5 |
| Recall@10 | 0.876 | 0.940 | Share of relevant docs retrieved in top 10 |
| Latency Metrics | | | |
| Mean | 1123ms | 667ms | Average response time |
| P50 | 1015ms | 621ms | 50th percentile (median) |
| P90 | 1285ms | 819ms | 90th percentile |
PG
| Metric | Jina Reranker v2 Base Multilingual | Voyage AI Rerank 2.5 | Description |
|---|---|---|---|
| Accuracy Metrics | | | |
| Latency Metrics | | | |
| Mean | 994ms | 588ms | Average response time |
| P50 | 839ms | 611ms | 50th percentile (median) |
| P90 | 1436ms | 746ms | 90th percentile |
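The summary averages in the breakdown table appear to be plain means of the per-dataset figures. The sketch below checks this using only numbers copied from the tables on this page. The nDCG@10 average covers the two BEIR datasets only, because the PG table lists no accuracy rows. The dictionary layout, the `average` helper, and the tolerances are illustrative choices.

```python
from statistics import mean

# Per-dataset nDCG@10 (PG has no accuracy rows on this page, so it is omitted).
ndcg10 = {
    "BEIR/fiqa": {"jina": 0.121, "voyage": 0.119},
    "BEIR/scifact": {"jina": 0.832, "voyage": 0.882},
}

# Per-dataset mean latency in milliseconds.
mean_latency_ms = {
    "BEIR/fiqa": {"jina": 1014, "voyage": 530},
    "BEIR/scifact": {"jina": 1123, "voyage": 667},
    "PG": {"jina": 994, "voyage": 588},
}

# Summary values as reported in the breakdown table.
reported = {
    "jina": {"ndcg10": 0.477, "latency_ms": 1044},
    "voyage": {"ndcg10": 0.501, "latency_ms": 595},
}


def average(table, model):
    """Unweighted mean of one model's values across datasets."""
    return mean(row[model] for row in table.values())


for model in ("jina", "voyage"):
    avg_ndcg = average(ndcg10, model)
    avg_latency = average(mean_latency_ms, model)
    # Allow for rounding to three decimals (nDCG) and whole milliseconds.
    ndcg_ok = abs(avg_ndcg - reported[model]["ndcg10"]) <= 0.001
    latency_ok = round(avg_latency) == reported[model]["latency_ms"]
    print(model, f"{avg_ndcg:.4f}", f"{avg_latency:.1f}", ndcg_ok, latency_ok)
```

Both checks pass for both models, which suggests the summary row weights each dataset equally rather than by query count.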