Voyage 3.5 Lite
Most cost-optimized variant at $0.02 per 1M tokens achieving retrieval quality within 0.3% of Cohere-v4 at 1/6 the cost. Supports quantization options including 32-bit, int8, and binary precision for storage efficiency.
Model Information
- Provider
- Voyage AI
- License
- Proprietary
- Price per 1M tokens
- $0.020
- Dimensions
- 512
- Release Date
- 2025-05-20
- Model Name
- voyage-3.5-lite
- Total Evaluations
- 1918
Performance Record
Wins852 (44.4%)
Losses803 (41.9%)
Ties263 (13.7%)
Wins
Losses
Ties
Performance Overview
ELO ratings by dataset
Voyage 3.5 Lite's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.
Voyage 3.5 Lite - ELO by Dataset
Detailed Metrics
Dataset breakdown
Performance metrics across different benchmark datasets, including accuracy and latency percentiles.
FiQa
ELO 152457.9% WR139W-99L-2T
Accuracy Metrics
- nDCG@5
- 0.708
- nDCG@10
- 0.736
- Recall@5
- 0.687
- Recall@10
- 0.780
Latency Distribution
- Mean
- 16ms
- P50 (Median)
- 16ms
- P90
- 19ms
NorQuAD
ELO 151728.7% WR69W-42L-129T
Latency Distribution
- Mean
- 18ms
- P50 (Median)
- 18ms
- P90
- 21ms
ARCD
ELO 151641.7% WR100W-72L-68T
Accuracy Metrics
- nDCG@5
- 0.928
- nDCG@10
- 0.935
- Recall@5
- 0.960
- Recall@10
- 0.980
Latency Distribution
- Mean
- 13ms
- P50 (Median)
- 13ms
- P90
- 15ms
business reports
ELO 151252.1% WR125W-106L-9T
Latency Distribution
- Mean
- 7ms
- P50 (Median)
- 7ms
- P90
- 8ms
DBPedia
ELO 149847.1% WR112W-111L-15T
Accuracy Metrics
- nDCG@5
- 0.641
- nDCG@10
- 0.632
- Recall@5
- 0.219
- Recall@10
- 0.367
Latency Distribution
- Mean
- 6ms
- P50 (Median)
- 6ms
- P90
- 7ms
PG
ELO 149847.5% WR114W-119L-7T
Latency Distribution
- Mean
- 8ms
- P50 (Median)
- 8ms
- P90
- 9ms
SciFact
ELO 149347.1% WR113W-123L-4T
Accuracy Metrics
- nDCG@5
- 0.670
- nDCG@10
- 0.719
- Recall@5
- 0.718
- Recall@10
- 0.843
Latency Distribution
- Mean
- 12ms
- P50 (Median)
- 11ms
- P90
- 13ms
MSMARCO
ELO 146833.3% WR80W-131L-29T
Accuracy Metrics
- nDCG@5
- 0.994
- nDCG@10
- 0.995
- Recall@5
- 0.122
- Recall@10
- 0.222
Latency Distribution
- Mean
- 10ms
- P50 (Median)
- 10ms
- P90
- 11ms
Compare Models
See how it stacks up
Compare Voyage 3.5 Lite with other top embeddings to understand the differences in performance, accuracy, and latency.