Back to all embeddings

Voyage 3.5 Lite

Most cost-optimized variant at $0.02 per 1M tokens achieving retrieval quality within 0.3% of Cohere-v4 at 1/6 the cost. Supports quantization options including 32-bit, int8, and binary precision for storage efficiency.

Leaderboard Rank
#6
of 13
ELO Rating
1503
#6
Win Rate
44.4%
#6
Accuracy (nDCG@10)
0.803
#5
Latency
11ms
#5

Model Information

Provider
Voyage AI
License
Proprietary
Price per 1M tokens
$0.020
Dimensions
512
Release Date
2025-05-20
Model Name
voyage-3.5-lite
Total Evaluations
1918

Performance Record

Wins852 (44.4%)
Losses803 (41.9%)
Ties263 (13.7%)
Wins
Losses
Ties

Performance Overview

ELO ratings by dataset

Voyage 3.5 Lite's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.

Voyage 3.5 Lite - ELO by Dataset

Detailed Metrics

Dataset breakdown

Performance metrics across different benchmark datasets, including accuracy and latency percentiles.

FiQa

ELO 152457.9% WR139W-99L-2T

Accuracy Metrics

nDCG@5
0.708
nDCG@10
0.736
Recall@5
0.687
Recall@10
0.780

Latency Distribution

Mean
16ms
P50 (Median)
16ms
P90
19ms

NorQuAD

ELO 151728.7% WR69W-42L-129T

Latency Distribution

Mean
18ms
P50 (Median)
18ms
P90
21ms

ARCD

ELO 151641.7% WR100W-72L-68T

Accuracy Metrics

nDCG@5
0.928
nDCG@10
0.935
Recall@5
0.960
Recall@10
0.980

Latency Distribution

Mean
13ms
P50 (Median)
13ms
P90
15ms

business reports

ELO 151252.1% WR125W-106L-9T

Latency Distribution

Mean
7ms
P50 (Median)
7ms
P90
8ms

DBPedia

ELO 149847.1% WR112W-111L-15T

Accuracy Metrics

nDCG@5
0.641
nDCG@10
0.632
Recall@5
0.219
Recall@10
0.367

Latency Distribution

Mean
6ms
P50 (Median)
6ms
P90
7ms

PG

ELO 149847.5% WR114W-119L-7T

Latency Distribution

Mean
8ms
P50 (Median)
8ms
P90
9ms

SciFact

ELO 149347.1% WR113W-123L-4T

Accuracy Metrics

nDCG@5
0.670
nDCG@10
0.719
Recall@5
0.718
Recall@10
0.843

Latency Distribution

Mean
12ms
P50 (Median)
11ms
P90
13ms

MSMARCO

ELO 146833.3% WR80W-131L-29T

Accuracy Metrics

nDCG@5
0.994
nDCG@10
0.995
Recall@5
0.122
Recall@10
0.222

Latency Distribution

Mean
10ms
P50 (Median)
10ms
P90
11ms

Compare Models

See how it stacks up

Compare Voyage 3.5 Lite with other top embeddings to understand the differences in performance, accuracy, and latency.