Cohere Embed v3
Optimized for English text with 512 token context length supporting retrieval, classification, and clustering tasks. Requires input_type specification distinguishing between search documents and queries for optimal performance.
Model Information
- Provider
- Cohere
- License
- Proprietary
- Price per 1M tokens
- $0.100
- Dimensions
- 1024
- Release Date
- 2024-02-07
- Model Name
- cohere-embed-v3
- Total Evaluations
- 1917
Performance Record
Wins786 (41.0%)
Losses915 (47.7%)
Ties216 (11.3%)
Wins
Losses
Ties
Performance Overview
ELO ratings by dataset
Cohere Embed v3's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.
Cohere Embed v3 - ELO by Dataset
Detailed Metrics
Dataset breakdown
Performance metrics across different benchmark datasets, including accuracy and latency percentiles.
MSMARCO
ELO 151947.9% WR115W-82L-43T
Accuracy Metrics
- nDCG@5
- 1.000
- nDCG@10
- 0.996
- Recall@5
- 0.123
- Recall@10
- 0.218
Latency Distribution
- Mean
- 6ms
- P50 (Median)
- 6ms
- P90
- 7ms
PG
ELO 151954.6% WR131W-101L-8T
Latency Distribution
- Mean
- 6ms
- P50 (Median)
- 6ms
- P90
- 7ms
business reports
ELO 151152.1% WR125W-105L-10T
Latency Distribution
- Mean
- 6ms
- P50 (Median)
- 6ms
- P90
- 7ms
SciFact
ELO 150751.2% WR123W-111L-6T
Accuracy Metrics
- nDCG@5
- 0.729
- nDCG@10
- 0.769
- Recall@5
- 0.788
- Recall@10
- 0.900
Latency Distribution
- Mean
- 8ms
- P50 (Median)
- 8ms
- P90
- 9ms
DBPedia
ELO 149644.7% WR106W-110L-21T
Accuracy Metrics
- nDCG@5
- 0.634
- nDCG@10
- 0.619
- Recall@5
- 0.219
- Recall@10
- 0.353
Latency Distribution
- Mean
- 6ms
- P50 (Median)
- 6ms
- P90
- 7ms
NorQuAD
ELO 148924.6% WR59W-75L-106T
Latency Distribution
- Mean
- 9ms
- P50 (Median)
- 9ms
- P90
- 10ms
FiQa
ELO 148344.6% WR107W-130L-3T
Accuracy Metrics
- nDCG@5
- 0.641
- nDCG@10
- 0.650
- Recall@5
- 0.639
- Recall@10
- 0.678
Latency Distribution
- Mean
- 7ms
- P50 (Median)
- 6ms
- P90
- 8ms
ARCD
ELO 13838.3% WR20W-201L-19T
Accuracy Metrics
- nDCG@5
- 0.349
- nDCG@10
- 0.398
- Recall@5
- 0.380
- Recall@10
- 0.520
Latency Distribution
- Mean
- 11ms
- P50 (Median)
- 11ms
- P90
- 12ms
Compare Models
See how it stacks up
Compare Cohere Embed v3 with other top embeddings to understand the differences in performance, accuracy, and latency.