Qwen3 Embedding 0.6B vs Gemini text-embedding-004
Detailed comparison between Qwen3 Embedding 0.6B and Gemini text-embedding-004. See which embedding best meets your accuracy and performance needs.
Model Comparison
Qwen3 Embedding 0.6B takes the lead.
Both Qwen3 Embedding 0.6B and Gemini text-embedding-004 are powerful embedding models designed to improve retrieval quality in RAG applications. However, their performance characteristics differ in important ways.
Why Qwen3 Embedding 0.6B:
- Qwen3 Embedding 0.6B has 31 higher ELO rating
- Qwen3 Embedding 0.6B delivers better accuracy (nDCG@10: 0.751 vs 0.585)
- Gemini text-embedding-004 is 26962ms faster on average
- Qwen3 Embedding 0.6B has a 9.3% higher win rate
Overview
Key metrics
ELO Rating
Overall ranking quality
Qwen3 Embedding 0.6B
Gemini text-embedding-004
Win Rate
Head-to-head performance
Qwen3 Embedding 0.6B
Gemini text-embedding-004
Accuracy (nDCG@10)
Ranking quality metric
Qwen3 Embedding 0.6B
Gemini text-embedding-004
Average Latency
Response time
Qwen3 Embedding 0.6B
Gemini text-embedding-004
Visual Performance Analysis
Performance
ELO Rating Comparison
Win/Loss/Tie Breakdown
Accuracy Across Datasets (nDCG@10)
Latency Distribution (ms)
Breakdown
How the models stack up
| Metric | Qwen3 Embedding 0.6B | Gemini text-embedding-004 | Description |
|---|---|---|---|
| Overall Performance | |||
| ELO Rating | 1478 | 1447 | Overall ranking quality based on pairwise comparisons |
| Win Rate | 37.3% | 28.0% | Percentage of comparisons won against other models |
| Pricing & Availability | |||
| Price per 1M tokens | $0.010 | $0.020 | Cost per million tokens processed |
| Release Date | 2025-06-06 | 2024-05-14 | Model release date |
| Accuracy Metrics | |||
| Avg nDCG@10 | 0.751 | 0.585 | Normalized discounted cumulative gain at position 10 |
| Performance Metrics | |||
| Avg Latency | 70062ms | 43100ms | Average response time across all datasets |
Dataset Performance
By field
Comprehensive comparison of accuracy metrics (nDCG, Recall) and latency percentiles for each benchmark dataset.
PG
| Metric | Qwen3 Embedding 0.6B | Gemini text-embedding-004 | Description |
|---|---|---|---|
| Accuracy Metrics | |||
| Latency Metrics | |||
| Mean | 77697ms | 76115ms | Average response time |
| P50 | 76143ms | 74593ms | 50th percentile (median) |
| P90 | 89352ms | 87532ms | 90th percentile |
business reports
| Metric | Qwen3 Embedding 0.6B | Gemini text-embedding-004 | Description |
|---|---|---|---|
| Accuracy Metrics | |||
| Latency Metrics | |||
| Mean | 15599ms | 9184ms | Average response time |
| P50 | 15287ms | 9000ms | 50th percentile (median) |
| P90 | 17939ms | 10562ms | 90th percentile |
DBPedia
| Metric | Qwen3 Embedding 0.6B | Gemini text-embedding-004 | Description |
|---|---|---|---|
| Accuracy Metrics | |||
| nDCG@5 | 0.549 | 0.536 | Ranking quality at top 5 results |
| nDCG@10 | 0.556 | 0.517 | Ranking quality at top 10 results |
| Recall@5 | 0.216 | 0.200 | % of relevant docs in top 5 |
| Recall@10 | 0.350 | 0.304 | % of relevant docs in top 10 |
| Latency Metrics | |||
| Mean | 67654ms | 59127ms | Average response time |
| P50 | 66301ms | 57944ms | 50th percentile (median) |
| P90 | 77802ms | 67996ms | 90th percentile |
FiQa
| Metric | Qwen3 Embedding 0.6B | Gemini text-embedding-004 | Description |
|---|---|---|---|
| Accuracy Metrics | |||
| nDCG@5 | 0.620 | 0.613 | Ranking quality at top 5 results |
| nDCG@10 | 0.647 | 0.649 | Ranking quality at top 10 results |
| Recall@5 | 0.590 | 0.645 | % of relevant docs in top 5 |
| Recall@10 | 0.680 | 0.748 | % of relevant docs in top 10 |
| Latency Metrics | |||
| Mean | 212205ms | 62373ms | Average response time |
| P50 | 207961ms | 61126ms | 50th percentile (median) |
| P90 | 244036ms | 71729ms | 90th percentile |
SciFact
| Metric | Qwen3 Embedding 0.6B | Gemini text-embedding-004 | Description |
|---|---|---|---|
| Accuracy Metrics | |||
| nDCG@5 | 0.666 | 0.722 | Ranking quality at top 5 results |
| nDCG@10 | 0.686 | 0.745 | Ranking quality at top 10 results |
| Recall@5 | 0.723 | 0.797 | % of relevant docs in top 5 |
| Recall@10 | 0.783 | 0.860 | % of relevant docs in top 10 |
| Latency Metrics | |||
| Mean | 102019ms | 65774ms | Average response time |
| P50 | 99979ms | 64459ms | 50th percentile (median) |
| P90 | 117322ms | 75640ms | 90th percentile |
MSMARCO
| Metric | Qwen3 Embedding 0.6B | Gemini text-embedding-004 | Description |
|---|---|---|---|
| Accuracy Metrics | |||
| nDCG@5 | 0.997 | 0.979 | Ranking quality at top 5 results |
| nDCG@10 | 0.992 | 0.977 | Ranking quality at top 10 results |
| Recall@5 | 0.122 | 0.118 | % of relevant docs in top 5 |
| Recall@10 | 0.215 | 0.209 | % of relevant docs in top 10 |
| Latency Metrics | |||
| Mean | 65717ms | 62416ms | Average response time |
| P50 | 64403ms | 61168ms | 50th percentile (median) |
| P90 | 75575ms | 71778ms | 90th percentile |
NorQuAD
| Metric | Qwen3 Embedding 0.6B | Gemini text-embedding-004 | Description |
|---|---|---|---|
| Accuracy Metrics | |||
| Latency Metrics | |||
| Mean | 12763ms | 5693ms | Average response time |
| P50 | 12508ms | 5579ms | 50th percentile (median) |
| P90 | 14677ms | 6547ms | 90th percentile |
ARCD
| Metric | Qwen3 Embedding 0.6B | Gemini text-embedding-004 | Description |
|---|---|---|---|
| Accuracy Metrics | |||
| nDCG@5 | 0.865 | 0.030 | Ranking quality at top 5 results |
| nDCG@10 | 0.872 | 0.036 | Ranking quality at top 10 results |
| Recall@5 | 0.880 | 0.040 | % of relevant docs in top 5 |
| Recall@10 | 0.900 | 0.060 | % of relevant docs in top 10 |
| Latency Metrics | |||
| Mean | 6841ms | 4118ms | Average response time |
| P50 | 6704ms | 4036ms | 50th percentile (median) |
| P90 | 7867ms | 4736ms | 90th percentile |
Explore More
Compare more embeddings
See how all embedding models stack up. Compare OpenAI, Cohere, Jina AI, Voyage, and more. View comprehensive benchmarks, compare performance metrics, and find the perfect embedding for your RAG application.