Gemini 3 Flash
Frontier intelligence built for speed at a fraction of the cost. Achieves 90.4% on GPQA Diamond and 81.2% on MMMU Pro with Pro-grade reasoning at Flash-level latency. 3x faster than 2.5 Pro while using 30% fewer tokens on average. Ideal for agentic workflows, coding (78% on SWE-bench Verified), and multimodal reasoning with 1M token context.
Model Information
- Provider
- License
- Proprietary
- Input Price per 1M
- $0.50
- Output Price per 1M
- $3.00
- Context Window
- 1049K
- Release Date
- 2025-12-17
- Model Name
- gemini-3-flash-preview
- Total Evaluations
- 900
Performance Record
Wins549 (61.0%)
Losses223 (24.8%)
Ties128 (14.2%)
Wins
Losses
Ties
Performance Overview
ELO ratings by dataset
Gemini 3 Flash's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.
Gemini 3 Flash - ELO by Dataset
Detailed Metrics
Dataset breakdown
Performance metrics across different benchmark datasets, including accuracy and latency percentiles.
PG
ELO 175464.0% WR192W-98L-10T
Quality Metrics
- Correctness
- 5.00
- Faithfulness
- 5.00
- Grounding
- 5.00
- Relevance
- 5.00
- Completeness
- 5.00
- Overall
- 5.00
Latency Distribution
- Mean
- 9444ms
- Min
- 5346ms
- Max
- 12549ms
MSMARCO
ELO 161468.0% WR204W-64L-32T
Quality Metrics
- Correctness
- 4.83
- Faithfulness
- 4.87
- Grounding
- 4.87
- Relevance
- 5.00
- Completeness
- 4.90
- Overall
- 4.89
Latency Distribution
- Mean
- 6852ms
- Min
- 3389ms
- Max
- 9837ms
SciFact
ELO 145451.0% WR153W-61L-86T
Quality Metrics
- Correctness
- 5.00
- Faithfulness
- 5.00
- Grounding
- 5.00
- Relevance
- 4.97
- Completeness
- 4.83
- Overall
- 4.96
Latency Distribution
- Mean
- 7110ms
- Min
- 3784ms
- Max
- 18224ms
Compare Models
See how it stacks up
Compare Gemini 3 Flash with other top llms to understand the differences in performance, accuracy, and latency.