Qwen3 30B A3B Thinking
Supports 119 languages, enabling multilingual RAG without translation overhead. In thinking mode, the model emits <think> blocks that expose its reasoning over the retrieved documents. Released under the Apache 2.0 license.
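When post-processing responses in a RAG pipeline, it can help to separate the reasoning from the final answer. The following is a minimal sketch, assuming the raw completion is a plain string whose reasoning ends at a closing </think> tag. The opening <think> tag is treated as optional, since some chat templates supply it in the prompt rather than in the output. The function name split_thinking is hypothetical and not part of any Qwen API.

```python
def split_thinking(text: str) -> tuple[str, str]:
    """Split a completion into (reasoning, answer).

    Assumption: the reasoning section ends at the first "</think>" tag.
    If no closing tag is present, the whole text is treated as the answer.
    """
    close_tag = "</think>"
    idx = text.find(close_tag)
    if idx == -1:
        return "", text.strip()

    # Everything before the closing tag is reasoning; drop an optional opening tag.
    reasoning = text[:idx].strip().removeprefix("<think>").strip()
    answer = text[idx + len(close_tag):].strip()
    return reasoning, answer


reasoning, answer = split_thinking("<think>Doc 2 mentions the date.</think>The answer is 2025.")
print(answer)  # The answer is 2025.
```

Keeping the reasoning separate makes it possible to log it for debugging while showing only the answer to end users.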
Model Information
- Provider: Alibaba/Qwen
- License: Open Source (Apache 2.0)
- Input Price per 1M tokens: $0.05
- Output Price per 1M tokens: $0.34
- Context Window: 33K
- Release Date: 2025-08-28
- Model Name: qwen3-30b-a3b-thinking-2507
- Total Evaluations: 810
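The listed prices translate into a per-request cost as follows. This is a rough sketch: the token counts are made-up illustration values, the function name estimate_cost is hypothetical, and it assumes reasoning tokens are billed at the output rate, which may differ by provider.

```python
# Prices from the model information above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.05
OUTPUT_PRICE_PER_M = 0.34


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request.

    Assumption: output_tokens includes any reasoning tokens.
    """
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Illustrative request: 20,000 prompt tokens (retrieved documents) and 2,000 generated tokens.
cost = estimate_cost(20_000, 2_000)
print(f"${cost:.5f}")  # $0.00168
```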
Performance Record
Wins: 258 (31.9%)
Losses: 443 (54.7%)
Ties: 109 (13.5%)
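The overall record is the sum of the per-dataset records reported further down, and the win rate counts ties in the denominator (wins divided by all evaluations). A short sketch that reproduces the figures from this page; the variable and function names are illustrative only.

```python
# Per-dataset (wins, losses, ties) as reported on this page.
records = {
    "SciFact": (106, 100, 64),
    "MSMARCO": (91, 138, 41),
    "PG": (61, 205, 4),
}


def win_rate(wins: int, losses: int, ties: int) -> float:
    """Win rate in percent, with ties counted as evaluations."""
    return 100 * wins / (wins + losses + ties)


# Column-wise sum gives the overall record.
totals = tuple(sum(col) for col in zip(*records.values()))
print(totals)                        # (258, 443, 109)
print(round(win_rate(*totals), 1))   # 31.9
```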
Performance Overview
ELO ratings by dataset
Qwen3 30B A3B Thinking's ELO rating varies by benchmark dataset, from 1456 on SciFact and 1447 on MSMARCO down to 1166 on PG.
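To give a sense of what a rating gap means, the sketch below uses the standard Elo expected-score formula with a scale of 400. This page does not state how its ratings are computed, so the formula, the scale, and the opponent rating of 1500 are assumptions for illustration only.

```python
def expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Standard Elo expected score of player A against player B.

    Assumption: the conventional logistic formula with scale 400;
    the leaderboard's actual rating method is not documented here.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))


# Hypothetical opponent rated 1500, compared against the per-dataset ratings.
for dataset, rating in {"SciFact": 1456, "MSMARCO": 1447, "PG": 1166}.items():
    print(dataset, round(expected_score(rating, 1500), 3))
```

Under this formula, equal ratings give an expected score of 0.5, and the lower-rated side's expectation shrinks as the gap grows.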
Qwen3 30B A3B Thinking - ELO by Dataset
Detailed Metrics
Dataset breakdown
Performance metrics for each benchmark dataset, including quality scores and latency statistics (mean, minimum, maximum).
SciFact
ELO 1456 | 39.3% WR | 106W-100L-64T
Quality Metrics
- Correctness: 4.97
- Faithfulness: 4.97
- Grounding: 4.93
- Relevance: 5.00
- Completeness: 4.83
- Overall: 4.94
Latency Distribution
- Mean: 8384ms
- Min: 2185ms
- Max: 19414ms
MSMARCO
ELO 1447 | 33.7% WR | 91W-138L-41T
Quality Metrics
- Correctness: 4.90
- Faithfulness: 4.90
- Grounding: 4.90
- Relevance: 5.00
- Completeness: 4.80
- Overall: 4.90
Latency Distribution
- Mean: 12522ms
- Min: 1541ms
- Max: 49799ms
PG
ELO 1166 | 22.6% WR | 61W-205L-4T
Quality Metrics
- Correctness: 4.90
- Faithfulness: 4.87
- Grounding: 4.87
- Relevance: 4.93
- Completeness: 4.77
- Overall: 4.87
Latency Distribution
- Mean: 16030ms
- Min: 3483ms
- Max: 44237ms
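The Overall scores above are consistent with an unweighted mean of the five individual quality metrics, rounded to two decimals. The page does not state this definition, so treat it as an inference from the numbers; the sketch below simply checks it against the reported values.

```python
# (correctness, faithfulness, grounding, relevance, completeness) and reported overall.
quality = {
    "SciFact": ((4.97, 4.97, 4.93, 5.00, 4.83), 4.94),
    "MSMARCO": ((4.90, 4.90, 4.90, 5.00, 4.80), 4.90),
    "PG": ((4.90, 4.87, 4.87, 4.93, 4.77), 4.87),
}


def overall(scores: tuple[float, ...]) -> float:
    """Unweighted mean of the metric scores, rounded to two decimals (assumed definition)."""
    return round(sum(scores) / len(scores), 2)


for dataset, (scores, reported) in quality.items():
    print(dataset, overall(scores), reported)
```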
Compare Models
See how it stacks up
Compare Qwen3 30B A3B Thinking with other top LLMs to see how they differ in performance, accuracy, and latency.