GLM 4.6

Native bilingual English/Chinese support for cross-lingual RAG without translation overhead. MIT license enables fine-tuning on proprietary knowledge bases with self-hosting via vLLM/SGLang.

Leaderboard Rank

of 11

ELO Rating

1508

Win Rate

40.2%

Latency

33116ms

#11

Model Information

Provider: Zhipu AI
License: Open Source
Input Price per 1M: $0.40
Output Price per 1M: $1.75
Context Window: 203K
Release Date: 2025-09-30
Model Name: glm-4.6
Total Evaluations: 900

Performance Record

Wins362 (40.2%)

Losses401 (44.6%)

Ties137 (15.2%)

Wins

Losses

Ties

Performance Overview

ELO ratings by dataset

GLM 4.6's ELO performance varies across different benchmark datasets, showing its strengths in specific domains.

GLM 4.6 - ELO by Dataset

Detailed Metrics

Dataset breakdown

Performance metrics across different benchmark datasets, including accuracy and latency percentiles.

SciFact

ELO 159030.7% WR92W-135L-73T

Quality Metrics

Correctness: 4.67
Faithfulness: 4.83
Grounding: 4.83
Relevance: 4.90
Completeness: 4.60
Overall: 4.77

Latency Distribution

Mean: 27880ms
Min: 3248ms
Max: 68513ms

MSMARCO

ELO 154541.0% WR123W-130L-47T

Quality Metrics

Correctness: 4.83
Faithfulness: 4.80
Grounding: 4.80
Relevance: 4.93
Completeness: 4.77
Overall: 4.83

Latency Distribution

Mean: 34694ms
Min: 9198ms
Max: 69527ms

PG

ELO 138849.0% WR147W-136L-17T

Quality Metrics

Correctness: 4.93
Faithfulness: 4.97
Grounding: 4.93
Relevance: 4.97
Completeness: 4.67
Overall: 4.89

Latency Distribution

Mean: 36774ms
Min: 9584ms
Max: 104257ms

Compare Models

See how it stacks up

Compare GLM 4.6 with other top llms to understand the differences in performance, accuracy, and latency.

vs GPT-5.1

OpenAI

ELO1743

Win Rate68.8%

Compare now →

vs Grok 4 Fast

xAI

ELO1645

Win Rate58.3%

Compare now →

vs Gemini 3 Flash

Google

ELO1607

Win Rate61.0%

Compare now →

View Full Leaderboard

Agentset

GLM 4.6

Model Information

Performance Record

Performance Overview

ELO ratings by dataset

GLM 4.6 - ELO by Dataset

Detailed Metrics

Dataset breakdown

SciFact

Quality Metrics

Latency Distribution

MSMARCO

Quality Metrics

Latency Distribution

PG

Quality Metrics

Latency Distribution

Compare Models

See how it stacks up

vs GPT-5.1

vs Grok 4 Fast

vs Gemini 3 Flash

Agentset

Product

Developers

Compare

Leaderboard

Enterprise

Company

Content

Trust

Agentset

Product

Developers

Compare

Leaderboard

Enterprise

Company

Content

Trust