Z.AI: GLM 4.6

by Z AI

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and...

Release Date
September 30, 2025
Context Window
202,752 tokens
Max Output
202,752 tokens
Input Modalities
text

Coding Performance

Coding benchmarks and performance metrics for development tasks

Coding Benchmarks

Artificial Analysis Coding Index

43.8%

Overall coding capability score

LiveCodeBench

69.5%

Real-world coding task performance

SciCode

38.4%

Scientific computing & algorithms

TerminalBench Hard

23.4%

CLI & terminal command generation

LCR (Long Context Reasoning)

54.3%

Understanding large codebases

IFBench

43.4%

Instruction following accuracy

Speed & Efficiency

Output Speed

1168.7 tokens/sec

Median output tokens per second

Time to First Token

0.3s

Median responsiveness

Performance metrics from Artificial Analysis

Real-World Usage

Real-world usage statistics from the Kilo Code community

Loading chart...
Loading rankings...

Real-world metrics from the Kilo Code Leaderboard

Pricing

Cost per 1 million tokens

Input Tokens
$0.60
per 1M tokens
Output Tokens
$2.19
per 1M tokens
Blended (3:1 ratio)
$0.99
per 1M tokens

Example Cost

Analyzing a 10,000 line codebase (≈40k input tokens, 10k output tokens) costs approximately $0.0460

Coding Capabilities

Features and parameters relevant to coding tasks

Coding Features

Function Calling
Can call external functions/APIs
Tool Choice
Control over function selection
Structured Outputs
JSON schema validation
Reasoning Tokens
Extended thinking for complex problems

Pricing details from OpenRouter

Technical Details

Architecture and implementation specifications

Model ID
z-ai/glm-4.6
Created
September 30, 2025
Tokenizer
Other
Input Modalities
text
Context Window
202,752 tokens
Max Completion Tokens
202,752 tokens
Input Price
$0.60 per 1M tokens
Output Price
$2.19 per 1M tokens
Content Moderation
Disabled

Ready to try Z.AI: GLM 4.6?

Install Kilo Code and start using Z.AI: GLM 4.6 for your coding projects today. Choose from 400+ AI models with complete freedom.