Z.ai: GLM 4.6

by z-ai

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...

29.5 Coding Index*39.4 tokens/sec*$0.43/1M input tokens
Release Date
September 30, 2025
Context Window
202,752 tokens
Max Output
131,072 tokens
Input Modalities
text

Try Z.ai: GLM 4.6 in Kilo Code

Experience this model with the most popular open source coding agent. Free to start, pay only for AI usage. Use in popular IDEs like VS Code, JetBrains, command line, or cloud agents.

3M+

Downloads

500+

models supported

Free

to Start

Access 500+ models including Z.ai: GLM 4.6 and many more in Kilo Code

Benchmarking Z.ai: GLM 4.6

Coding Performance

Coding benchmarks and performance metrics for development tasks

Coding Benchmarks

BenchmarkScore
AA Coding Index29.5%
LiveCodeBench69.5%
SciCode38.4%
TerminalBench Hard25.0%
LCR54.3%
IFBench43.4%

Speed & Efficiency

MetricValue
Output Speed39 tok/s

Performance metrics from Artificial Analysis

Real-World Usage

Real-world usage statistics from the Kilo Code community

Weekly Token Usage

Mode Rankings (Last Week)

Where this model ranks for each built-in mode

Code

Write, modify, and refactor code

No data

Ask

Get answers and explanations

No data

Debug

Diagnose and fix software issues

#94

Orchestrator

Coordinate tasks across multiple modes

No data

Real-world metrics from the Kilo Code Leaderboard

Pricing

Cost per 1 million tokens

Input Tokens
$0.43
per 1M tokens
Output Tokens
$1.74
per 1M tokens

Example Cost

Analyzing a 10,000 line codebase (≈40k input tokens, 10k output tokens) costs approximately $0.0346

Coding Capabilities

Features and parameters relevant to coding tasks

Coding Features

Function Calling
Can call external functions/APIs
Tool Choice
Control over function selection
Structured Outputs
JSON schema validation
Reasoning Tokens
Extended thinking for complex problems

Pricing details from OpenRouter

Technical Details

Architecture and implementation specifications

Model ID
z-ai/glm-4.6
Artificial Analysis Slug
glm-4-6-reasoning
Created
September 30, 2025
Tokenizer
Other
Input Modalities
text
Context Window
202,752 tokens
Max Completion Tokens
131,072 tokens
Input Price
$0.43 per 1M tokens
Output Price
$1.74 per 1M tokens
Cache Read Price
$0.08 per 1M tokens
Content Moderation
Disabled

Ready to try Z.ai: GLM 4.6?

Install Kilo Code and start using Z.ai: GLM 4.6 for your coding projects today. Choose from 500+ AI models with complete freedom.

  1. 1.

    Install Kilo Code

    Get the extension from VS Code Marketplace, JetBrains Plugin Repository, or the CLI.

  2. 2.

    Open the model selector

    Click the model name in the Kilo Code chat panel to open the selector.

  3. 3.

    Choose your model

    Search or browse to find and select your preferred model.

  4. 4.

    Start coding

    Use Code, Ask, Debug, or Plan mode — the model is ready immediately.