DeepSeek: DeepSeek V4 Flash
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...
Try DeepSeek: DeepSeek V4 Flash in Kilo Code
Experience this model with the most popular open source coding agent. Free to start, pay only for AI usage. Use in popular IDEs like VS Code, JetBrains, command line, or cloud agents.
Access 500+ models including DeepSeek: DeepSeek V4 Flash and many more in Kilo Code
Benchmarking DeepSeek: DeepSeek V4 Flash
Coding Performance
Coding benchmarks and performance metrics for development tasks
Coding Benchmarks
| Benchmark | Score |
|---|---|
| AA Coding Index | 39.8% |
| SciCode | 42.0% |
| TerminalBench Hard | 38.6% |
| LCR | 62.7% |
| IFBench | 73.5% |
Performance metrics from Artificial Analysis
OpenClaw Benchmarks
PinchBench measures how DeepSeek: DeepSeek V4 Flash performs on real OpenClaw agent tasks: multi-step execution, tool use, recovery, latency, and cost.
Average score: ranked #12 of 50 official models
Average time: averaged over 6 runs, reported per OpenClaw task
Average cost: reported per benchmark run
Category breakdown
Best verified PinchBench v2 run by OpenClaw task family.
Top task results
Highest-scoring benchmark tasks from the same submission.
Autonomous task execution
DeepSeek: DeepSeek V4 Flash shows strong average success across OpenClaw-style benchmark runs, useful for recurring research, browser, and file-based automations.
Tool use and recovery
PinchBench tasks stress multi-step planning, tool calls, and judge-verified completion rather than single prompt coding snippets.
Agent workflow fit
Its deliberate average runtime and premium run cost help set expectations for long-running KiloClaw agents and production workflows.
Agentic benchmarks from the PinchBench Leaderboard
Real-World Usage
Real-world usage statistics from the Kilo Code community
Weekly Token Usage
Mode Rankings (Last Week)
Where this model ranks for each built-in mode
Code
Write, modify, and refactor code
Ask
Get answers and explanations
Debug
Diagnose and fix software issues
Orchestrator
Coordinate tasks across multiple modes
Real-world metrics from the Kilo Code Leaderboard
Pricing
Cost per 1 million tokens
Example Cost
Analyzing a 10,000 line codebase (≈40k input tokens, 10k output tokens) costs approximately $0.0067
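The arithmetic behind this example can be reproduced with a minimal sketch. It assumes the input and output prices listed under Technical Details on this page ($0.11 and $0.22 per 1M tokens); `estimate_cost` is a hypothetical helper, not part of any Kilo Code or OpenRouter API. With those rounded prices the result is $0.0066, slightly below the $0.0067 quoted above; the small gap presumably comes from rounding in the displayed prices or token estimates.

```python
# Prices in USD per 1 million tokens, as listed under Technical Details.
INPUT_PRICE_PER_M = 0.11
OUTPUT_PRICE_PER_M = 0.22


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# The page's example: about 40k input tokens and 10k output tokens.
cost = estimate_cost(40_000, 10_000)
print(f"${cost:.4f}")
```

Swapping in other token counts gives a rough budget for larger codebases, as long as the listed prices still apply.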
Coding Capabilities
Features and parameters relevant to coding tasks
Coding Features
Pricing details from OpenRouter
Technical Details
Architecture and implementation specifications
- Model ID: deepseek/deepseek-v4-flash
- Artificial Analysis Slug: deepseek-v4-flash-high
- Created: April 24, 2026
- Tokenizer: DeepSeek
- Input Modalities: text
- Context Window: 1,048,575 tokens
- Input Price: $0.11 per 1M tokens
- Output Price: $0.22 per 1M tokens
- Cache Read Price: $0.02 per 1M tokens
- Content Moderation: Disabled
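As a rough illustration of how the context window and cache read price in the list above might be used when planning a request, here is a minimal sketch. It rests on two assumptions that this page does not confirm: that prompt and output tokens share the same 1,048,575-token window, and that cached prompt tokens are billed at the cache read price while the rest are billed at the input price. Both function names are hypothetical.

```python
CONTEXT_WINDOW = 1_048_575       # tokens, from the list above
INPUT_PRICE_PER_M = 0.11         # USD per 1M input tokens
CACHE_READ_PRICE_PER_M = 0.02    # USD per 1M cached input tokens


def fits_context(prompt_tokens: int, max_output_tokens: int) -> bool:
    """Assumption: prompt and output tokens share one context window."""
    return prompt_tokens + max_output_tokens <= CONTEXT_WINDOW


def input_cost(prompt_tokens: int, cached_tokens: int = 0) -> float:
    """Assumption: cached tokens are billed at the cache read price,
    and the remaining prompt tokens at the regular input price."""
    if not 0 <= cached_tokens <= prompt_tokens:
        raise ValueError("cached_tokens must be between 0 and prompt_tokens")
    uncached = prompt_tokens - cached_tokens
    total = uncached * INPUT_PRICE_PER_M + cached_tokens * CACHE_READ_PRICE_PER_M
    return total / 1_000_000


print(fits_context(1_000_000, 48_575))              # request fills the window exactly
print(f"${input_cost(1_000_000):.2f}")              # no cache hits
print(f"${input_cost(1_000_000, 1_000_000):.2f}")   # fully cached prompt
```

Under these assumptions, a prompt that is mostly served from cache costs a fraction of the uncached input price, which matters most for long-context agent runs that resend the same files.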
Ready to try DeepSeek: DeepSeek V4 Flash?
Install Kilo Code and start using DeepSeek: DeepSeek V4 Flash for your coding projects today. Choose from 500+ AI models with complete freedom.
1. Install Kilo Code: Get the extension from VS Code Marketplace, JetBrains Plugin Repository, or the CLI.
2. Open the model selector: Click the model name in the Kilo Code chat panel to open the selector.
3. Choose your model: Search or browse to find and select your preferred model.
4. Start coding: Use Code, Ask, Debug, or Plan mode. The model is ready immediately.