Skip to main content

DeepSeek: DeepSeek V4 Flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

39.8 Coding Index*80.1% PinchBench§#14 Code Mode$0.11/1M input tokens
Release Date
April 24, 2026
Context Window
1,048,575 tokens
Input Modalities
text

Try DeepSeek: DeepSeek V4 Flash in Kilo Code

Experience this model with the most popular open source coding agent. Free to start, pay only for AI usage. Use in popular IDEs like VS Code, JetBrains, command line, or cloud agents.

3M+

Downloads

500+

models supported

Free

to Start

Access 500+ models including DeepSeek: DeepSeek V4 Flash and many more in Kilo Code

Benchmarking DeepSeek: DeepSeek V4 Flash

Coding Performance

Coding benchmarks and performance metrics for development tasks

Coding Benchmarks

BenchmarkScore
AA Coding Index39.8%
SciCode42.0%
TerminalBench Hard38.6%
LCR62.7%
IFBench73.5%

Performance metrics from Artificial Analysis

PinchBench data · refreshed daily

OpenClaw Benchmarks

PinchBench measures how DeepSeek: DeepSeek V4 Flash performs on real OpenClaw agent tasks: multi-step execution, tool use, recovery, latency, and cost.

Average score

80.1%

#12 of 50 official models

Average time

298m 19s

6 runs · per OpenClaw task

Average cost

$1.302

Per benchmark run

Category breakdown

Best verified PinchBench v2 run by OpenClaw task family.

Productivity97.1% · 5/8 cleared
Writing96.7% · 2/6 cleared
Csv Analysis96.3% · 3/26 cleared
Memory95.0% · 1/2 cleared

Top task results

Highest-scoring benchmark tasks from the same submission.

Analysis
Access Control Log Anomaly Detection
100.0%
Log Analysis
Apache Error Log - Create Error Timeline
100.0%
Csv Analysis
Apple Stock 2014 Best and Worst Days
100.0%
Productivity
Calendar Event Creation
100.0%
Coding
CI/CD Pipeline Debug
100.0%
Writing
Commit Message Writer
100.0%

Autonomous task execution

DeepSeek: DeepSeek V4 Flash shows strong average success across OpenClaw-style benchmark runs, useful for recurring research, browser, and file-based automations.

Tool use and recovery

PinchBench tasks stress multi-step planning, tool calls, and judge-verified completion rather than single prompt coding snippets.

Agent workflow fit

Its deliberate average runtime and premium run cost help set expectations for long-running KiloClaw agents and production workflows.

Agentic benchmarks from the PinchBench Leaderboard

Real-World Usage

Real-world usage statistics from the Kilo Code community

Weekly Token Usage

Mode Rankings (Last Week)

Where this model ranks for each built-in mode

Code

Write, modify, and refactor code

#14

Ask

Get answers and explanations

#21

Debug

Diagnose and fix software issues

#10

Orchestrator

Coordinate tasks across multiple modes

#24

Real-world metrics from the Kilo Code Leaderboard

Pricing

Cost per 1 million tokens

Input Tokens
$0.11
per 1M tokens
Output Tokens
$0.22
per 1M tokens

Example Cost

Analyzing a 10,000 line codebase (≈40k input tokens, 10k output tokens) costs approximately $0.0067

Coding Capabilities

Features and parameters relevant to coding tasks

Coding Features

Function Calling
Can call external functions/APIs
Tool Choice
Control over function selection
Structured Outputs
JSON schema validation
Reasoning Tokens
Extended thinking for complex problems

Pricing details from OpenRouter

Technical Details

Architecture and implementation specifications

Model ID
deepseek/deepseek-v4-flash
Artificial Analysis Slug
deepseek-v4-flash-high
Created
April 24, 2026
Tokenizer
DeepSeek
Input Modalities
text
Context Window
1,048,575 tokens
Input Price
$0.11 per 1M tokens
Output Price
$0.22 per 1M tokens
Cache Read Price
$0.02 per 1M tokens
Content Moderation
Disabled

Ready to try DeepSeek: DeepSeek V4 Flash?

Install Kilo Code and start using DeepSeek: DeepSeek V4 Flash for your coding projects today. Choose from 500+ AI models with complete freedom.

  1. 1.

    Install Kilo Code

    Get the extension from VS Code Marketplace, JetBrains Plugin Repository, or the CLI.

  2. 2.

    Open the model selector

    Click the model name in the Kilo Code chat panel to open the selector.

  3. 3.

    Choose your model

    Search or browse to find and select your preferred model.

  4. 4.

    Start coding

    Use Code, Ask, Debug, or Plan mode — the model is ready immediately.