NVIDIA Coding Models on Kilo Code

Updated May 2026 · 3 models tracked

Nemotron models tuned for efficient reasoning on accelerated infrastructure. NVIDIA’s coding models on Kilo Code include Nemotron 3 Super (free), Nemotron 3 Ultra, and Nemotron 3 Ultra (free). Use them across VS Code, JetBrains IDEs, Cursor, Windsurf, Trae, and the Kilo CLI — with pay-as-you-go pricing and no markup over the underlying provider rates.

Use NVIDIA models in Kilo

Models3in catalog

Top Coding—index

Price$0.50 / 1M ininput range

LicenseOpen + closedposture

Every NVIDIA Coding Model in Kilo Code, Ranked

Sorted by coding-index where published. Click any model for the full review with benchmarks, real-world Kilo usage, and provider-specific pricing.

NVIDIA: Nemotron 3 Super (free)

NVIDIA

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...

Coding index

—

Context

1000K

Price (in / out per 1M)

Free / Free

Speed

—

View Nemotron 3 Super (free) in Kilo

NVIDIA: Nemotron 3 Ultra

NVIDIA

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Coding index

—

Context

1000K

Price (in / out per 1M)

$0.500 / $2.20

Speed

—

View Nemotron 3 Ultra in Kilo

NVIDIA: Nemotron 3 Ultra (free)

NVIDIA

Coding index

—

Context

1000K

Price (in / out per 1M)

Free / Free

Speed

—

View Nemotron 3 Ultra (free) in Kilo

Try NVIDIA models free in Kilo

About NVIDIA

NVIDIA’s Nemotron line reflects the company’s hardware-plus-model approach: models are optimized for fast inference on GPU infrastructure and often released with permissive options for enterprise deployment. For coding, Nemotron is useful when throughput, cost, and deployability matter as much as benchmark rank.

Headquarters

Santa Clara, USA

Founded

1993

License posture

Open + closed

NVIDIA Pricing on Kilo Code

Pay-as-you-go, no markup over the underlying provider rates. Cheapest first.

Model	Input / 1M	Output / 1M	Context	Coding index
NVIDIA: Nemotron 3 Super (free)	Free	Free	1000K	—
NVIDIA: Nemotron 3 Ultra (free)	Free	Free	1000K	—
NVIDIA: Nemotron 3 Ultra	$0.500	$2.20	1000K	—

How to Use NVIDIA Models in Kilo Code

Three ways: hosted in Kilo, locally on your hardware, or through your own provider keys.

Use Hosted in Kilo

The fastest path: install Kilo Code, sign in, pick nvidia from the model picker. No API keys, no markup. Works in VS Code, JetBrains, Cursor, Windsurf, Trae, and the Kilo CLI.

See live model leaderboard →

Bring Your Own Keys

Already have an account with NVIDIA, OpenRouter, AWS Bedrock, Google Vertex, Together AI, or another compatible provider? Plug your key into Kilo Code and keep your existing billing relationship.

BYOK setup guide →

Run Locally

Download NVIDIA open weights from Hugging Face and serve them with Ollama, LM Studio, vLLM, or SGLang. Connect Kilo Code to your local OpenAI-compatible endpoint and keep all prompts on hardware you control.

Local setup guide →

Compare NVIDIA with Other Labs

See coding-model lineups from NVIDIA’s closest competitors.

Coding models by

Google

Maker of Gemini and the Gemma open-weight family

Coding models by

Qwen

Alibaba Cloud model family with strong open-weight coding options

Coding models by

Mistral AI

European lab behind compact open and enterprise coding models

Or browse all open-source coding models →

Why Choose NVIDIA Models for Coding?

Frontier capability

NVIDIA’s flagship is among the strongest coding models on the Kilo Code leaderboard, ranked by Code, Plan, Ask, Debug, and Review usage.

Works everywhere Kilo works

VS Code, Cursor, Windsurf, Trae, JetBrains IDEs (IntelliJ, PyCharm, WebStorm, GoLand, RubyMine, Android Studio), and the Kilo CLI / terminal.

No vendor lock-in

Switch between NVIDIA and 500+ other models with one click. Pay only for what you use, at the underlying provider rate.

NVIDIA Coding Models: FAQ

What is the best NVIDIA model for coding?+

Nemotron Ultra is generally the strongest NVIDIA option, while Nemotron Super targets a more efficient price/performance point.

Can I use NVIDIA models in Kilo Code?+

NVIDIA models work in all Kilo-supported IDEs and the CLI; downloadable variants can also run on compatible local or self-hosted GPU infrastructure.

How much do NVIDIA models cost on Kilo Code?+

NVIDIA model pricing varies by hosted provider, with some free variants available. Kilo Code shows the current token rates with no markup.

Are NVIDIA models open source?+

Mixed. NVIDIA publishes several Nemotron models for download, while hosted variants may have provider-specific terms.

What is the difference between NVIDIA's flagship coding models?+

Ultra variants emphasize capability and reasoning depth. Super variants usually trade some maximum quality for speed and lower serving cost.

Which IDEs and editors support NVIDIA models through Kilo?+

NVIDIA models work in all Kilo-supported IDEs and the CLI; downloadable variants can also run on compatible local or self-hosted GPU infrastructure.

Use NVIDIA Coding Models in Kilo Today

Install Kilo Code and get instant access to NVIDIA: Nemotron 3 Super (free) and 2 other NVIDIA models, plus 500+ frontier and open-source options. Free to start, no credit card required.

Get Started with NVIDIA in Kilo