Skip to main content

NVIDIA Coding Models on Kilo Code

Updated May 2026 · 3 models tracked

Nemotron models tuned for efficient reasoning on accelerated infrastructure. NVIDIA’s coding models on Kilo Code include Nemotron 3 Super (free), Nemotron 3 Ultra, and Nemotron 3 Ultra (free). Use them across VS Code, JetBrains IDEs, Cursor, Windsurf, Trae, and the Kilo CLI — with pay-as-you-go pricing and no markup over the underlying provider rates.

Models3in catalog
Top Codingindex
Price$0.50 / 1M ininput range
LicenseOpen + closedposture

Every NVIDIA Coding Model in Kilo Code, Ranked

Sorted by coding-index where published. Click any model for the full review with benchmarks, real-world Kilo usage, and provider-specific pricing.

#1

NVIDIA: Nemotron 3 Super (free)

NVIDIA

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...

Coding index
Context
1000K
Price (in / out per 1M)
Free / Free
Speed
View Nemotron 3 Super (free) in Kilo
#2

NVIDIA: Nemotron 3 Ultra

NVIDIA

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Coding index
Context
1000K
Price (in / out per 1M)
$0.500 / $2.20
Speed
View Nemotron 3 Ultra in Kilo
#3

NVIDIA: Nemotron 3 Ultra (free)

NVIDIA

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Coding index
Context
1000K
Price (in / out per 1M)
Free / Free
Speed
View Nemotron 3 Ultra (free) in Kilo

About NVIDIA

NVIDIA’s Nemotron line reflects the company’s hardware-plus-model approach: models are optimized for fast inference on GPU infrastructure and often released with permissive options for enterprise deployment. For coding, Nemotron is useful when throughput, cost, and deployability matter as much as benchmark rank.

Headquarters
Santa Clara, USA
Founded
1993
License posture
Open + closed

NVIDIA Pricing on Kilo Code

Pay-as-you-go, no markup over the underlying provider rates. Cheapest first.

ModelInput / 1MOutput / 1MContextCoding index
NVIDIA: Nemotron 3 Super (free)FreeFree1000K
NVIDIA: Nemotron 3 Ultra (free)FreeFree1000K
NVIDIA: Nemotron 3 Ultra$0.500$2.201000K

How to Use NVIDIA Models in Kilo Code

Three ways: hosted in Kilo, locally on your hardware, or through your own provider keys.

1

Use Hosted in Kilo

The fastest path: install Kilo Code, sign in, pick nvidia from the model picker. No API keys, no markup. Works in VS Code, JetBrains, Cursor, Windsurf, Trae, and the Kilo CLI.

See live model leaderboard →
2

Bring Your Own Keys

Already have an account with NVIDIA, OpenRouter, AWS Bedrock, Google Vertex, Together AI, or another compatible provider? Plug your key into Kilo Code and keep your existing billing relationship.

BYOK setup guide →
3

Run Locally

Download NVIDIA open weights from Hugging Face and serve them with Ollama, LM Studio, vLLM, or SGLang. Connect Kilo Code to your local OpenAI-compatible endpoint and keep all prompts on hardware you control.

Local setup guide →

Compare NVIDIA with Other Labs

See coding-model lineups from NVIDIA’s closest competitors.

Why Choose NVIDIA Models for Coding?

Frontier capability

NVIDIA’s flagship is among the strongest coding models on the Kilo Code leaderboard, ranked by Code, Plan, Ask, Debug, and Review usage.

Works everywhere Kilo works

VS Code, Cursor, Windsurf, Trae, JetBrains IDEs (IntelliJ, PyCharm, WebStorm, GoLand, RubyMine, Android Studio), and the Kilo CLI / terminal.

No vendor lock-in

Switch between NVIDIA and 500+ other models with one click. Pay only for what you use, at the underlying provider rate.

NVIDIA Coding Models: FAQ

What is the best NVIDIA model for coding?+

Nemotron Ultra is generally the strongest NVIDIA option, while Nemotron Super targets a more efficient price/performance point.

Can I use NVIDIA models in Kilo Code?+

NVIDIA models work in all Kilo-supported IDEs and the CLI; downloadable variants can also run on compatible local or self-hosted GPU infrastructure.

How much do NVIDIA models cost on Kilo Code?+

NVIDIA model pricing varies by hosted provider, with some free variants available. Kilo Code shows the current token rates with no markup.

Are NVIDIA models open source?+

Mixed. NVIDIA publishes several Nemotron models for download, while hosted variants may have provider-specific terms.

What is the difference between NVIDIA's flagship coding models?+

Ultra variants emphasize capability and reasoning depth. Super variants usually trade some maximum quality for speed and lower serving cost.

Which IDEs and editors support NVIDIA models through Kilo?+

NVIDIA models work in all Kilo-supported IDEs and the CLI; downloadable variants can also run on compatible local or self-hosted GPU infrastructure.

Use NVIDIA Coding Models in Kilo Today

Install Kilo Code and get instant access to NVIDIA: Nemotron 3 Super (free) and 2 other NVIDIA models, plus 500+ frontier and open-source options. Free to start, no credit card required.