Back to Models

NVIDIA: Nemotron Nano 12B 2 VL

nemotron-nano-12b-v2-vl

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...

Modalities

Input

imagetextvideo

Output

text
Pricing
Cost per 1 million tokens
Input
$0.24
Output
$0.72
Model Specs
Context Window
131,072
Max Output
26,215
Release Date
2025-10-28
Knowledge Cutoff
Capabilities
Reasoning
Tool Calling
Vision

Last Updated: 2026-01-31

Provider: