Qwen: Qwen3 VL 30B A3B Instruct
qwen3-vl-30b-a3b-instruct
Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...
Modalities
Input
text, image
Output
text
Pricing
Cost per 1 million tokens
Input
$0.156
Output
$0.624
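As a rough illustration of how the per-million-token prices above translate into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost_usd` and the example token counts are hypothetical and not part of any provider API; the sketch also assumes the listed prices are in US dollars and that billing is linear in token count, with no minimums, caching discounts, or separate image pricing.

```python
# Listed prices from the Pricing section (assumed USD per 1 million tokens).
INPUT_USD_PER_MILLION = 0.156
OUTPUT_USD_PER_MILLION = 0.624


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming simple linear billing.

    This is a hypothetical helper for illustration only; actual billing
    may differ (for example, in how image inputs are tokenized).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 2,000-token reply.
    print(f"{estimate_cost_usd(10_000, 2_000):.6f}")
```

Under these assumptions, output tokens cost four times as much as input tokens, so long generations tend to dominate the bill more than long prompts do.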
Model Specs
Context Window
131,072
Max Output
32,768
Release Date
2025-10-05
Knowledge Cutoff
Capabilities
Reasoning
Tool Calling
Vision
Last Updated: 2025-11-25
Provider: