Back to Models

Qwen: Qwen3 VL 30B A3B Instruct

qwen3-vl-30b-a3b-instruct

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

Modalities

Input

textimage

Output

text
Pricing
Cost per 1 million tokens
Input
$0.156
Output
$0.624
Model Specs
Context Window
131,072
Max Output
32,768
Release Date
2025-10-05
Knowledge Cutoff
Capabilities
Reasoning
Tool Calling
Vision

Last Updated: 2025-11-25

Provider: