Google: Gemma 3 4B
gemma-3-4b-it
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Modalities
Input
text, image
Output
text
Pricing
Cost per 1 million tokens
Input
$0.048
Output
$0.096
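As a rough illustration of how the per-million-token prices above translate into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost` and the constant names are hypothetical and not part of any provider API; only the two prices come from this page. It assumes billing is strictly linear in token count, with no minimums, rounding, or separate image-token pricing.

```python
# Prices in USD per 1 million tokens, copied from the pricing section above.
INPUT_PRICE_PER_MILLION = 0.048
OUTPUT_PRICE_PER_MILLION = 0.096


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes linear per-token billing with no minimum charge.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_PRICE_PER_MILLION
        + output_tokens * OUTPUT_PRICE_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost(10_000, 2_000):.6f}")
```

Under these assumptions, a 10,000-token prompt with a 2,000-token reply comes to roughly $0.00067, and one million tokens each of input and output comes to about $0.144.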
Model Specs
Context Window
96,000
Max Output
96,000
Release Date
2025-03-13
Knowledge Cutoff
2024-10
Capabilities
Reasoning
Tool Calling
Vision
Last Updated: 2025-03-13
Provider: