Back to Models

Google: Gemma 3 4B

gemma-3-4b-it

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Modalities

Input

textimage

Output

text
Pricing
Cost per 1 million tokens
Input
$0.048
Output
$0.096
Model Specs
Context Window
96,000
Max Output
96,000
Release Date
2025-03-13
Knowledge Cutoff
2024-10
Capabilities
Reasoning
Tool Calling
Vision

Last Updated: 2025-03-13

Provider: