Back to Models

Google: Gemma 3 12B

gemma-3-12b-it

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Modalities

Input

textimage

Output

text
Pricing
Cost per 1 million tokens
Input
$0.048
Output
$0.156
Model Specs
Context Window
131,072
Max Output
131,072
Release Date
2025-03-13
Knowledge Cutoff
2024-10
Capabilities
Reasoning
Tool Calling
Vision

Last Updated: 2025-03-13

Provider: