Inception: Mercury

mercury

Mercury is the first diffusion large language model (dLLM). Using a discrete diffusion approach, the model runs 5-10x faster than even speed-optimized models like GPT-4.1 Nano and Claude...

Modalities
Input: text
Output: text
Pricing
Cost per 1 million tokens
Input: $0.30
Output: $0.90
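
As a rough illustration of how the per-million-token prices translate into the cost of a single request, here is a minimal Python sketch. The two rates come from the pricing listed above; the helper name `estimate_cost_usd` and the token counts in the example are hypothetical, and the sketch assumes billing is strictly linear in token count with no minimums, rounding, or discounts.

```python
# Rates from the pricing listed above (USD per 1 million tokens).
INPUT_USD_PER_MILLION = 0.3
OUTPUT_USD_PER_MILLION = 0.9


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: estimate the cost of one request in USD.

    Assumes purely linear per-token billing (an assumption, not a
    statement about how the provider actually invoices).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Example request: 10,000 input tokens and 2,000 output tokens.
    print(f"${estimate_cost_usd(10_000, 2_000):.4f}")
```

Under these assumptions, a request with 10,000 input tokens and 2,000 output tokens comes to about $0.0048.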
Model Specs
Context Window: 128,000 tokens
Max Output: 32,000 tokens
Release Date: 2025-06-26
Knowledge Cutoff:
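
The context window and max output limits above can be combined into a simple token budget. The sketch below is only illustrative: it assumes the 128,000-token context window is shared between the prompt and the generated output, which this page does not state, and the helper name `max_completion_tokens` is hypothetical.

```python
# Limits from the model specs listed above (in tokens).
CONTEXT_WINDOW = 128_000
MAX_OUTPUT = 32_000


def max_completion_tokens(prompt_tokens: int) -> int:
    """Hypothetical helper: largest output budget for a given prompt size.

    Assumes prompt and output share the context window (an assumption,
    not confirmed by the model card).
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Cap by the max output limit and never return a negative budget.
    return max(0, min(MAX_OUTPUT, remaining))
```

Under this assumption, prompts of up to 96,000 tokens leave room for the full 32,000-token output, and longer prompts reduce the available output budget accordingly.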
Capabilities
Reasoning
Tool Calling
Vision

Last Updated: 2025-06-26

Provider: