Gemma 4
Google's most capable open-weights AI model family, introducing unparalleled multimodal reasoning, a massive 1M token context window, and breakthrough efficiency via Sparse-Dense hybrid architectures.
The Gemma 4 release marks a significant paradigm shift in open research. By integrating advanced Mixture-of-Experts (MoE) techniques and deep instruction tuning natively across text, vision, and audio, Gemma 4 achieves state-of-the-art performance while maintaining accessibility for developers and researchers worldwide.
104B
Max Parameters (MoE)
1M
Token Context Window
4.2x
Inference Speedup
89%
MMLU Top Score
📊 State-of-the-Art Benchmarks
Gemma 4 sets a new standard for open-weights models. The chart below illustrates its dominance across key academic benchmarks, significantly outperforming its predecessor (Gemma 3) and remaining highly competitive with closed-source giants. The focus on logical reasoning and coding (HumanEval) shows remarkable gains.
🎯 Multidimensional Capabilities
Unlike text-only models, Gemma 4 was trained from the ground up to understand multiple modalities. This radar chart visualizes the balanced scaling of capabilities. The expansion in Vision and Multilingual tasks represents the largest generational leap, providing a highly versatile foundation for complex agents.
⚙️ Hybrid Architecture Composition
To achieve high parameter counts without prohibitive inference costs, the flagship Gemma 4 104B utilizes a Sparse Mixture-of-Experts (MoE) architecture. This donut chart breaks down the parameter distribution. During generation, only ~18B parameters are active at any given token.
🔍 1M Context Window Retrieval
Context length has been expanded to an unprecedented 1 million tokens for the open-weights ecosystem. This line chart demonstrates the "Needle In A Haystack" retrieval accuracy. Impressively, Gemma 4 maintains near-perfect recall even as the context window approaches its maximum limit.
The Gemma 4 Ecosystem
Gemma 4 9B
Ultra-lightweight, dense architecture optimized for on-device deployment and mobile edge computing.
Gemma 4 27B
The perfect middle ground. Dense architecture providing exceptional coding and reasoning for local workstations.
Gemma 4 104B MoE
The flagship MoE model. Enterprise-grade capabilities rivaling closed systems, requiring server-class hardware.