Gemma 4 — Open-Source Multimodal AI Platform
Text · Image · Audio | Google's Most Capable Open Model | Free Online Access
Meet Gemma 4
Gemma 4 31B
Recommended31B Dense · Text + Image + Audio
Heavy-duty servers & complex reasoning. Highest overall quality — ideal for deep analysis and accurate answers. Full parameter activation, zero information loss.
Which Gemma 4 Model Can Your Hardware Run?
Select your device and configuration to find the best Gemma 4 model for your Mac, NVIDIA GPU, AMD GPU, or CPU.
Speed
~60–80 tok/s
Disk
5.0 GB
RAM Usage
~6 GB
Modality
Text + Image
ollama run gemma4:e4b-it-q8_027B Q4 possible for short conversations, but E4B Q8 better at full context.
Hardware Compatibility
| Model | Min VRAM / RAM | Best For | Install Command |
|---|---|---|---|
| Gemma 4 E2B | 2 GB | Mobile, CPU-only, embedded devices | ollama run gemma4:e2b |
| Gemma 4 E4B | 3 GB | 8–16 GB devices, most laptops | ollama run gemma4:e4b |
| Gemma 4 27B | 15 GB | 24 GB+ Mac or GPU, best MoE balance | ollama run gemma4:27b-it-q4_K_M |
| Gemma 4 31B | 18 GB | 48 GB+ Mac or 32 GB+ GPU, max quality | ollama run gemma4:31b-it-q4_K_M |
Three Modalities, One Model
TEXT
Text Understanding
Fluent conversation, long-document analysis, and code generation — up to 256K context window with Gemma 4
Chat with Gemma 4 →VISION
Image Understanding
Upload any image for instant descriptions, visual Q&A, and multi-image comparison powered by Gemma 4
Upload an image →AUDIO
Audio Understanding
Native audio input for speech-to-text, meeting summaries, and content extraction with Gemma 4
Upload audio →Try Every Gemma 4 Tool Free — No Signup Required
Powered by Google's Gemini API free tierStart Using Gemma 4 →Gemma 4 vs Qwen 3.5 · Community Benchmarks
AI Tools
Run Locally
| Model | VRAM (Q4_K_M) |
|---|---|
| Gemma 4 E2B | ~1.5 GB |
| Gemma 4 E4B | ~2.8 GB |
| Gemma 4 27B MoE | ~15 GB |
| Gemma 4 31B Dense | ~18 GB |
ollama run gemma4:27b-it-q4_K_M█