AI Race

DeepMind's DiffusionGemma gives the AI race a speed boost

DeepMind announced DiffusionGemma, promising up to 4x faster text generation, and a $10M fund to accelerate multi-agent AI safety research. These moves pair capability gains with investments in governance.

Cyber News Centre Team Jun 16, 2026

DeepMind's DiffusionGemma gives the AI race a speed boost — Abstract luminous cubes represent accelerating AI capability, interconnected agents, and the growing need for safety, coordination, and control.

Google DeepMind announced DiffusionGemma on 10 June 2026 (DeepMind blog), an experimental 26B open-weight model that generates text via discrete diffusion instead of sequential token prediction.

The company claims 1,000-plus tokens per second on a single H100 and 700-plus on an RTX 5090 — roughly four times the throughput of autoregressive equivalents. The model fits quantised in 18 GB VRAM, handles 256K context, supports 140-plus languages, and ships under Apache 2.0 on Hugging Face.

DeepMind chief executive Demis Hassabis framed the release as a complement to safety investment: "advancing generation efficiency and investing in multi-agent safety are complementary priorities to scale agentic AI responsibly."

ML systems researcher Andrew Anokhin noted the architecture "fundamentally changes what's possible for local-first AI — real-time code infilling, interactive structured generation, and self-correcting output are now hardware-feasible on a single consumer GPU."

Why It Matters

Inference speed shapes cost, latency and the practicality of running AI tools locally rather than in the cloud. DeepMind positions DiffusionGemma for interactive workloads — code infilling, in-line editing, real-time formatting — while signalling that Gemma 4 remains preferable for maximum quality at cloud scale. Expect diffusion back-ends in vLLM, TensorRT-LLM and llama.cpp within weeks. For product teams and cloud operators, the economics of local-first inference just shifted.

Get the stories that matter to you.
Subscribe to Cyber News Centre and update your preferences to follow our Daily 4min Cyber Update, Innovative AI Startups, The AI Diplomat series, or the main Cyber News Centre newsletter — featuring in-depth analysis on major cyber incidents, tech breakthroughs, global policy, and AI developments.