Meet DiffusionGemma! It's an experimental open model that explores a fast approach to text generation, released under an Apache 2.0 license. It moves beyond sequential, token-by-token processes to generate entire blocks of text simultaneously. Here’s what’s new with DiffusionGemma: 👇 https://t.co/iaVMPr0WKx
Google Releases DiffusionGemma for 4x Faster Parallel Text Generation
Google released DiffusionGemma, an experimental open model that generates text using diffusion instead of sequential token prediction. By generating 256 tokens in parallel, it delivers up to 4x faster inference on dedicated GPUs, exceeding 1,000 tokens per second on an H100. The 26B Mixture-of-Experts (MoE) model supports real-time self-correction for tasks like code infilling and in-line editing.
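To make the contrast between token-by-token and block-parallel generation concrete, here is a rough back-of-the-envelope sketch that counts model forward passes under each scheme. Only the 256-token block size comes from the summary above. The function names and the number of refinement steps per block are hypothetical values chosen for illustration, not published DiffusionGemma settings. A forward pass over a whole block also does not cost the same as a single-token pass, so the ratio below is not a speedup measurement.

```python
import math

BLOCK_SIZE = 256  # tokens generated in parallel, per the summary above


def autoregressive_passes(num_tokens: int) -> int:
    """Sequential decoding: one forward pass per generated token."""
    return num_tokens


def block_diffusion_passes(num_tokens: int, steps_per_block: int,
                           block_size: int = BLOCK_SIZE) -> int:
    """Block-parallel decoding (assumed scheme): each block is refined for a
    fixed number of steps, and each step is one pass over the whole block."""
    num_blocks = math.ceil(num_tokens / block_size)
    return num_blocks * steps_per_block


if __name__ == "__main__":
    tokens = 1024
    steps = 64  # HYPOTHETICAL step count, for illustration only
    print("autoregressive passes:", autoregressive_passes(tokens))
    print("block-diffusion passes:", block_diffusion_passes(tokens, steps))
```

With these assumed numbers, 1024 tokens take 1024 sequential passes versus 4 blocks of 64 steps each. Changing the assumed step count changes the ratio directly, which is why the real-world figure depends on the model's actual sampling schedule.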
Every HeadsUpAI update is written based on its original source and reviewed before it's published.





