CLUSTER · TIER 1
Google DeepMind releases DiffusionGemma for 4x faster parallel text generation.
Google DeepMind released DiffusionGemma, an experimental open model that generates multiple words in parallel rather than sequentially, enabling low-latency text generation for single-user workloads. NVIDIA optimized the model to run faster across GeForce RTX GPUs, RTX PRO platforms, and DGX Spark systems from local PCs to the cloud.
Sources
5
X mentions
13k ▲
First seen
22Dago
Velocity
+11%/6h