Google DeepMind has released DiffusionGemma, an experimental open model that generates multiple words in parallel rather than one at a time, which enables low-latency text generation for single-user workloads. NVIDIA has optimized the model for GeForce RTX GPUs, RTX PRO platforms, and DGX Spark systems, covering deployments from local PCs to the cloud.
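
To see why parallel generation can reduce latency, it helps to count model passes. The sketch below is a toy model, not DiffusionGemma's actual algorithm: it assumes each pass finalizes a fixed number of words, and the names `sequential_passes`, `parallel_passes`, and `words_per_pass` are hypothetical and chosen only for illustration.

```python
import math


def sequential_passes(num_words: int) -> int:
    """Passes needed when each pass emits exactly one word."""
    if num_words < 0:
        raise ValueError("num_words must be non-negative")
    return num_words


def parallel_passes(num_words: int, words_per_pass: int) -> int:
    """Passes needed when each pass finalizes `words_per_pass` words.

    `words_per_pass` is an illustrative assumption; a real parallel
    decoder may finalize a varying number of words per pass.
    """
    if num_words < 0:
        raise ValueError("num_words must be non-negative")
    if words_per_pass < 1:
        raise ValueError("words_per_pass must be at least 1")
    return math.ceil(num_words / words_per_pass)


if __name__ == "__main__":
    length = 256
    print("sequential passes:", sequential_passes(length))
    print("parallel passes:", parallel_passes(length, words_per_pass=8))
```

With a single user, each pass is typically limited by how fast the hardware can read the model weights rather than by how much it computes, so fewer passes can translate fairly directly into lower response time. The actual speedup depends on the model and hardware and is not implied by these counts.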