DiffusionGemma: 4x faster text generation

基本信息

来源: blogs_podcasts
原始来源: https://blog.google/innovation-and-ai/technology/developers-tools/diffusion-gemma-faster-text-generation

来源摘要/节选

公开展示已截断至最多 800 个字符；请访问原始来源查看完整上下文。

Our newest open experimental model delivers up to 4x faster inference on dedicated GPUs and opens the door to exploring speed-critical, interactive local workflows.
Today, we’re introducing DiffusionGemma, an experimental open model that explores text diffusion, an exceptionally fast approach to text generation. Released under an Apache 2.0 license, this 26B Mixture of Experts (MoE) model moves beyond the sequential token-by-token processing of typical autoregressive Large Language Models (LLMs). Instead, it generates entire blocks of text simultaneously, delivering up to 4x faster text generation on GPUs.…

来源说明

当前只保存了公开页面节选，不代表原文全文。请以原始来源为准。

本页只呈现已做哈希绑定的来源证据，不包含基于旧正文或缺失原文的扩展推断。

DiffusionGemma: 4x faster text generation

基本信息

来源摘要/节选

来源说明

应用场景

AI/ML项目

大语言模型

从首次观测到传播链