In a new study, Apple researchers present a diffusion model that can write up to 128 times faster than its counterparts. Here’s how it works.
The nerdy bits
Here’s what you need to know for this study: LLMs such as ChatGPT are autoregressive models. They generate text sequentially, one token at a time, taking into account both the user’s prompt and all previously generated tokens.
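To make the one-token-at-a-time loop concrete, here is a minimal Python sketch. Everything in it is an illustrative assumption rather than anything from the study: `NEXT` is a made-up lookup table standing in for a real model, and `toy_next_token` and `generate_autoregressive` are hypothetical names. A real LLM would score the whole context with a neural network; this toy only looks at the last token, but the shape of the loop is the same.

```python
# Hypothetical stand-in for a trained model: maps the latest token to the next one.
NEXT = {"the": "cat", "cat": "sat", "sat": "down", "down": "<eos>"}


def toy_next_token(context):
    """Pick the next token given everything generated so far.

    The full context is passed in, as it would be for a real model,
    but this toy only inspects the last token.
    """
    return NEXT.get(context[-1], "<eos>")


def generate_autoregressive(prompt, max_new_tokens=8):
    """Generate tokens strictly one at a time, feeding each one back in."""
    tokens = list(prompt)
    model_calls = 0
    for _ in range(max_new_tokens):
        nxt = toy_next_token(tokens)  # one model call per new token
        model_calls += 1
        if nxt == "<eos>":
            break
        tokens.append(nxt)  # the new token becomes part of the context
    return tokens, model_calls


tokens, model_calls = generate_autoregressive(["the"])
print(tokens, model_calls)
```

The point to notice is that the number of model calls grows with the length of the output, because each token has to wait for the one before it.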
Diffusion models take a different approach. They generate multiple tokens in parallel and refine them over several iterative steps until the full response takes shape.
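The parallel, iterative idea can be sketched in a few lines of Python as well. This is not Apple's method, only a toy version of one common diffusion-style scheme in which the response starts fully masked and a few positions are committed at each step. `TARGET`, `toy_denoiser`, and `generate_diffusion` are hypothetical names, and the "model" here simply knows the answer and attaches a random confidence to each guess.

```python
import math
import random

MASK = "<mask>"
# Hypothetical sentence the toy model "wants" to produce.
TARGET = ["the", "cat", "sat", "on", "the", "mat"]


def toy_denoiser(tokens, rng):
    """Propose a (token, confidence) pair for every position in a single call."""
    return [(TARGET[i], rng.random()) for i in range(len(tokens))]


def generate_diffusion(num_steps=3, seed=0):
    """Start fully masked and commit several tokens per step, in parallel."""
    rng = random.Random(seed)
    tokens = [MASK] * len(TARGET)
    per_step = math.ceil(len(tokens) / num_steps)
    model_calls = 0
    for _ in range(num_steps):
        proposals = toy_denoiser(tokens, rng)  # one call covers all positions
        model_calls += 1
        masked = [i for i, tok in enumerate(tokens) if tok == MASK]
        # Commit the most confident proposals; the rest stay masked for later steps.
        masked.sort(key=lambda i: proposals[i][1], reverse=True)
        for i in masked[:per_step]:
            tokens[i] = proposals[i][0]
    return tokens, model_calls


tokens, model_calls = generate_diffusion(num_steps=3)
print(tokens, model_calls)
```

In this sketch, six tokens are produced with three model calls instead of six. Fewer steps mean fewer calls, which is where the speed of diffusion-style generation comes from. A real model would pay for that with some loss of quality, which this toy does not capture.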
Source: Apple’s new language model can write long texts incredibly fast – 9to5Mac