diffusion
Qwen used human-feedback training to make its image AI follow directions better News
A new recipe applies the same reinforcement-learning approach that polished chatbots to an image generator, then merges separate skill models into one - improving how faithfully it follows prompts and edits.
This AI predicts how objects move by tracking shapes, not pixels News
PhysiFormer forecasts physical motion as real 3D meshes in space - and recovers rigidity and momentum without anyone hand-coding the laws of physics.
NVIDIA shrinks video generation down to real time News
A new NVIDIA recipe distills slow video-generating AI into a fast version that can stream frames live and react to your actions.
A language model that doesn't write left to right News
iLLaDA is an 8-billion-parameter model that generates text by refining a blurry whole rather than one word at a time, and it's catching up to the mainstream.
Two labs race to make AI write whole paragraphs at once instead of word by word News
Diffusion text models generate in parallel blocks rather than left to right; Google's open DiffusionGemma and Inception's Mercury 2 are now in a head-to-head over speed.
An image generator that catches and corrects its own errors mid-draw News
Image-generating models often quietly break the very rule they were told to follow. A new method trains them to notice that error as they work and steer back on target.
AI builds a single 3D object that shows two different things from two angles News
A new training-free method generates 3D visual illusions — one sculpture that reads as completely different objects depending on where you stand — in minutes instead of hours.
What are diffusion language models? Lesson
Most AI writes one word at a time and can never go back. Diffusion language models start from noise and clarify it iteratively — and some versions can revise any word at any step. A growing alternative to the standard left-to-right approach.
An AI that could rewrite its own words — and gained nothing from it News
A different style of text AI can go back and change any word at any point as it writes. Given that power, it didn't actually produce better writing. A clean negative result.
Mercury 2 (Inception Labs) Tool
An API-only diffusion language model pitched on raw speed, claiming to out-pace open diffusion models on tokens-per-second for latency-sensitive generation.
LLaDA / iLLaDA Tool
An openly released diffusion language model (weights and code) that generates text by refining a whole passage at once rather than one word at a time, useful for experimenting with non-autoregressive generation and infilling.
DiffusionGemma Tool
Google's open-weight text-diffusion model that generates text in parallel blocks instead of one token at a time; Apache-2.0, runnable locally, with community tooling already shipping.