Ground Truth.
AI, checked against the source.

image-generation

Everything on Ground Truth tagged “image-generation” — 8 items.

Google ships a faster, cheaper image model and hands developers conversational video editing News

A lightweight version of Google's image model now makes a picture in about four seconds for a fraction of a cent, while a new video model lets developers edit clips by talking to it.

Qwen used human-feedback training to make its image AI follow directions better News

A new recipe applies the same reinforcement-learning approach that polished chatbots to an image generator, then merges separate skill models into one, improving how faithfully it follows prompts and edit instructions.

Image generators can't plan. This one bolts on a brain that can. News

Qwen-Image-Agent wraps planning, reasoning, and memory around a text-to-image model so it can break a hard request into steps - and the local-AI crowd immediately asked whether it runs on a gaming GPU.

Researchers turn the internet's hobbyist art 'filters' into training fuel News

Cleanly separating 'what's in a picture' from 'what style it's in' usually needs scarce data. A new method mines the huge public library of community-made style add-ons instead.

An image generator that catches and corrects its own errors mid-draw News

Image-generating models often quietly break the very rule they were told to follow. A new method trains them to notice that error as they work and steer back on target.

A tiny image-fixer keeps up with a model fifty times its size News

Filling in the missing parts of an image usually takes a huge model. This one is far smaller and faster, yet matches a system fifty times its size.

Qwen-Image-2.0-Pro Tool

Alibaba's latest open image-generation model in the Qwen family, downloadable and runnable locally, part of a broad open-weight release wave that also refreshed the Qwen3.6 chat models.

Nano Banana 2 Lite Tool

Google's fastest, cheapest Gemini image model: it generates a picture from text in about four seconds for roughly three cents per thousand images, and is built for high-volume use.