Ground Truth.
AI, checked against the source.

← All topics

video-generation

Everything on Ground Truth tagged “video-generation” — 8 items.

Google ships a faster, cheaper image model and hands developers conversational video editing News

A lightweight version of Google's image model now makes a picture in about four seconds for a fraction of a cent, while a new video model lets developers edit clips by talking to it.

NVIDIA's new method stops AI dream-worlds from breaking the laws of physics News

PhysisForcing trains video-generating world models to keep objects solid and interactions believable, raising how often a robot's imagined plan actually works.

AI video has a consistency problem. This model targets it. News

DomainShuttle goes after the tug-of-war in subject-driven text-to-video: keeping a specific character or object recognizable across frames while still letting the scene move freely.

NVIDIA shrinks video generation down to real time News

A new NVIDIA recipe distills slow video-generating AI into a fast version that can stream frames live and react to your actions.

What are world models? Lesson

A world model is an AI system's internal understanding of how an environment works — not just what it sees right now, but what will happen after an action, and what would have happened differently. Central to planning, robotics, and the next generation of physical AI.

Turn the camera away, and the AI's world freezes News

A new benchmark tests whether video AI systems can track what happens to parts of a scene the camera isn't currently showing. Across 23 models, the answer is mostly no — and making the models larger made the problem worse, not better.

Turn around, and the world disappears News

AI video models that are supposed to "understand" a 3D scene only remember what's on screen — pan away and back, and things have reset. Bigger models are worse at it.

Gemini Omni Flash Tool

Google's new video model offering developers programmable conversational editing - generate and revise clips up to ten seconds by describing changes in words.