News

Google DeepMind Simulates Robot Behavior Using Veo World Model

Remix Reality Newsroom

17 Dec 2025 — 1 min read

Source: Google DeepMind

DeepMind and Gemini Robotics used a video-based simulation system to test robot behavior across tasks without physical hardware.
The system accurately predicted performance and flagged safety issues by simulating both standard and altered environments.

DeepMind and the Gemini Robotics team built a world model using Veo, Google's frontier video foundation model, to evaluate robot behavior without needing physical hardware. The model was fine-tuned to generate realistic, multi-camera predictions of future robot actions based on current scene images and planned motion sequences. It also includes generative tools for editing scenes across multiple axes, such as objects, backgrounds, and distractors, to support testing under varied conditions.

The system was used to test eight vision-language-action models on typical tasks as well as more difficult ones, where scenes were changed by adding new objects, visual clutter, or background shifts using Veo’s editing features. The evaluations were conducted on a bimanual robot and included instruction-driven tasks such as pick-and-place and object relocation, for example, placing red grapes into a compartment or putting a LEGO piece into a bag. Across more than 1,600 real-world evaluations, DeepMind reports the simulated results closely matched physical tests, showing the approach can rank and compare model performance with accuracy.

Source: YouTube / Gemini Robotics

The team also used the system to red team safety by editing scenes with risky instructions or objects, revealing unsafe behaviors before using real hardware. This makes it possible to evaluate robot behavior more quickly, safely, and across a wider range of scenarios.

🌀 Tom’s Take:

DeepMind research shows that video simulations can accurately test how robots behave, fail, or act unsafely, making it easier to improve robotics without relying on costly or risky real-world trials.

Source: DeepMind

📬 Remix Reality Insider: Turning Movement Into Control

In this drop, we explore how motion becomes control, why glTF brings Gaussian splats into the 3D mainstream, and what it feels like to stand next to real humanoid robots. PLUS: Waymo’s $16B raise, Apeiron’s ocean drones, Bedrock’s fleets, Faraday’s robots.

Apeiron Labs Raises $9.5M to Expand Real-Time Ocean Intelligence Network

Apeiron Labs has raised $9.5 million in Series A funding to expand its network of autonomous underwater vehicles (AUVs) for real-time ocean intelligence. The round was co-led by S2G Investments, RA Capital’s Planetary Health Fund, and DYNE Ventures, with participation from Assembly Ventures

Khronos Publishes Release Candidate for glTF Gaussian Splatting Extension

Khronos Group released a draft update to glTF that adds support for 3D Gaussian splat data. The group said the rapid growth of Gaussian splatting could lead to fragmentation without standardization. The draft is open for industry feedback ahead of final approval.

Humanoid Introduces KinetIQ as Unified AI Framework for Humanoid Robot Fleets

Humanoid introduced KinetIQ, its proprietary AI framework for end-to-end orchestration of humanoid robot fleets. Humanoid says KinetIQ is a single system that controls robots with different physical designs and coordinates how they work together.

🌀 Tom’s Take:

Read more

📬 Remix Reality Insider: Turning Movement Into Control

Apeiron Labs Raises $9.5M to Expand Real-Time Ocean Intelligence Network

Khronos Publishes Release Candidate for glTF Gaussian Splatting Extension

Humanoid Introduces KinetIQ as Unified AI Framework for Humanoid Robot Fleets