News

Cornell Framework Teaches Robots from a Single Demonstration Video

Remix Reality Newsroom

10 May 2025 — 1 min read

Source: Cornell Chronicle

Robots using RHyME improved task success by over 50% with just 30 minutes of robot data.
The framework draws on past video examples to bridge differences between human and robot actions.

A Cornell research team has developed a new training method for robots that relies on a single demonstration video. The approach, named RHyME, addresses a common challenge in robotic learning—how to manage discrepancies between human actions and robotic capabilities—by allowing robots to infer actions from related visual data they have previously encountered.

Instead of copying actions exactly, RHyME retrieves relevant snippets from past demonstrations and adapts them to fit the new task. When shown a person performing a sequence, like placing a mug in a sink, it draws from its video memory to find similar steps, such as grasping or lowering, and pieces them together to complete the task.

“This work is a departure from how robots are programmed today. The status quo of programming robots is thousands of hours of tele-operation to teach the robot how to do tasks. That’s just impossible,” said senior author Sanjiban Choudhury, assistant professor of computer science at the Cornell Ann S. Bowers College of Computing and Information Science, on the school's website. “With RHyME, we’re moving away from that and learning to train robots in a more scalable way.”

In lab experiments, RHyME-equipped robots performed complex sequences with over 50% higher task success than prior methods, using just 30 minutes of training data. The system's ability to handle multi-step tasks from a single demonstration highlights a shift in how robotic learning can operate under real-world conditions.

🌀 Tom's Take:

Teaching robots how to perform tasks that we take for granted as humans is a significant undertaking, and this research suggests that there are new, more scalable methods to explore, including many that we apply to our development.

Source: Cornell News

ShelfOptix Makes Real-Time Shelf Intelligence a Managed Service

Brain Corp and Driveline Retail have rolled out ShelfOptix, a robot-powered service that delivers real-time shelf visibility without requiring retailers to manage the hardware.

Moonlake AI Debuts with $28M to Vibe Code Interactive Worlds

Moonlake AI has exited stealth with a $28 million seed round backed by AIX Ventures, Threshold, NVIDIA Ventures, and more than a dozen AI leaders and founders.

EchoTwin AI Raises $8M to Expand AI-Powered Urban Digital Twins

EchoTwin AI has raised $8 million in a seed round led by Metis Ventures, with support from a global syndicate of investors. The funds will be used to scale the company's real-time urban infrastructure platform and expand its reach in the fast-growing smart cities market.

First Robotic GI Procedure by a Gastroenterologist Completed Using EndoQuest System

Dr. Norio Fukami of Mayo Clinic, Arizona has performed the world’s first fully robotic endoscopic submucosal dissection (ESD) by a gastroenterologist. The procedure used EndoQuest’s Endoluminal Surgical (ELS) System and was completed under the FDA’s PARADIGM Trial.

🌀 Tom's Take:

Read more

ShelfOptix Makes Real-Time Shelf Intelligence a Managed Service

Moonlake AI Debuts with $28M to Vibe Code Interactive Worlds

EchoTwin AI Raises $8M to Expand AI-Powered Urban Digital Twins

First Robotic GI Procedure by a Gastroenterologist Completed Using EndoQuest System