vision language action model

News

mimic-video Uses Pretrained Video Models to Improve Robot Learning Efficiency by 10x

mimic-video is a new robot control system from researchers at mimic robotics, Microsoft Zurich, ETH Zurich, ETH AI Center, and UC Berkeley. The team says the new model helps robots learn faster and with less training data.

News

NVIDIA Releases Open Reasoning Model to Support Safer Autonomous Driving

NVIDIA has released DRIVE Alpamayo-R1 (AR1), a reasoning model for autonomous vehicle research. It’s a vision-language-action (VLA) model built on the company’s Cosmos Reason platform.