
Sharpa’s New VTLA Model Targets the Hardest Problem in Robotics

Remix Reality Newsroom

20 Jan 2026 — 1 min read

Source: Sharpa

Sharpa introduced CraftNet, a VTLA model that combines vision, tactile sensing, language, and action for fine robotic manipulation.
The company frames CraftNet as central to its mission of “manufacturing time by making robots useful.”

Sharpa has introduced CraftNet, a control system for fine robotic manipulation that combines vision, tactile sensing, language, and action, an approach the company calls VTLA. The system is built to run on real robots and carry out long physical tasks without scripting or simulation. Sharpa links CraftNet to its core mission: “We manufacture time by making robots useful.”

CraftNet is built on Sharpa’s multi-system architecture, which is designed to mirror how humans combine reflexes with higher-level planning. It includes two layers: System 0, the Interaction Brain, handles fast, reflex-like responses, while System 1, the Motion Brain, manages longer-term coordination. Sharpa says this structure enables reliable control at the “last millimeter,” a challenge it describes as “90% of the problem” in fine manipulation.

The company points to opportunities in retail, restaurants, hotels, and eventually the home, where robots could move beyond novelty and take on real tasks. Each job handed off to a robot becomes, in the company’s words, a deposit into humanity’s “time bank.”

🌀 Tom’s Take:

Dexterity is still the hardest problem in robotics, and CraftNet is Sharpa’s attempt to solve it with a system built for real-world control.

Source: PR Newswire / Sharpa

