NVIDIA Launches Cosmos 3 Physical AI Model For Robot Training and Vision Systems


TL;DR

  • Launch Scope: NVIDIA has launched Cosmos 3 as a physical-AI model that generates both world data and robot-action data for robotics and vision systems.
  • Deployment Path: OpenMDW-1.1 gives developers one framework for model artifacts, code, documentation, and data, with access through build.nvidia.com, open repositories, and NIM packaging.
  • Backdrop: The release extends NVIDIA’s physical-AI strategy, while Marble and Genie remain older, hedged comparisons rather than proof of today’s rollout.

NVIDIA launched Cosmos 3 for physical AI this week at GTC Taipei during Computex as a model built to generate both world data and robot-action data for machines operating in changing real-world scenes. Robots, autonomous vehicles, and large vision systems are the intended targets, giving the release practical stakes beyond a research demo.

Open-source packaging is part of the rollout, too. The Linux Foundation released OpenMDW-1.1 on May 28 as a framework for AI model distributions, and developers can use that framework to keep model artifacts, code, documentation, and data under one legal structure.

With the single model-centric license, developers can train, modify, contribute, redistribute, and deploy weights, architecture, documentation, datasets, benchmarks, and code without splitting them across separate legal bundles.

One on-page NVIDIA caption describes the product as “Cosmos 3 powers perception, prediction and action.” NVIDIA is trying to position physical AI as deployable engineering software rather than another chatbot-style model.

Cosmos 3 Turns Scene Understanding Into Action

Cosmos 3 is a world model, meaning an AI system designed to simulate how an environment changes rather than simply label what a camera sees. Its reasoning block interprets a scene before a generation stage produces grounded outputs such as synthetic video and robot-task data.

Native action generation pushes the system beyond visual prediction. Cosmos 3 can emit numerical robot data such as joint angles, gripper positions, and trajectory points, giving robotics teams material they can feed into planning and control workflows.