Cosmos 1.0

NVIDIA · January 2025

● activeOpen Sourcedecoder onlyvideo

Why It Matters

First major 'world model' designed not for chatting but for understanding physics — generates realistic 3D environments for training robots and self-driving cars.

Description

NVIDIA's World Foundation Model designed not for conversation but for understanding physics. Generates realistic synthetic 3D environments and videos that can be used to train robots, self-driving cars, and other physical AI systems — essentially creating virtual worlds where machines can safely learn before operating in the real one.

Key Innovations

Text-to-Video

Text-to-VideoGenerating video clips from text descriptions — one of the newest and most compute-intensive AI capabilities.

Multimodal

MultimodalProcessing multiple types of input (text, images, audio, video) in a single model.

External Links

Announcement

More from NVIDIA Nemotron

Megatron-Turing NLG2021-10 · 530B

Nemotron-4 15B2024-03 · 15B

Nemotron-4 340B2024-06 · 340B

Llama-3.1-Nemotron-70B2024-10 · 70B

NVLM 1.02024-10 · 72B

Nemotron 3 Nano2025-12 · 30B (3B active)

Nemotron 3 Super2026-03 · 120B (12B active)

Nemotron 3 Ultra2026-05 · 550B (55B active)

PreviousNVLM 1.0

NextNemotron 3 Nano