Runway announces GWM-1 family of world models

AI firm Runway has unveiled GWM-1, its initial set of world models designed to extend beyond video generation into areas like robotics and avatars. Built on the Gen-4.5 text-to-video model, these three specialized autoregression models enable real-time simulations, synthetic data creation, and natural human-like interactions. The launch highlights Runway's push into a competitive field dominated by tech giants.

Runway, known for its video generation tools, introduced GWM-1 as a trio of models post-trained on domain-specific data from its Gen-4.5 foundation. This move signals the company's expansion from creative industries into broader AI applications.

The first, GWM Worlds, provides an interface for exploring digital environments with real-time user inputs influencing frame generation. Users can specify world elements, appearances, physics rules, and actions such as camera movements or environmental changes, maintaining consistency over extended sequences. Potential uses include pre-visualization in game development, virtual reality setups, and educational simulations of historical sites. It also supports training AI agents, including those for robotics.

GWM Robotics focuses on producing synthetic training data to enhance robotics datasets, incorporating novel objects, task instructions, and environmental variations. This aids in simulating challenging real-world conditions like varying weather and allows safer, cost-effective policy testing in virtual settings before physical trials. Runway offers a Python SDK for its robotics API on a per-request basis.

GWM Avatars integrates video and speech generation to create avatars that move and emote naturally during speaking and listening, sustaining long conversations without quality loss. It will soon integrate into Runway's web app and API.

While aiming for more unified models across domains, Runway's current versions are distinct. CEO Cristóbal Valenzuela described GWM-1 on X as "a major step toward universal simulation." The company enters a crowded space with players like Google and Nvidia, targeting robotics, physics, and life sciences alongside film and games.

Additionally, Runway revealed Gen-4.5 updates with native audio, audio editing, and multi-shot video capabilities, plus a partnership with CoreWeave for Nvidia's GB300 NVL72 racks to support future AI training and inference.

संबंधित लेख

Photorealistic illustration depicting OpenAI's ChatGPT Images 2 launch, with AI generating text-rich infographics on a laptop screen.
AI द्वारा उत्पन्न छवि

OpenAI launches ChatGPT Images 2 image generation model

AI द्वारा रिपोर्ट किया गया AI द्वारा उत्पन्न छवि

OpenAI announced ChatGPT Images 2, its new AI image model, on Tuesday. The upgrade focuses on creating text-heavy professional visuals like infographics and study guides. It rolls out to all ChatGPT users with generation limits based on subscription plans.

Shanghai-based Fysics AI announced the launch of Fysiverse on Wednesday, a new-generation physics-based world model.

AI द्वारा रिपोर्ट किया गया

California-based Generalist AI has launched Gen-1, a new physical AI model that enables robots to handle tasks like folding laundry, fixing other robots and stuffing cash into wallets. The model draws on human dexterity data collected worldwide to teach robots 'physical common sense.' Co-founder Pete Florence described it as a major advance for real-world robotics.

OpenAI has launched GPT-Rosalind, a large language model trained specifically on biology workflows. The model, named after scientist Rosalind Franklin, aims to address challenges in handling massive biological datasets and specialized subfields. Access is currently limited to US-based entities due to safety concerns.

यह वेबसाइट कुकीज़ का उपयोग करती है

हम अपनी साइट को बेहतर बनाने के लिए विश्लेषण के लिए कुकीज़ का उपयोग करते हैं। अधिक जानकारी के लिए हमारी गोपनीयता नीति पढ़ें।
अस्वीकार करें