Particle.news

DeepMind Unveils SIMA 2, a Gemini-Powered Agent That Learns in 3D Worlds

The research preview presents a self‑improving, instruction‑following system seen as groundwork for future robotics despite clear current limits.

Overview

  • SIMA 2 integrates Google’s Gemini model to reason about scenes and act in virtual environments, with DeepMind reporting roughly double the performance of SIMA 1.
  • Training drew on human gameplay from eight commercial titles, including No Man’s Sky and Goat Simulator 3, plus three company-built worlds, teaching the agent to map on-screen pixels to mouse and keyboard actions.
  • DeepMind used its Genie 3 world model to generate novel photorealistic environments, where SIMA 2 navigated and interacted with previously unseen objects.
  • The agent self-improves by attempting Gemini-generated tasks, receiving AI feedback and reward-model scores, and learning via repeated trial and error.
  • DeepMind stresses that the system remains experimental: it struggles with long multi-step tasks and fine motor control and has limited memory, and the company has given no timeline for robotics deployment or wider release.
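
The self-improvement cycle described in the overview (propose a task, attempt it, score the attempt, learn from the score) can be illustrated with a toy loop. This is only a minimal sketch under loud assumptions: `propose_task`, `reward_model`, and `ToyAgent` are hypothetical stand-ins invented for illustration, the "world" is reduced to picking one of four named actions, and nothing here reflects DeepMind's actual models, training code, or APIs.

```python
import random
from dataclasses import dataclass, field

# Hypothetical action set; the real agent emits mouse and keyboard actions.
ACTIONS = ["move", "jump", "grab", "open"]


def propose_task(rng):
    """Assumed stand-in for a Gemini-generated task: name the action to perform."""
    return rng.choice(ACTIONS)


def reward_model(task, action):
    """Assumed stand-in for a learned reward model: 1.0 on success, else 0.0."""
    return 1.0 if action == task else 0.0


@dataclass
class ToyAgent:
    # Preference weights per task; every action starts with equal weight.
    prefs: dict = field(default_factory=dict)

    def weights(self, task):
        return self.prefs.setdefault(task, {a: 1.0 for a in ACTIONS})

    def act(self, task, rng):
        # Sample an action in proportion to its current preference weight.
        w = self.weights(task)
        return rng.choices(list(w), weights=list(w.values()))[0]

    def learn(self, task, action, score, lr=1.0):
        # Reinforce the attempted action by the score it received.
        self.weights(task)[action] += lr * score


def self_improve(agent, rounds, seed=0):
    """Run the propose / attempt / score / learn loop and return per-round scores."""
    rng = random.Random(seed)
    scores = []
    for _ in range(rounds):
        task = propose_task(rng)
        action = agent.act(task, rng)
        score = reward_model(task, action)
        agent.learn(task, action, score)
        scores.append(score)
    return scores
```

Calling `self_improve(ToyAgent(), rounds=2000)` returns the list of per-round scores. Because only successful attempts are reinforced, the average score over the last rounds should be higher than over the first, which is the trial-and-error effect in miniature.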