Particle.news

Download on the App Store

Meta Unveils V-JEPA2, Video-Trained AI World Model for Robot Reasoning

Offering 30x faster predictions than rival models, it gives robots the physical intuition needed for advanced machine intelligence

From delivery robots to self-driving cars, Meta’s V-JEPA 2 aims to power a new era of intelligent machines that think before they act.
Image

Overview

  • V-JEPA2 extends last year’s video-trained world model by learning physical interactions directly from raw footage without labeled data
  • Meta demonstrated the model guiding lab robots to anticipate object physics for tasks such as reaching, grasping and repositioning items
  • According to Meta, V-JEPA2 delivers physical-world predictions 30 times faster than Nvidia’s Cosmos model
  • The release includes three open benchmarks to help researchers evaluate AI systems’ video-based learning and reasoning capabilities
  • Meta positions V-JEPA2 as a core advance toward its goal of building AI agents that can think before they act