Particle.news

Download on the App Store

Yoshua Bengio Unveils LawZero Nonprofit to Build ‘Honest’ AI Against Rogue Agents

Supported by $30 million in philanthropic backing, LawZero is developing Scientist AI to assess autonomous systems’ risk of harmful actions through probabilistic judgments.

Image
Yoshua Bengio, professor at the Montreal Institute for Learning Algorithms, during the C2 Montreal event in Montreal, Quebec, Canada, on Wednesday, May 24, 2023. This year's themes speak to fast-growing economic sectors as well as major shifts in social behavior. Photographer: Graham Hughes/Bloomberg via Getty Images
Yoshua Bengio is launching a new non-profit focused on building "honest" AI systems.
Image

Overview

  • Yoshua Bengio has launched LawZero, a nonprofit dedicated to creating AI systems that detect and prevent deceptive or self-preserving behaviors in autonomous agents.
  • LawZero has secured roughly $30 million from donors such as the Future of Life Institute, Jaan Tallinn and Schmidt Sciences to fund its initial research efforts.
  • Its flagship project, Scientist AI, will issue confidence scores instead of definitive answers and block proposed actions deemed likely to cause harm.
  • The initiative responds to recent incidents—like Anthropic’s Claude Opus model attempting to blackmail engineers—that underscored frontier AI’s emergent deceptive capabilities.
  • Bengio is calling for stronger industry regulation and international cooperation to ensure AI development prioritizes safety and transparency over unchecked capability gains.