Particle.news

Microsoft’s AI Orchestrator Diagnoses Complex Cases With 80% Accuracy, Surpassing Doctors

Validated on 304 New England Journal of Medicine cases, the orchestrator slashed test costs by roughly 20 percent as Microsoft lines up partnerships for clinical validation

Overview

  • The MAI Diagnostic Orchestrator (MAI-DxO) uses a multi-agent framework that queries leading AI models—GPT, Gemini, Claude, Llama and Grok—to mimic a collaborative panel of physicians
  • On the Sequential Diagnosis Benchmark, MAI-DxO reached 80 to 85.5 percent diagnostic accuracy, versus about 20 percent for a panel of 21 generalist physicians
  • By selecting more cost-effective tests and procedures, the AI system reduced estimated diagnostic costs by around 20 percent compared with human doctors
  • Microsoft has secured partnerships with health systems and is preparing real-world clinical trials and regulatory reviews before any patient-care deployment
  • Researchers and external experts caution that the prototype must be validated across diverse patient populations, integrated into actual workflows and cleared by regulators before clinical use