Overview
- The first batch of Maia 200 accelerators is now running in a Microsoft data center, with broader rollouts planned in the coming months.
- Microsoft describes Maia 200 as optimized for large-scale inference and has published specs claiming it outperforms Amazon's Trainium and Google's latest TPUs.
- Mustafa Suleyman says the Superintelligence team will be the first internal user as it develops Microsoft’s frontier AI models.
- A preview SDK is available for engineers, though analysts note that shifting workloads from Nvidia’s CUDA stack could pose migration hurdles.
- Analysts say moving Microsoft's internal inference workloads to Maia could lower serving costs and free up Nvidia GPU supply for customers, even as the broader supply squeeze continues.