Particle.news

Download on the App Store

Amazon Debuts Nova Sonic, a Unified Real-Time Voice AI Model for Enterprises

The new AI model integrates speech recognition, language understanding, and speech synthesis, offering cost-efficient, low-latency voice interactions via Amazon Bedrock.

Image
Amazon has launched Nova Sonic, a new AI voice model built for faster, more accurate, and natural conversations.
Andy Jassy, chief executive officer of Amazon.com Inc., speaks during an unveiling event in New York, US, on Wednesday, Feb. 26, 2025. Amazon has rebooted Alexa with artificial intelligence, marking the biggest overhaul of the voice-activated assistant since its introduction over a decade ago. Photographer: Michael Nagle/Bloomberg via Getty Images
Image

Overview

  • Nova Sonic consolidates speech-to-text, text understanding, and text-to-speech into a single system, enabling natural and context-aware communication.
  • The model supports real-time, interactive voice interactions with low latency, handling interruptions and maintaining conversational flow.
  • Benchmark tests show Nova Sonic outperforms competitors like OpenAI's GPT-4o and Google’s Gemini Flash 2.0 in conversational quality and accuracy.
  • It achieves a word error rate of 4.2% on multilingual benchmarks and demonstrates strong performance in noisy, multi-speaker environments.
  • Available now via Amazon Bedrock, Nova Sonic is nearly 80% cheaper than GPT-4o and is already integrated into Alexa+, Amazon’s upgraded voice assistant.