Particle News: Study Finds AI Search Tools Are Inaccurate 60% of the Time

Overview

The Tow Center for Digital Journalism tested eight AI search tools, including ChatGPT, Perplexity, and Microsoft's Copilot, for accuracy in retrieving and citing news content.
On average, AI search tools were correct less than 40% of the time, with Perplexity performing best at 63% accuracy and Grok-3 the worst at just 6%.
ChatGPT responded to all queries but was completely accurate only 28% of the time, while Microsoft's Copilot declined more than half of the queries and was inaccurate on 70% of those it answered.
Premium versions of AI tools often gave incorrect answers more confidently than their free counterparts, raising concerns about transparency and value for users paying up to $200 per month.
The study also revealed that many AI tools fabricated links, ignored website restrictions, and misattributed content, underscoring their unreliability for factual searches.