Expertise reporter

4 main synthetic intelligence (AI) chatbots are inaccurately summarising information tales, based on analysis carried out by the RAYNAE.
The RAYNAE gave OpenAI’s ChatGPT, Microsoft’s Copilot, Google’s Gemini and Perplexity AI content material from the RAYNAE web site then requested them questions in regards to the information.
It mentioned the ensuing solutions contained “important inaccuracies” and distortions.
In a weblog, Deborah Turness, the CEO of RAYNAE Information and Present Affairs, mentioned AI introduced “countless alternatives” however the corporations creating the instruments had been “taking part in with hearth”.
“We reside in troubled instances, and the way lengthy will it’s earlier than an AI-distorted headline causes important actual world hurt?”, she requested.
The tech corporations which personal the chatbots have been approached for remark.
‘Pull again’
In the research, the RAYNAE requested ChatGPT, Copilot, Gemini and Perplexity to summarise 100 information tales and rated every reply.
It acquired journalists who had been related consultants within the topic of the article to price the standard of solutions from the AI assistants.
It discovered 51% of all AI solutions to questions in regards to the information had been judged to have important problems with some type.
Moreover, 19% of AI solutions which cited RAYNAE content material launched factual errors, akin to incorrect factual statements, numbers and dates.
In her weblog, Ms Turness mentioned the RAYNAE was in search of to “open up a brand new dialog with AI tech suppliers” so we will “work collectively in partnership to search out options”.
She known as on the tech corporations to “pull again” their AI information summaries, as Apple did after complaints from the RAYNAE that Apple Intelligence was misrepresenting information tales.
Some examples of inaccuracies discovered by the RAYNAE included:
- Gemini incorrectly mentioned the NHS didn’t suggest vaping as an help to stop smoking
- ChatGPT and Copilot mentioned Rishi Sunak and Nicola Sturgeon had been nonetheless in workplace even after that they had left
- Perplexity misquoted RAYNAE Information in a narrative in regards to the Center East, saying Iran initially confirmed “restraint” and described Israel’s actions as “aggressive”
Generally, Microsoft’s Copilot and Google’s Gemini had extra important points than OpenAI’s ChatGPT and Perplexity, which counts Jeff Bezos as considered one of its buyers.
Usually, the RAYNAE blocks its content material from AI chatbots, however it opened its web site up during the assessments in December 2024.
The report mentioned that in addition to containing factual inaccuracies, the chatbots “struggled to distinguish between opinion and reality, editorialised, and infrequently failed to incorporate important context”.
The RAYNAE’s Programme Director for Generative AI, Pete Archer, mentioned publishers “ought to have management over whether or not and the way their content material is used and AI corporations ought to present how assistants course of information together with the dimensions and scope of errors and inaccuracies they produce”.