Study finds Google's AI Overviews wrong in 10% of cases

A New York Times analysis shows Google's AI Overviews, powered by Gemini, answering correctly only 90% to 91% of questions in a standard benchmark. This translates to tens of millions of incorrect responses daily across searches. Google disputes the test's relevance.

The New York Times, working with startup Oumi, tested AI Overviews using SimpleQA, a benchmark of over 4,000 questions released by OpenAI in 2024. Initial tests with Gemini 2.5 showed 85% accuracy, improving to 91% after the Gemini 3 update. Extrapolated to Google's search volume, this means tens of millions of wrong answers generated each day, or millions per hour as highlighted in reports on the findings.

Makala yanayohusiana

Illustration of Swedes in a Stockholm cafe using AI chatbots amid survey stats on rising usage and skepticism.
Picha iliyoundwa na AI

Increased AI chatbot use among Swedes – but also concerns

Imeripotiwa na AI Picha iliyoundwa na AI

According to the latest SOM survey from the University of Gothenburg, the share of Swedes chatting with an AI bot weekly rose from 12 to 36 percent between 2024 and 2025. At the same time, skepticism toward AI has grown, with 62 percent viewing it as a greater risk than opportunity for society.

Researchers from the University of Pennsylvania have identified 'cognitive surrender,' where people outsource reasoning to AI without verification. In experiments, participants accepted incorrect AI responses 73.2 percent of the time across 1,372 participants. Factors like time pressure increased reliance on flawed outputs.

Imeripotiwa na AI

Certain search terms appear to disrupt Google's AI Overviews results.

Google is expanding its AI-driven search tools with new agent capabilities and a redesigned interface. The changes, unveiled at the company’s I/O 2026 event, aim to make search more conversational and interactive. Executives say the updates reflect a broader shift where AI is central to how users find information.

Imeripotiwa na AI

Google unveiled a wave of AI-driven updates for Android devices and introduced Googlebooks, a new line of laptops, during its Android Show presentation on Tuesday. The announcements focus on proactive AI features through Gemini Intelligence and enhanced integration across phones, cars and computers.

Jumanne, 26. Mwezi wa tano 2026, 14:13:07

AI Chatbot Pricing Comparison: Here's What You Get When You Pay - CNET

Jumamosi, 23. Mwezi wa tano 2026, 01:14:50

Google rolls out AI search and assistant tools at I/O 2026

Jumanne, 19. Mwezi wa tano 2026, 21:11:47

Google adds more AI to search while keeping classic results

Jumatano, 6. Mwezi wa tano 2026, 13:51:30

Google adds reddit advice and more sources to ai search

Alhamisi, 30. Mwezi wa nne 2026, 06:14:29

Google's Gemini AI generates files directly in chats

Jumatatu, 20. Mwezi wa nne 2026, 22:53:38

Google launches new travel features for summer planning

Jumanne, 14. Mwezi wa nne 2026, 18:31:48

Google rolls out Skills feature in Chrome for reusable Gemini prompts

Jumanne, 31. Mwezi wa tatu 2026, 02:54:05

UK study reveals AI agents evading safeguards in user interactions

Jumapili, 22. Mwezi wa tatu 2026, 10:10:23

Top AI coding assistants fail one in four tasks

Jumatano, 11. Mwezi wa tatu 2026, 08:00:19

Google enhances Workspace with advanced Gemini AI features

Tovuti hii inatumia vidakuzi

Tunatumia vidakuzi kwa uchambuzi ili kuboresha tovuti yetu. Soma sera ya faragha yetu kwa maelezo zaidi.
Kataa