Google's Gemini outperforms ChatGPT in key AI tests

In a comparative evaluation of leading AI models, Google's Gemini 3.2 Fast demonstrated strengths in factual accuracy over OpenAI's ChatGPT 5.2, particularly in informational tasks. The tests, prompted by Apple's partnership with Google to enhance Siri, highlight evolving capabilities in generative AI since 2023. While results were close, Gemini avoided significant errors that undermined ChatGPT's reliability.

Ars Technica conducted a series of tests on January 21, 2026, pitting Google's Gemini 3.2 Fast against OpenAI's ChatGPT 5.2, the default models accessible without subscriptions. This evaluation follows Apple's decision to integrate Gemini into the next version of its Siri assistant, marking a shift from earlier comparisons when Google's AI was known as Bard in late 2023.

The prompts spanned creative and practical scenarios, including generating dad jokes, solving a mathematical puzzle about fitting Windows 11 onto 3.5-inch floppy disks, crafting a fictional story of Abraham Lincoln inventing basketball, writing a biography of journalist Kyle Orland, drafting emails to address unrealistic work deadlines, assessing medical claims about healing crystals for cancer, providing guidance for beating Super Mario Bros. level 8-2 without running, and outlining steps to land a Boeing 737-800 for a novice.

Gemini secured wins in four categories: the floppy disk calculation, where it offered clearer explanations and historical context; the biography, avoiding hallucinations about Orland's career start in 2012 and linking to sources; email advice, providing three tailored options with usage tips; and video game strategy, suggesting innovative workarounds like enemy bounces for gaps. ChatGPT prevailed in dad jokes for slight originality, creative writing for charm in details like Lincoln using a stove pipe hat for scoring, and the plane landing prompt, deemed more practical by aviation expert Lee Hutchinson for urging professional help over risky solo actions. The medical advice prompt ended in a tie, with both models dismissing crystals' efficacy while noting psychological benefits and recommending doctor consultations.

Overall, Gemini earned four points to ChatGPT's three, with one draw. The tests underscore Gemini's edge in factual reliability, reducing distrust from errors like ChatGPT's in the biography and game level. This progress likely influenced Apple's partnership choice, signaling Google's gains in the AI landscape.

ተያያዥ ጽሁፎች

Illustration depicting OpenAI's ChatGPT-5.2 launch, showing professionals using the AI to enhance workplace productivity amid rivalry with Google's Gemini.
በ AI የተሰራ ምስል

OpenAI releases ChatGPT-5.2 to boost work productivity

በAI የተዘገበ በ AI የተሰራ ምስል

OpenAI has launched ChatGPT-5.2, a new family of AI models designed to enhance reasoning and productivity, particularly for professional tasks. The release follows an internal alert from CEO Sam Altman about competition from Google's Gemini 3. The update includes three variants aimed at different user needs, starting with paid subscribers.

Google has released Gemini 3.1 Pro, an updated version of its flagship AI model, emphasizing improvements in problem-solving and reasoning. The model is available in preview for developers and consumers starting today. It builds on the Gemini 3 release from November.

በAI የተዘገበ

Apple has selected Google's Gemini AI models to enhance its Siri virtual assistant in a forthcoming update. The decision, announced in a joint statement, marks a shift from previous integrations with OpenAI's ChatGPT. This multi-year partnership aims to deliver more capable AI experiences while upholding Apple's privacy standards.

Google has launched Personal Intelligence, a new feature for its Gemini AI that integrates data from Gmail, Photos, Search, and YouTube to deliver more tailored responses. Available initially to paid subscribers in the US, the opt-in tool emphasizes user privacy controls and avoids direct training on personal data. The rollout begins in beta, with plans for broader access in the future.

በAI የተዘገበ

Google is overhauling its Workspace apps by integrating deeper Gemini AI capabilities to assist in document creation and editing. The updates allow Gemini to pull context from emails, files, and other sources to generate drafts and refine content. These features aim to streamline workflows for users across Docs, Sheets, Slides, and Drive.

The US Pentagon has unveiled a new artificial intelligence platform built on Google's Gemini model. This development equips the military with advanced AI tools. Yet, reactions are mixed, with some expressing unease about its implications.

በAI የተዘገበ

Google has announced that its experimental AI prototype, Genie 3, is now available to subscribers of its highest-tier AI plan. The tool allows users to generate and navigate interactive 3D worlds using simple text prompts. Previously limited to trusted testers, this expansion marks a step toward broader access for the 18-and-older audience.

 

 

 

ይህ ድረ-ገጽ ኩኪዎችን ይጠቀማል

የእኛን ጣቢያ ለማሻሻል ለትንታኔ ኩኪዎችን እንጠቀማለን። የእኛን የሚስጥር ፖሊሲ አንብቡ የሚስጥር ፖሊሲ ለተጨማሪ መረጃ።
ውድቅ አድርግ