Google's Gemini outperforms ChatGPT in key AI tests

In a comparative evaluation of leading AI models, Google's Gemini 3.2 Fast demonstrated strengths in factual accuracy over OpenAI's ChatGPT 5.2, particularly in informational tasks. The tests, prompted by Apple's partnership with Google to enhance Siri, highlight evolving capabilities in generative AI since 2023. While results were close, Gemini avoided significant errors that undermined ChatGPT's reliability.

Ars Technica conducted a series of tests on January 21, 2026, pitting Google's Gemini 3.2 Fast against OpenAI's ChatGPT 5.2, the default models accessible without subscriptions. This evaluation follows Apple's decision to integrate Gemini into the next version of its Siri assistant, marking a shift from earlier comparisons when Google's AI was known as Bard in late 2023.

The prompts spanned creative and practical scenarios, including generating dad jokes, solving a mathematical puzzle about fitting Windows 11 onto 3.5-inch floppy disks, crafting a fictional story of Abraham Lincoln inventing basketball, writing a biography of journalist Kyle Orland, drafting emails to address unrealistic work deadlines, assessing medical claims about healing crystals for cancer, providing guidance for beating Super Mario Bros. level 8-2 without running, and outlining steps to land a Boeing 737-800 for a novice.

Gemini secured wins in four categories: the floppy disk calculation, where it offered clearer explanations and historical context; the biography, where it avoided hallucinations about Orland's career start in 2012 and linked to sources; the email advice, where it provided three tailored options with usage tips; and the video game strategy, where it suggested innovative workarounds such as bouncing off enemies to cross gaps. ChatGPT prevailed in three: dad jokes, for slightly greater originality; creative writing, for charming details such as Lincoln using a stovepipe hat for scoring; and the plane landing prompt, where aviation expert Lee Hutchinson deemed its answer more practical because it urged seeking professional help over risky solo actions. The medical advice prompt ended in a tie, with both models dismissing the crystals' efficacy while noting psychological benefits and recommending consulting a doctor.

Overall, Gemini earned four points to ChatGPT's three, with one draw. The tests underscore Gemini's edge in factual reliability: it avoided the kind of errors, such as ChatGPT's in the biography and game-level prompts, that erode trust in a model's answers. This progress likely influenced Apple's choice of partner and signals Google's gains in the AI landscape.

