Top AI coding assistants fail one in four tasks

Leading AI coding assistants fail one in four tasks, according to a TechRadar analysis. The report points to serious gaps between hype and actual performance reliability, especially in structured output tasks. AI tools are far from flawless in these critical areas.

A TechRadar article published on March 22, 2026, examines the performance of top AI coding assistants. It reveals that these tools fail one in four tasks, highlighting a significant gap between promotional claims and real-world reliability. The analysis focuses on structured output tasks, where AI assistants show notable shortcomings and are described as far from flawless, raising questions about their effectiveness in professional coding environments. The piece's title points to "serious gaps between hype and actual performance reliability." No specific models or methodologies are detailed in the available excerpt, but the findings suggest caution in relying on such tools for critical work.

Related articles

AI coding agents from companies like OpenAI, Anthropic, and Google enable extended work on software projects, including writing apps and fixing bugs under human oversight. These tools rely on large language models but face challenges like limited context processing and high computational costs. Understanding their mechanics helps developers decide when to deploy them effectively.

Reported by AI

A new research paper argues that AI agents are mathematically destined to fail, challenging the hype from big tech companies. While the industry remains optimistic, the study suggests full automation by generative AI may never happen. Published in early 2026, it casts doubt on promises for transformative AI in daily life.

A TechRadar article explores how a single Nvidia GB10 AI system might automate data collection and reporting roles in businesses. This could lead to significant job losses and transform the workforce. The piece, published on February 7, 2026, examines the potential implications of treating such AI as an employee.

Reported by AI

Greg Kroah-Hartman, maintainer of the Linux kernel, stated that AI-driven code review tools have become genuinely useful. He told The Register that the technology reached an inflection point about a month ago, leading to actionable bug reports.
