Amateur mathematicians solve Erdős problems with AI assistance

January 16, 2026

由 AI 报道

Amateur mathematicians have stunned professionals by using AI tools like ChatGPT to tackle long-standing problems posed by Paul Erdős. While most solutions rediscover existing results, one new proof highlights AI's potential to transform mathematical research. Experts see this as an early step toward broader applications in the field.

Paul Erdős, a renowned Hungarian mathematician, left behind over 1,000 unsolved problems upon his death in 1996, covering areas from combinatorics to number theory. These problems, often simple to state but difficult to solve, serve as benchmarks for progress in mathematics. Thomas Bloom at the University of Manchester maintains a website tracking these challenges.

Starting in October 2023, enthusiasts began feeding Erdős problems into AI chatbots like ChatGPT. Initially used to locate relevant literature, the tools soon generated partial improvements and proofs. Undergraduate Kevin Barreto at Cambridge University and amateur Liam Price targeted problem 728, a number theory conjecture. Using ChatGPT-5.2 Pro, they obtained a sophisticated argument, which they verified with Aristotle, an AI from Harmonic that translates proofs into the Lean programming language for automated checking.

By mid-January 2024, AI had fully resolved six Erdős problems, though professionals later found five had prior solutions in the literature. Barreto and Price's work on problem 205 stands as the sole novel resolution. Additionally, AI contributed fresh partial solutions or enhancements to seven others, some linking to overlooked papers.

This raises questions about novelty versus rediscovery. Bloom notes that AI often reformulates problems to uncover hidden connections: “A lot of these papers, I wouldn’t have found... without this sort of [use of] the AI tool.” Barreto acknowledges the problems as relatively straightforward, predicting tougher ones, including those with prizes, remain beyond current AI capabilities.

Kevin Buzzard at Imperial College London calls it “green shoots” of progress, not yet a threat to experts. Terence Tao at UCLA suggests AI could enable a more empirical approach: “We don’t do large-scale mathematics because we don’t have the intellectual resources, but AI is showing that you can.” Bloom envisions expanded research breadth, allowing mathematicians to draw instantly from unfamiliar fields without extensive learning.

Illustration depicting OpenAI's ChatGPT-5.2 launch, showing professionals using the AI to enhance workplace productivity amid rivalry with Google's Gemini.

OpenAI releases ChatGPT-5.2 to boost work productivity

December 11, 2025 由 AI 报道 AI 生成的图像

OpenAI has launched ChatGPT-5.2, a new family of AI models designed to enhance reasoning and productivity, particularly for professional tasks. The release follows an internal alert from CEO Sam Altman about competition from Google's Gemini 3. The update includes three variants aimed at different user needs, starting with paid subscribers.

AI boosts scientific productivity but erodes paper quality

A Cornell University study reveals that AI tools like ChatGPT have increased researchers' paper output by up to 50%, particularly benefiting non-native English speakers. However, this surge in polished manuscripts is complicating peer review and funding decisions, as many lack substantial scientific value. The findings highlight a shift in global research dynamics and call for updated policies on AI use in academia.

AI models surpass cutoff scores in Chile's PAES 2026 test

January 08, 2026 由 AI 报道

A study applying Chile's university entrance exam, PAES 2026, to AI models shows several systems scoring high enough for selective programs like Medicine and Civil Engineering. Google's Gemini led with averages near 950 points, outperforming rivals like ChatGPT. The experiment underscores AI progress and raises questions about standardized testing efficacy.

亚洲

AI公司准备投放广告，操纵威胁浮现

科学

New bridge links infinity math to computer science

亚洲

AI 代理在 2025 年到来

AI chatbots fail on 60 percent of urgent women's health queries

Commonly used AI models, including ChatGPT and Gemini, often fail to provide adequate advice for urgent women's health issues, according to a new benchmark test. Researchers found that 60 percent of responses to specialized queries were insufficient, highlighting biases in AI training data. The study calls for improved medical content to address these gaps.

Limited cheating in chess dramatically increases win chances

January 16, 2026 由 AI 报道

A new study reveals that using computer advice for just three moves in a chess game can boost a player's victory odds from 51 percent to 84 percent. Researcher Daniel Keren simulated thousands of matches to demonstrate how selective cheating evades detection. The findings highlight vulnerabilities in online chess platforms' anti-cheating measures.

Linus Torvalds uses AI for personal coding project

Linus Torvalds, the creator of Linux, has begun experimenting with AI-assisted 'vibe coding' for a personal underwater audio tool. While known as an AI skeptic, he employed the technology to overcome unfamiliarity with Python. This marks a cautious embrace of AI in non-critical software development.

Linus Torvalds dismisses AI code rules in Linux kernel debate

January 11, 2026 由 AI 报道

Linus Torvalds, creator of the Linux kernel, has criticized efforts to create rules for AI-generated code submissions, calling them pointless. In a recent email, he argued that such policies would not deter malicious contributors and urged focus on code quality instead. This stance highlights ongoing tensions in open-source development over artificial intelligence tools.

January 28, 2026 11:16

Amateur mathematicians solve Erdős problems with AI assistance

相关文章

OpenAI releases ChatGPT-5.2 to boost work productivity

AI boosts scientific productivity but erodes paper quality

AI models surpass cutoff scores in Chile's PAES 2026 test

AI公司准备投放广告，操纵威胁浮现

New bridge links infinity math to computer science

AI 代理在 2025 年到来

AI chatbots fail on 60 percent of urgent women's health queries

Limited cheating in chess dramatically increases win chances

Linus Torvalds uses AI for personal coding project

Linus Torvalds dismisses AI code rules in Linux kernel debate

中国AI在美数学奥林匹克几何竞赛中再创佳绩

Research paper questions viability of AI agents

Google's Gemini outperforms ChatGPT in key AI tests

OpenAI introduces ChatGPT Jobs tool for job seekers

Hampuz Nyström found love with AI service Eve

How AI coding agents function and their limitations

Duke ai uncovers simple rules in complex systems

New Scientist sets precedent for UK FOI on AI chatbot use

Study suggests brain-inspired algorithms to cut AI energy use

AI embeds deeply in Linux kernel workflows

此网站使用 cookie