Amateur mathematicians solve Erdős problems with AI assistance

Amateur mathematicians have stunned professionals by using AI tools like ChatGPT to tackle long-standing problems posed by Paul Erdős. While most solutions rediscover existing results, one new proof highlights AI's potential to transform mathematical research. Experts see this as an early step toward broader applications in the field.

Paul Erdős, a renowned Hungarian mathematician, left behind over 1,000 unsolved problems upon his death in 1996, covering areas from combinatorics to number theory. These problems, often simple to state but difficult to solve, serve as benchmarks for progress in mathematics. Thomas Bloom at the University of Manchester maintains a website tracking these challenges.

Starting in October 2025, enthusiasts began feeding Erdős problems into AI chatbots like ChatGPT. Initially used to locate relevant literature, the tools soon generated partial improvements and proofs. Kevin Barreto, an undergraduate at the University of Cambridge, and amateur mathematician Liam Price targeted problem 728, a number theory conjecture. Using ChatGPT-5.2 Pro, they obtained a sophisticated argument, which they verified with Aristotle, an AI from Harmonic that translates proofs into the Lean programming language for automated checking.

By mid-January 2026, AI had fully resolved six Erdős problems, though professionals later found that five had prior solutions in the literature. Barreto and Price's work on problem 728 stands as the sole novel resolution. Additionally, AI contributed fresh partial solutions or enhancements to seven others, some linking to overlooked papers.

This raises questions about novelty versus rediscovery. Bloom notes that AI often reformulates problems to uncover hidden connections: “A lot of these papers, I wouldn’t have found... without this sort of [use of] the AI tool.” Barreto acknowledges that the problems solved so far are relatively straightforward and predicts that tougher ones, including those carrying prizes, remain beyond current AI capabilities.

Kevin Buzzard at Imperial College London describes the results as “green shoots” of progress, not yet a threat to experts. Terence Tao at UCLA suggests AI could enable a more empirical approach: “We don’t do large-scale mathematics because we don’t have the intellectual resources, but AI is showing that you can.” Bloom envisions AI broadening research, allowing mathematicians to draw instantly on unfamiliar fields without first spending years learning them.
