The Coe Lab
← Back to Blog

AI Solves 80-Year-Old Math Problem: When Machines Outthink Mathematicians

May 25, 20266 min read

OpenAI's reasoning model autonomously cracked a geometry problem that stumped mathematicians for eight decades—proving AI can now make genuine mathematical discoveries.

On May 20, 2026, something happened that mathematicians once thought impossible: an AI system independently solved a famous open problem that had eluded the world's best minds for 80 years.

OpenAI announced that its internal reasoning model had disproved a central conjecture in discrete geometry—the planar unit distance problem first posed by Hungarian mathematician Paul Erdős in 1946. The AI didn't just assist human researchers. It produced the proof entirely on its own, using a single prompt.

The Problem That Stumped Generations

Erdős's question sounds deceptively simple: if you place n points on a plane, how many pairs can be exactly distance 1 apart? You can think of it as arranging dots so that as many pairs as possible are separated by one unit.

Erdős himself proposed a construction using square grids that produced a specific growth rate. He conjectured—and most mathematicians agreed—that this was essentially optimal. No arrangement could do significantly better. For decades, this seemed right. The best known constructions all followed similar patterns.

But no one could prove it. The conjecture sat in the "probably true but unproven" pile that mathematicians sometimes call the "Erdős graveyard"—a collection of problems so fundamental that they feel obvious, yet resist all attempts at proof.

AI Finds What Humans Missed

OpenAI's model did something unexpected: it disproved Erdős's conjecture entirely. The AI found an infinite family of point configurations that yield a polynomial improvement over the square grid—proving that better arrangements exist.

What makes this remarkable isn't just the result. It's how the model got there.

The proof uses algebraic number theory—a field that, on the surface, has nothing to do with arranging points on a plane. The model pulled techniques from "infinite class field towers" and "Golod–Shafarevich theory," concepts that even professional mathematicians rarely encounter together.

Fields medalist Tim Gowers called it "a milestone in AI mathematics." Number theorist Arul Shankar went further: "This paper demonstrates that current AI models go beyond just helpers to human mathematicians—they are capable of having original ingenious ideas, and then carrying them out to fruition."

A Single Prompt, A 125-Page Proof

The model wasn't specifically trained for math problems. It's a general-purpose reasoning system that OpenAI was testing on a collection of Erdős problems. The prompt was simply: could Erdős's conjecture be true or false?

The response: a 125-page chain of thought that constructed the counterexample, proved it worked, and connected ideas that no one in the discrete geometry community had thought to combine. Princeton professor Will Sawin has already refined the proof to give an explicit improvement factor (δ = 0.014)—meaning the new construction produces about 1.3% more unit-distance pairs than Erdős's grid.

Why This Matters Beyond Math

This isn't about one problem. It's about what the result represents: AI making genuine research contributions.

Mathematics is an ideal testbed for reasoning. Proofs either work or they don't. A 125-page argument only holds if every step is valid. The model's proof was verified by external mathematicians—researchers not affiliated with OpenAI, including Daniel Litt at the University of Toronto, who called it "the first result produced autonomously by an AI that I find interesting in itself."

OpenAI's point is broader: if a model can hold together a complex mathematical argument and find unexpected connections across distant areas of knowledge, those same capabilities matter for biology, physics, materials science, and medicine. AI is moving from "tool that helps with research" to "research partner that can discover."

The Bigger Picture

May 2026 has been called "the biggest month in AI history." OpenAI filed for a $1 trillion IPO. Anthropic posted its first operating profit. Google's Gemini 3.5 Flash launched across billions of devices.

But the mathematics breakthrough might matter most in the long run. It's not about benchmarks or revenue. It's about a machine producing something that's genuinely new—an insight that expands human knowledge, not just retrieves or synthesizes what's already known.

As Thomas Bloom, one of the mathematicians who reviewed the proof, wrote: "AI is helping us to more fully explore the cathedral of mathematics we have built over the centuries; what other unseen wonders are waiting in the wings?"

The question isn't whether AI will replace mathematicians. The question is: what else will it figure out that we've been missing?

AI Solves 80-Year-Old Math Problem: When Machines Outthink Mathematicians | The Coe Lab