Terence Tao – Kepler, Newton, and the true nature of mathematical discovery
- 01The Scientific Method Is Being Inverted by AI and Big Data
- 02AI Excels at Breadth; Humans Still Excel at Depth
- 03The Real Bottleneck Now Is Verification, Not Idea Generation
Podcast: Dwarkesh | Participants: Dwarkesh Patel, Terence Tao
1. Key Themes
The Scientific Method Is Being Inverted by AI and Big Data
The classical scientific method — form hypothesis, then collect data — is being reversed. Now data comes first, and hypotheses emerge from it. AI has accelerated this inversion by making idea generation essentially free, shifting the bottleneck to verification and validation.
"Now with machine learning and data analysis and statistics and so on, you can start with data and through say statistics work out laws that were not present before... Now it's almost reverse. You collect big data first, and then you try to get hypotheses from it." — Terence Tao 00:07:58
"AI has basically driven the cost of idea generation down to almost zero. In a very similar way to how the internet drove the cost of communication down to almost zero. Which is an amazing thing, but it doesn't make, it doesn't create abundance by itself. Now the bottleneck is different." — Terence Tao 00:12:17
AI Excels at Breadth; Humans Still Excel at Depth — and the Future Is Complementary
AI can sweep entire problem spaces, solving low-hanging fruit at scale. But it lacks the cumulative, adaptive, depth-first intelligence that humans apply to hard problems. The future of math and science is redesigning workflows to exploit both.
"They excel at breadth and humans excel at depth and human experts at least. So I think they're very complementary, but our current way of doing math and science is focused on depth because that's where the human expertise is because humans can't do breadth. But yeah, so we have to redesign the way we do science to take full advantage of this breadth capability that we now have." — Terence Tao 00:35:21
"We can explore entire new fields of science by first getting these broad, moderately competent AI to sort of map it out and clear out all the easy, make all the easy observations and then identify certain islands of difficulty which, you know, then human experts can come and work on." — Terence Tao 00:35:50
The Real Bottleneck Now Is Verification, Not Idea Generation
Peer review and scientific institutions were built to filter a trickle of ideas. AI has turned that into a flood. The infrastructure for validating ideas — at scale, reliably, quickly — simply doesn't exist yet and is the defining challenge of the coming era.
"We're now in a situation where suddenly people can generate thousands of theories for a given scientific problem. And now we have to verify them, evaluate them. And this is something which we have to change our structures of science to actually sort this out." — Terence Tao 00:12:45
"Many journals are reporting AI general submissions are just flooding their submissions. So it's great that we can generate all kinds of things now with AI, but it means that we have to, the rest of the aspects of science have to catch up." — Terence Tao 00:13:38
2. Contrarian Perspectives
Partial Progress Is More Valuable Than We Recognize — and AI Is Bad at It
Conventional wisdom celebrates solved problems. But Tao argues that the ability to build on partial progress — climbing partway up a wall and pulling others up — is actually the core engine of science. AI can jump high but can't hold a handhold and build from it.
"What they can't do is they jump a little bit and they reach some handhold but then they stay there and pull other people up and then they jump from there. There isn't this cumulative process which is sort of built up interactively. It seems to be a lot more trial and error and just repetition brute force." — Terence Tao 00:50:36
The "Correct" Theory Is Often Initially Worse Than the Wrong One
At the moment of discovery, a new correct theory frequently looks less accurate and less useful than the entrenched incorrect theory it will eventually replace. This makes it nearly impossible to build an automated peer review system based on accuracy alone.
"The ultimately correct theory initially is worse in many ways. Copernicus's theory of the planets, it was less accurate than Ptolemy's theory... It was only Kepler that made it more accurate than Ptolemy's theory." — Terence Tao 00:18:48
"Often progress has been made not by adding more theories, but by deleting some assumptions that you have in your mind." — Terence Tao 00:19:37
AI's 1-2% Success Rate on Math Problems Is Being Misread as a Trend
The media narrative around AI solving decades-old math problems is systematically misleading. The wins are cherry-picked from massive sweeps. On any given problem, success rates are very low — and the impressive-looking results are a consequence of scale, not capability.
"Whenever we do a systematic study any given problem an AI tool has a success rate of maybe 1 or 2%. It's just that they can buy a scale and if you just pick the winners it looks great." — Terence Tao 00:44:33
"There'll be a lot of noise amongst the signal of sort of when they're working when they're not. We have to do it's increasingly important to collect these really standardized data sets... and not just rely on the AI companies to only publish their wins and not disclose their negative results." — Terence Tao 00:45:32
Serendipity Is a Feature, Not a Bug — And We're Optimizing It Away
Tao argues that unscheduled, inefficient, "wasteful" time is scientifically productive in ways that resist quantification. Over-optimization — including through AI — may destroy the randomness that generates breakthroughs.
"With COVID, for example, we switched a lot to remote meetings and so everything was scheduled now... What we lost out on was the casual, knocking on the hallway, just meeting someone while getting a coffee. And there's serendipitous interactions that you may think are not optimal, but actually are really important." — Terence Tao 00:15:30
"Maybe there's a danger actually that in the modern society, it's not just AI, but we've become really good at optimizing everything. And maybe we're not optimizing a lot of optimization." — Terence Tao 00:14:29
3. Companies Identified
Jane Street
Description: Quantitative trading and financial firm
Why Mentioned: Created a sophisticated AI/ML puzzle for Dwarkesh's audience — shuffling 96 layers of a trained neural network (AriseNet) and challenging solvers to reconstruct the order. Also mentioned for an unsolved backdoor LLM puzzle. Signals that Jane Street is doing cutting-edge research at the intersection of AI interpretability and quantitative methods.
"Jane Street trained AriseNet and then shuffled all 96 layers and then challenged people to put them back in the right order using only the model's outputs and training data. You can't brute force this. There's more possible orderings than atoms in the universe." — Dwarkesh Patel 00:27:35
Mercury
Description: Fintech banking platform for startups and entrepreneurs
Why Mentioned: Sponsor mention by Dwarkesh, who has personally banked with them since 2023. Highlighted for continuously shipping new features (e.g., "Insights" — an intelligent cash flow summarizer).
"Mercury is constantly updating things and adding new features. Take their newest feature, Insights. Insights summarizes your money in and out, showing you your biggest transactions and calling out anything that deserves extra attention." — Dwarkesh Patel 00:48:50
LabelBox
Description: AI data labeling and model evaluation platform
Why Mentioned: Sponsor mention, highlighted for building rubrics that measure model quality beyond simple correctness — including whether models reach for the right tools, check their own work, and explore alternative paths.
"These rubrics go beyond simple correctness. Did the model reach for the right tools? Did it check its own work and explore alternative paths?" — Dwarkesh Patel 00:52:22
4. People Identified
Tycho Brahe
Description: 16th-century Danish astronomer; last of the naked-eye astronomers
Why Mentioned: Tao makes the non-obvious point that Brahe deserves far more credit than he gets. His obsessive, decades-long, high-precision data collection was the essential substrate for Kepler's discoveries — analogous to the role of big data in modern science.
"We should also celebrate Brahe for his assiduous data collection, which was ten times more precise than any previous observation. And that extra decimal point of accuracy was actually essential for Kepler to get his results." — Terence Tao 00:06:41
Johannes Bode (Bordet as referred to in transcript)
Description: 18th-century astronomer who formalized the Titius-Bode Law
Why Mentioned: Cited as a cautionary tale about overfitting to small data sets. His pattern correctly predicted Uranus and Ceres, creating enormous excitement, but failed completely for Neptune — illustrating that empirical regularities from sparse data are dangerously unreliable.
"People got really excited that Bode had discovered this amazing new law of nature. But then Neptune was discovered and it was completely like way off. And basically, it was just a numerical fluke." — Terence Tao 00:11:05
Darwin
Description: 19th-century naturalist, author of On the Origin of Species
Why Mentioned: Highlighted as an underrated science communicator whose persuasive writing style — plain English, no equations, compelling narrative — was as important as the theory itself in driving scientific adoption.
"He spoke in plain English, didn't use equations, and he synthesized a lot of disparate facts... his writing style was persuasive, and that helped a lot." — Terence Tao 00:23:25
5. Operating Insights
Commit to Writing Down Everything You Learn, Not Just What You Produce
Tao describes how early in his career he would deeply understand something, then lose access to that understanding months later. His solution — a public blog — forces him to externalize learning in a way that sticks and compounds over time.
"The first few times it was so frustrating to have understood something and then lost it. I should always write down anything cool that I've learned. And this is part of how this blog came about." — Terence Tao 01:12:08
Use AI to Eliminate Secondary Tasks, Not to Replace Core Work
Tao's personal workflow with AI is precise: he delegates formatting, literature searches, plot generation, and code reformatting to AI — but still uses pen and paper for the hardest conceptual work. The result is richer outputs without trading away depth.
"The core of what I do solving the most difficult part of a math problem that hasn't changed too much. I still use pen and paper for that. But there's lots of silly things... they really sped up lots of secondary tasks. They haven't yet sped up the core thing that I do but it's allowed me to add more things to my papers." — Terence Tao 00:48:22
Protect Unscheduled Time as a Source of High-Value Signal
At the organizational level, Tao's insight applies directly to how leaders should structure their calendars and those of their teams. Fully scheduled days optimize for known outputs at the cost of serendipitous discoveries.
"I do believe a lot in serendipity... I have been willing to sort of leave some portions just, okay, I'm going to do something which is not my usual thing and maybe it will be a waste of my time, but maybe I will learn something. And more often than not, I feel like I've gotten a positive experience, which is not something I would have planned for." — Terence Tao 01:13:59
6. Overlooked Insights
Astronomy PhDs Are Sought After by Quant Hedge Funds — and This Reveals a Broader Alpha in Signal Extraction
Tao mentions almost in passing that quant hedge funds actively prefer to hire astronomy PhDs. This is significant: astronomy is uniquely disciplined at extracting high-confidence conclusions from extremely sparse, noisy data — a skill that transfers directly to alpha generation in financial markets. This suggests that any domain that has historically faced severe data scarcity (astro, geology, archaeology) has developed methodological tools that are undervalued and broadly transferable.
"I hear that a lot of quant hedge funds, they're preferred hires in astronomy PhD. They also are very interested for other reasons in extracting signals from various random bits of data." — Terence Tao 00:27:22
A Formal Language for Mathematical Strategy — Not Just Proof — Could Be the Most Leveraged Research Investment in AI for Science
Tao briefly floats the idea of a "semi-formal language for mathematical strategies" as distinct from formal proof languages like Lean. This is treated as a wish, almost an aside — but it is actually a foundational missing layer. Lean can verify deductive steps. But there is no formal system for representing plausibility, strategy, or conjecture quality. Whoever builds this layer unlocks the ability to train AI on the full scientific reasoning process, not just its outputs. This is arguably more important than any improvement to LLM capability.
"Just seeing how successful having a formal framework in place like Lean has made deductive proofs so much easier to automate and train AI on. If there was some similar framework... the bottleneck for using AI to create strategies and make conjectures is we have to rely on human experts to validate whether something's plausible or not." — Terence Tao 01:01:01