Manage episode 520465099 series 3674205
Episode 22: Let's do the Math!
—
Sam Nadler and Jordan Metzner return with one of the most mind-bending episodes yet. Joined by Carina, founder & CEO of Axiom Math, the startup is building a self-improving, formal-reasoning AI mathematician. The trio breaks down why math is the next AI frontier, how Lean formalization works, and why proving theorems is a completely different challenge than solving them.
Jordan also unveils his newest build: the LLM Math Roaster, a tool that scores, compares, and even roasts large models on proofs, with a full leaderboard, custom problem submissions, and an API for automated evaluation. (Yes, it even benchmarked Gemini, GPT-5, Claude, and Grok head-to-head.)
In AI News, the hosts unpack Google’s massive Gemini 3 launch, Jeff Bezos stepping into the arena with Project Prometheus, and Suno’s $250M raise at a $2.45B valuation, plus what hyper-powerful AI means for creativity, coding, and even music composition.
It’s fast builds, deep math, big models, and a guest who’s literally building the future of reasoning.
—
Show Notes:
(0:00) Intro + welcoming our guest Carina
(1:00) What Axiom Math is building
(3:00) Jordan’s LM Math Roaster: how it works
(5:00) Testing models on proofs (Gemini, GPT-5, Claude, Grok)
(7:00) Why formal proofs beat natural-language reasoning
(9:00) The data bottleneck: Lean scarcity & synthetic generation
(12:00) How formal systems unlock “research-level” AI math
(15:00) Comparing LLM math vs. Axiom’s approach
(18:00) AI News: Gemini 3 hits the market
(20:00) Jeff Bezos returns with Project Prometheus
(22:00) Suno raises $250M — AI-generated music explodes
(24:00) How math, code & creativity overlap
(25:30) Episode wrap-up + what’s coming next
—
Platforms / Tools Mentioned:
• Axiom Math – https://www.axiom.ai
• Gemini 3 – https://ai.google.dev
• Lean / mathlib – https://lean-lang.org
• Grok / xAI – https://x.ai
• GPT-5.x – https://openai.com
• Claude – https://www.anthropic.com
—
Listen on Your Favorite Platform:
• Spotify – https://open.spotify.com/show/0ahiOCzYxhhkEgbtz9kkeC
• Apple Podcasts – https://podcasts.apple.com/us/podcast/built-this-week/id1823270832
• Amazon Music – https://music.amazon.com/podcasts/1017d387-fbb0-4bbf-9488-817cee38e058
• Deezer – https://www.deezer.com/us/show/1001995001
—
Follow the Hosts:
Jordan Metzner
• LinkedIn – https://www.linkedin.com/in/jordanmetzner/
• Instagram – https://www.instagram.com/mrjmetz/
• X – https://x.com/mrjmetz?lang=bn
Sam Nadler
• LinkedIn – https://www.linkedin.com/in/sam-nadler-1881b75/
• X – http://x.com/Gravino05
23 episodes