Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

April 29, 2026·2h 13m·7-min readnarrativescience
Podzilla Summary (7 min)
TLDR
This podcast episode is a technical deep-dive lecture on the infrastructure and economics of AI inference, focusing on how model architecture, hardware constraints, and batching strategies determine latency, cost, and scalability. It lands by revealing the hidden engineering trade-offs be

Sign up to read the full summary

Free AI-powered recaps of every episode, delivered to your inbox.

Get Free Summaries →

Free forever for up to 3 podcasts. No credit card required.

Guest

R

Reiner Pope

narrative on Dwarkesh Podcast

Mentions

Books, people, and resources mentioned in this episode.

People

Reiner PopeDwarkeshClark

Companies

MaddoxJane Street

Research & Papers

unified laws for routed language modelsrev nets

Topics

Artificial IntelligenceData ScienceDeep LearningLarge Language ModelsMachine LearningSemiconductorsTech Industry

Listen to This Episode

Get summaries like this every morning.

Free AI-powered recaps of Dwarkesh Podcast and your other favorite podcasts, delivered to your inbox.

Get Free Summaries →

Free forever for up to 3 podcasts. No credit card required.