Programming Throwdown

180: Reinforcement Learning

March 17, 2025·1h 52m

Episode Description from the Publisher

Patrick and Jason introduce reinforcement learning and place it alongside supervised and unsupervised learning. They cover Q-learning, SARSA, policy gradients, actor-critic methods, PPO, imitation learning, and why training and evaluating RL systems is so challenging.

More from Programming Throwdown

187: Agentic Coding

May 2, 2026·1h 38m

186: Becoming a Manager

February 3, 2026·1h 27m

185: Workflow Orchestrators

November 4, 2025·1h 32m

184: Asynchronous Programming

September 23, 2025·1h 30m
