
In this episode of Agents at Work, Jordi Montes sits down with Tom Shapland , product manager at LiveKit, the open-source platform powering real-time audio/video. They power the voice agents behind ChatGPT’s audio interface.They dive into:How a LiveKit side project became the voice pipeline for ChatGPTThe rise of cascaded pipelines vs. audio-to-audio agentsWhat makes voice turn-taking so tricky (and how to fix it)The role of tone, latency, and emotion in building natural-sounding AIWhy voice agents are more than just a feature—they’re the future interfaceTom shares his journey from building agtech startups to surfing the wave of voice AI infrastructure. He unpacks what it really means to bring machines closer to humans, and why we’re entering a golden age of ambient, always-on, emotionally aware assistants.Whether you’re an engineer, product manager, or just someone dreaming of yelling at your printer and getting a helpful response this episode is a must-listen.Try it at https://livekit.io
Podzilla Summary coming soon
Sign up to get notified when the full AI-powered summary is ready.
Free forever for up to 3 podcasts. No credit card required.

Agents at work 21: Your next co-founder is an AI agent w/ Ben (Polsia)

Agents at Work 20: “Bash Is All Your Agent Needs” w/ Yunfan (Yutori)

Agents at Work 19: Unlocking Enterprise Data Access Through Integrations w/ Gil Feig (Merge)

Agents at work 18: AI Translation at Enterprise Scale w/ Olga Beregovaya (Smartling)
Free AI-powered recaps of Agents at work and your other favorite podcasts, delivered to your inbox.
Free forever for up to 3 podcasts. No credit card required.