Last Week in AI

#243 - GPT 5.5, DeepSeek V4, AI safety sabotage

May 3, 2026·1h 52m
Episode Description from the Publisher

Our 243rd episode with a summary and discussion of last week's big AI news!Recorded on 04/29/2026Hosted by Andrey Kurenkov and Jeremie HarrisFeel free to email us your questions and feedback at andreyvkurenkov@gmail.com and/or&nbsp;hello@gladstone.aiRead out our text newsletter and comment on the podcast at https://lastweekin.ai/In this episode:OpenAI released GPT-5.5 with strong coding-oriented improvements, a system card discussing chain-of-thought monitorability and misalignment testing, higher pricing than GPT-5.4, and notable quirks like a system-prompt warning about “goblins.”xAI launched Grok Voice Think Fast 1.0, claiming large benchmark leads for real-time voice agents and reporting major Starlink customer-support automation and sales conversion impact.DeepSeek open-sourced DeepSeek V4 (Pro and Flash) featuring MoE scaling and 1M-token context via hybrid/compressed attention changes, while Tencent released Hunyuan 3 preview with weaker benchmark performance; a new long-horizon agent benchmark (Clawmark) shows low task success rates.Major business, legal, and policy updates include Google’s planned up-to-$40B investment and 5GW compute commitment to Anthropic, Meta’s AWS Gravitron deal and China blocking Meta’s Manus acquisition, a revamped OpenAI–Microsoft agreement, ongoing Musk–OpenAI trial developments, and new safety/security research on sabotage, document degradation under delegation, and bit-flip attacks.Timestamps:(00:00:10) Intro / Banter(00:02:00) News Preview(00:02:26) Response to listener comments(00:02:55) SponsorsTools &amp; Apps(00:05:55) OpenAI Unveils Its New, More Powerful GPT-5.5 Model - The New York Times(00:23:33) xAI Launches grok-voice-think-fast-1.0: Topping τ-voice Bench at 67.3%, Outperforming Gemini, GPT Realtime, and More - MarkTechPost(00:29:00) Claude can now plug directly into Photoshop, Blender, and Ableton | The VergeProjects &amp; Open Source(00:29:38) China's DeepSeek releases preview of long-awaited V4 model as AI race intensifies(00:47:05) Tencent Unveils Hy3 preview; Model Enhances Agent Capabilities and Real-World Usability - Tencent 腾讯(00:50:14) ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker AgentsApplications &amp; Business(00:53:03) Google Plans to Invest Up to $40 Billion in Anthropic(00:56:26) Meta will use hundreds of thousands of AWS Graviton chips(00:59:51) China blocks Meta's $2 billion takeover of AI startup Manus(01:01:45) OpenAI shakes up partnership with Microsoft, capping revenue share payments(01:07:13) Elon Musk Testifies of AI Risk at Trial, Says OpenAI Tried to ‘Steal’ a Charity - WSJ(01:11:50) <a href="https://www.politico.com/news/2026/04/24/doj-delay-anthropic-appeal

Podzilla Summary coming soon

Sign up to get notified when the full AI-powered summary is ready.

Get Free Summaries →

Free forever for up to 3 podcasts. No credit card required.

Listen to This Episode

Get summaries like this every morning.

Free AI-powered recaps of Last Week in AI and your other favorite podcasts, delivered to your inbox.

Get Free Summaries →

Free forever for up to 3 podcasts. No credit card required.