LWiAI Podcast #243 - GPT 5.5, DeepSeek V4, AI safety sabotage

Last Week in AI / 5/4/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The episode covers recent developments related to GPT 5.5 and DeepSeek V4, highlighting the latest model progress and competitive momentum in the AI landscape.
  • It discusses concerns about AI safety sabotage, focusing on how malicious actions or misuse can undermine safety efforts.
  • The podcast frames these topics as part of broader trends in AI capabilities and risk management rather than treating them as isolated news items.
  • It provides a timely roundup-style analysis of what these updates could mean for practitioners building and deploying AI systems.
Last Week in AI
Podcast
LWiAI Podcast #243 - GPT 5.5, DeepSeek V4, AI safety sabotage
6
1
0:00
Current time: 0:00 / Total time: -1:52:22
-1:52:22
Audio playback is not supported on your browser. Please upgrade.

LWiAI Podcast #243 - GPT 5.5, DeepSeek V4, AI safety sabotage

Last Week in AI's avatar
May 04, 2026
6
1
Share
Transcript

Our 243rd episode with a summary and discussion of last week’s big AI news!

Recorded on 04/29/2026

Hosted by Andrey Kurenkov and Jeremie Harris

Feel free to email us your questions and feedback at andreyvkurenkov@gmail.com and/or hello@gladstone.ai

In this episode:

  • OpenAI released GPT-5.5 with strong coding-oriented improvements, a system card discussing chain-of-thought monitorability and misalignment testing, higher pricing than GPT-5.4, and notable quirks like a system-prompt warning about “goblins.”

  • xAI launched Grok Voice Think Fast 1.0, claiming large benchmark leads for real-time voice agents and reporting major Starlink customer-support automation and sales conversion impact.

  • DeepSeek open-sourced DeepSeek V4 (Pro and Flash) featuring MoE scaling and 1M-token context via hybrid/compressed attention changes, while Tencent released Hunyuan 3 preview with weaker benchmark performance; a new long-horizon agent benchmark (Clawmark) shows low task success rates.

  • Major business, legal, and policy updates include Google’s planned up-to-$40B investment and 5GW compute commitment to Anthropic, Meta’s AWS Gravitron deal and China blocking Meta’s Manus acquisition, a revamped OpenAI–Microsoft agreement, ongoing Musk–OpenAI trial developments, and new safety/security research on sabotage, document degradation under delegation, and bit-flip attacks.

Timestamps:

  • (00:00:10) Intro / Banter

  • (00:02:00) News Preview

  • (00:02:26) Response to listener comments

Discussion about this episode

CommentsRestacks
User's avatar