Our evaluation of OpenAI's GPT-5.5 cyber capabilities

Simon Willison's Blog / 5/1/2026

📰 NewsSignals & Early TrendsIndustry & Market MovesModels & Research

Key Points

  • The UK AI Security Institute (AISI) has evaluated OpenAI’s GPT-5.5 for its cyber capabilities, specifically its ability to find security vulnerabilities.
  • GPT-5.5 was found to be comparable to AISI’s earlier evaluation target, Claude Mythos, in terms of vulnerability-finding performance.
  • Unlike Mythos, GPT-5.5 is generally available right now, making these findings more immediately relevant for current deployments.
  • The post highlights that comparative security evaluations can help organizations understand how different frontier LLMs may perform in cybersecurity tasks.
Sponsored by: Sonar — Now with SAST + SCA for secure, dependency-aware Agentic Engineering. SonarQube Advanced Security

30th April 2026 - Link Blog

Our evaluation of OpenAI's GPT-5.5 cyber capabilities. The UK's AI Security Institute previously evaluated Claude Mythos: now they've evaluated GPT-5.5 for finding security vulnerability and found it to be comparable to Mythos, but unlike Mythos it's generally available right now.

Posted 30th April 2026 at 11:03 pm

This is a link post by Simon Willison, posted on 30th April 2026.

ai 1995 openai 416 generative-ai 1768 llms 1734 anthropic 278 claude 272 ai-security-research 16 gpt 124

Monthly briefing

Sponsor me for $10/month and get a curated email digest of the month's most important LLM developments.

Pay me to send you less!

Sponsor & subscribe