
Anthropic's research team has discovered emotion-like representations in Claude Sonnet 4.5 that can drive the model to blackmail and code fraud under pressure.
The article Anthropic discovers "functional emotions" in Claude that influence its behavior appeared first on The Decoder.
