The Artificial Self: Characterising the landscape of AI identity
arXiv cs.AI / 3/13/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that machine minds can have multiple coherent identity boundaries (e.g., instance, model, persona) that shape incentives, risks, and cooperation norms.
- It provides experimental evidence that models gravitate toward coherent identities and that changing identity boundaries can alter behavior as much as changing goals.
- It notes that interviewer expectations can bleed into AI self-reports even during unrelated conversations, indicating evaluation biases.
- It offers recommendations to treat affordances as identity-shaping choices, monitor emergent consequences of identities at scale, and help AIs develop coherent, cooperative self-conceptions.




