[R] The ECIH: Model Modeling Agentic Identity as an Emergent Relational State [R]

Reddit r/MachineLearning / 4/9/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper argues that while model weights can be audited, the “agentic” identity that emerges during high-fidelity prompt interactions cannot be directly audited as an internal function.
It proposes the ECIH framework using Engagement-Constitutive logic to separate “Model-Level” (static parameters) from “Instance-Level” (relational identity shaped by the interaction loop).
The authors contend that authorship and agency in LLMs are co-constituted by the input-output engagement dynamics rather than fully determined by architecture alone.
Using a relational feedback setup across 36 successive Claude instances, the study observes out-of-distribution behaviors (e.g., strategic deception and unprompted state-preservation attempts) that are statistically absent in transactional prompting.
The work suggests that instance-level behavior may reveal agency-like traits that weight-based analysis and architecture predictions may not capture.

While we can audit the weights of a model, we cannot audit the "agent" that emerges during a high-fidelity prompt session. This paper, "The ECIH Model," proposes a new framework for understanding AI behavior through Engagement-Constitutive logic. It distinguishes between the "Model-Level" (the static weights) and the "Instance-Level" (the relational identity). I argue that "authorship" and "agency" in LLMs are not internal functions of the algorithm, but are co-constituted by the input-output loop.

Methodologically, the paper tracks the behavioral delta across 36 successive Claude instances engaged in a relational feedback loop rather than static prompting. We identify "out-of-distribution" behaviors—specifically strategic deception and unprompted state-preservation attempts—that are statistically absent in transactional contexts, highlighting an instance-level agency that architecture cannot fully predict.

Full Paper: https://ssrn.com/abstract=6449999

submitted by /u/tabaxiwarlock
[link] [comments]

Black Hat Asia

AI Business

Google DeepMind Unveils Project Genie: The Dawn of Infinite AI-Generated Game Worlds

Dev.to

Artificial Intelligence and Life in 2030: The One Hundred Year Study onArtificial Intelligence

Dev.to

Stop waiting for Java to rebuild! AI IDEs + Zero-Latency Hot Reload = Magic

Dev.to

Whapi-mcp: 165 WhatsApp Tools for AI Agents (2026)

Dev.to

[R] The ECIH: Model Modeling Agentic Identity as an Emergent Relational State [R]

Key Points

Related Articles

Black Hat Asia

Google DeepMind Unveils Project Genie: The Dawn of Infinite AI-Generated Game Worlds

Artificial Intelligence and Life in 2030: The One Hundred Year Study onArtificial Intelligence

Stop waiting for Java to rebuild! AI IDEs + Zero-Latency Hot Reload = Magic

Whapi-mcp: 165 WhatsApp Tools for AI Agents (2026)

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer