[D] Modeling online discourse escalation as a state machine (dataset + labeling approach)

Reddit r/MachineLearning / 3/23/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • Proposes modeling online discourse escalation as a state machine with per-comment local states and a thread-global state that evolves over time, defining states such as Neutral, Disagreement, Identity Activation, Personalization, Ad Hominem, and Dogpile.
  • Outlines signals and features across linguistic, structural, and contextual dimensions, including pronoun usage shifts, sentiment/insult markers, reply velocity, number of unique responders, thread depth, topic sensitivity, and prior state transitions.
  • Describes a dataset plan to collect threads from public platforms (e.g., Reddit), create a labeled dataset using the state taxonomy, start with manual annotation, and train a baseline classifier transitioning from heuristics to ML models.
  • Introduces a second layer of identity activation (personal, ideological, group) and hypothesizes that simultaneous activation across identities correlates with rapid escalation, while posing questions about framing, per-comment vs sequence modeling, labeling guidelines, and existing datasets.

Hi,

I’ve been working on a framework to model how online discussions escalate into conflict, and I’m exploring whether it can be framed as a classification / sequence modeling problem.

The core idea is to treat discourse as a state machine with observable transitions.

States (proposed)

  1. Neutral (information exchange)
  2. Disagreement
  3. Identity Activation
  4. Personalization
  5. Ad Hominem
  6. Dogpile (multi-user targeting, non-recoverable)

Each comment can be labeled as a local state, while threads also have a global state that evolves over time.

Signals / Features

Some features I’m considering:

  • Linguistic:
    • increase in second-person pronouns (“you”)
    • sentiment shift
    • insult / toxicity markers
  • Structural:
    • number of unique users replying to one user
    • reply velocity (bursts)
    • depth of thread
  • Contextual:
    • topic sensitivity (proxy via keywords)
    • prior state transitions in thread

Additional dimension

I’m also experimenting with a second layer:

  • Personal identity activation
  • Ideological identity activation
  • Group identity activation

The hypothesis is that simultaneous activation of multiple identity layers correlates with rapid escalation.

Dataset plan

  • Collect threads from public platforms (Reddit, etc.)
  • Build a labeled dataset using the state taxonomy above
  • Start with a small manually annotated dataset
  • Train a classifier (baseline: heuristic → ML model)

Questions

  1. Does this framing make sense as a sequence classification / state transition problem?
  2. Would you model this as:
    • per-comment classification, or
    • sequence modeling (e.g., HMM / RNN / transformer over thread)?
  3. Any suggestions on:
    • labeling guidelines to reduce ambiguity between states?
    • existing datasets that approximate this (beyond toxicity classification)?
  4. Would you treat “dogpile” as a class or as an emergent property of the graph structure?
submitted by /u/Inevitable_Back3319
[link] [comments]