A Longitudinal, Multinational, and Multilingual Corpus of News Coverage of the Russo-Ukrainian War

arXiv cs.CL / 3/16/2026

Key Points

  • The DNIPRO corpus collects 246K news articles about the Russo-Ukrainian war (Feb 2022 – Aug 2024) across eleven outlets in five nation-states and three languages.
  • It includes metadata and human-evaluated annotations for stance, sentiment, and topical framing to enable systematic analysis of competing geopolitical narratives.
  • The dataset supports empirical studies of narrative evolution, cross-lingual information flow, and detection of implicit contradictions in fragmented information ecosystems.
  • Exploratory findings show outlets construct divergent attributions and topical selections, illustrating narrative divergence without directly refuting opposing narratives.

Abstract

We present DNIPRO, a corpus of 246K news articles from the Russo-Ukrainian war (Feb 2022 – Aug 2024) spanning eleven outlets across five nation-states (Russia, Ukraine, U.S., U.K., China) and three languages. The corpus features comprehensive metadata and human-evaluated annotations for stance, sentiment, and topical framing, enabling systematic analysis of competing geopolitical narratives. It is uniquely suited for empirical studies of narrative divergence, media framing, and information warfare. Our exploratory analyses reveal how media outlets construct incompatible realities through divergent attribution and topical selection without direct refutation of opposing narratives. DNIPRO empowers empirical research on narrative evolution, cross-lingual information flow, and computational detection of implicit contradictions in fragmented information ecosystems.