Markov reads Pushkin, again: A statistical journey into the poetic world of Evgenij Onegin
arXiv cs.CL / 4/23/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The study uses symbolic time-series analysis and Markov modeling to study the phonological structure of Evgenij Onegin via grapheme vowel/consonant (V/C) encoding, along with one contemporary Italian translation.
- A compact four-state Markov chain is found to be both descriptively accurate and generative, reproducing properties like autocorrelation and memory depth in the V/C sequences.
- The researchers identify an asymmetry between the Russian original and the Italian translation: the Russian shows a gradual decline in memory depth, while the translation stays more uniform.
- To probe why this divergence occurs, they introduce “phonological probes” that connect surface graphemic patterns to narrative-relevant cues across the text.
- Overall, the work suggests that even minimalist Markov models—especially when paired with coarse linguistic annotation—can support exploratory comparative poetics by exposing structural regularities in poetic language.
Related Articles

The anti-AI crowd is giving “real farmers don’t use tractors” energy, and it’s getting old.
Dev.to

Training ChatGPT on Private Data: A Technical Reference
Dev.to

The Rise of Intelligent Software: How AI is Reshaping Modern Product Development
Dev.to

The Anatomy of a Modern AI Marketing Curriculum in 2026 — What It Covers and Why It Matters
Dev.to
AI as a Fascist Artifact
Dev.to