From graphemic dependence to lexical structure: a Markovian perspective on Dante's Commedia
arXiv cs.CL / 4/27/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper studies Dante’s Divina Commedia by encoding text into vowel/consonant (V/C) symbols and modeling the resulting sequence as a four-state Markov chain.
- It introduces a “graphemic memory” index that slightly but consistently increases from Inferno to Paradiso, suggesting a directional shift in local dependency structure.
- A trigram-level analysis attributes the trend to a limited set of recurrent configurations (“graphemic probes”) that connect Markov patterns to identifiable lexical environments.
- The work finds different probe behaviors across word boundaries versus within lexical units and shows that orthographic conventions—especially apostrophised forms—affect the signal.
- A complementary classification approach uses cantica-specific terms as lexical anchors, showing that the three cantiche are separated but also follow a continuous trajectory across the poem.
Related Articles

Legal Insight Transformation: 7 Mistakes to Avoid When Adopting AI Tools
Dev.to

Legal Insight Transformation: Traditional vs. AI-Driven Research Compared
Dev.to

Legal Insight Transformation: A Beginner's Guide to Modern Research
Dev.to
I tested the same prompt across multiple AI models… the differences surprised me
Reddit r/artificial

The five loops between AI coding and AI engineering
Dev.to