Fixation Sequences as Time Series: A Topological Approach to Dyslexia Detection

arXiv cs.CL / 4/24/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The study proposes using persistent homology from topological data analysis to extract robust, multi-scale features from time series via filtration over varying thresholds.
  • It introduces novel filtrations tailored to time series and applies topological analysis to eye-tracking fixation sequences by treating them as time series.
  • The authors build “hybrid models” that combine persistent-homology (topological) features with conventional statistical features for analyzing eye-tracking data.
  • Evaluated on dyslexia detection using the Copenhagen Corpus (scanpaths from dyslexic and non-dyslexic L1/L2 readers), the hybrid topological-statistical models outperform methods that use only traditional features.
  • The paper reports that the proposed filtrations themselves are stronger than existing ones and that the topological features reach performance comparable to established baselines.

Abstract

Persistent homology, a method from topological data analysis, extracts robust, multi-scale features from data. It produces stable representations of time series by applying varying thresholds to their values (a process known as a \textit{filtration}). We develop novel filtrations for time series and introduce topological methods for the analysis of eye-tracking data, by interpreting fixation sequences as time series, and constructing ``hybrid models'' that combine topological features with traditional statistical features. We empirically evaluate our method by applying it to the task of dyslexia detection from eye-tracking-while-reading data using the Copenhagen Corpus, which contains scanpaths from dyslexic and non-dyslexic L1 and L2 readers. Our hybrid models outperform existing approaches that rely solely on traditional features, showing that persistent homology captures complementary information encoded in fixation sequences. The strength of these topological features is further underscored by their achieving performance comparable to established baseline methods. Importantly, our proposed filtrations outperform existing ones.