AI Navigate

Single molecule localization microscopy challenge: a biologically inspired benchmark for long-sequence modeling

arXiv cs.LG / 3/13/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces the Single Molecule Localization Microscopy Challenge (SMLM-C), a new benchmark dataset with ten SMLM simulations spanning dSTORM and DNA-PAINT modalities to evaluate state space models on sparse, biologically realistic spatiotemporal data with ground truth.
  • Using a controlled subset of simulations, the authors evaluate state space models and observe that performance degrades substantially as temporal discontinuity increases, highlighting challenges in modeling heavy-tailed blinking dynamics.
  • The study demonstrates fundamental limitations of current sequence models for sparse, irregular temporal processes encountered in real-world scientific imaging data, motivating the development of more suitable architectures.
  • SMLM-C provides a resource for future research in long-sequence modeling for biological imaging and serves as a testbed for comparing modeling approaches under realistic conditions.

Abstract

State space models (SSMs) have recently achieved strong performance on long sequence modeling tasks while offering improved memory and computational efficiency compared to transformer based architectures. However, their evaluation has been largely limited to synthetic benchmarks and application domains such as language and audio, leaving their behavior on sparse and stochastic temporal processes in biological imaging unexplored. In this work, we introduce the Single Molecule Localization Microscopy Challenge (SMLM-C), a benchmark dataset consisting of ten SMLM simulations spanning dSTORM and DNA-PAINT modalities with varying hyperparameter designed to evaluate state space models on biologically realistic spatiotemporal point process data with known ground truth. Using a controlled subset of these simulations, we evaluate state space models and find that performance degrades substantially as temporal discontinuity increases, revealing fundamental challenges in modeling heavy-tailed blinking dynamics. These results highlight the need for sequence models better suited to sparse, irregular temporal processes encountered in real world scientific imaging data.