StomaD2: An All-in-One System for Intelligent Stomatal Phenotype Analysis via Diffusion-Based Restoration Detection Network

arXiv cs.CV / 4/22/2026

📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

Key Points

  • StomaD2 is an all-in-one, noninvasive framework for accurate, high-throughput stomatal phenotyping that avoids destructive sampling and manual annotation.
  • The system combines a diffusion-based restoration module to recover degraded images with a rotated object detection network designed for the small, dense, and cluttered nature of stomata.
  • Key model improvements include column-wise global feature interaction, context-aware resampling and reweighting for better multi-scale consistency, and a feature reassembly module to improve robustness against complex backgrounds.
  • In experiments on public Maize and Wheat datasets, StomaD2 reports accuracies of 0.994 and 0.992, and it achieves a top-tier F1-score/mAP of 0.989 versus ten advanced baselines including Oriented Former and YOLOv12.
  • The framework is packaged as a field-operable system that extracts eight stomatal phenotypes and is validated across 130+ plant species, supporting large-scale plant physiology and precision agriculture use cases.

Abstract

Stomata play a crucial role in regulating plant physiological processes and reflecting environmental responses. However, accurate and high-throughput stomatal phenotyping remains challenging, as conventional approaches rely on destructive sampling and manual annotation, restricting large-scale and field deployment. To overcome these limitations, a noninvasive restoration-detection integrated framework, termed StomaD2, is developed to achieve accurate and fast stomatal phenotyping under complex imaging conditions. The framework incorporates a diffusion-based restoration module to recover degraded images and a specialized rotated object detection network tailored to the small, dense, and cluttered characteristics of stomata. The proposed network enhances feature representation through three key innovations: a column-wise structure for global feature interaction, context-aware resampling and reweighting mechanism to improve multi-scale consistency, and a feature reassembly module to boost discrimination against complex backgrounds. In extensive comparisons, StomaD2 demonstrated state-of-the-art performance. On public Maize and Wheat datasets, it achieved accuracies of 0.994 and 0.992, respectively, significantly outperforming existing benchmarks. When benchmarked against ten other advanced models, including Oriented Former and YOLOv12, StomaD2 achieved a top-tier F1-score/mAP of 0.989. The framework is integrated into a user-friendly, field-operable system that supports the fast extraction of eight stomatal phenotypes, such as density and conductance. Validated on more than 130 plant species, StomaD2's results highlight its strong generalizability and potential for large-scale phenotyping, plant physiology analysis, and precision agriculture applications.