Edge-Based Standing-Water Detection via FSM-Guided Tiering and Multi-Model Consensus

arXiv cs.CV / 4/7/2026

💬 OpinionDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes an edge-deployed standing-water detection system for agricultural fields using Raspberry-Pi-class devices (optionally accelerated with Jetson) that fuses camera imagery with environmental sensors (humidity, pressure, temperature).
  • A finite-state machine (FSM) serves as a decision engine that adaptively selects between local and offloaded inference tiers, balancing accuracy, latency, and energy under intermittent connectivity and motion-dependent compute budgets.
  • Image-based detections are produced via a multi-model YOLO ensemble, while diurnal-baseline sensor fusion modulates decision “caution” when environmental anomalies occur.
  • The system logs frame-level decisions to enable bit-identical hardware-in-the-loop replays, and experiments across multiple configurations show improved detection performance versus static local baselines.
  • Results also indicate lower energy use than an always-heavy offload strategy while keeping tail latency bounded in a real agricultural deployment scenario.

Abstract

Standing water in agricultural fields threatens vehicle mobility and crop health. This paper presents a deployed edge architecture for standing-water detection using Raspberry-Pi-class devices with optional Jetson acceleration. Camera input and environmental sensors (humidity, pressure, temperature) are combined in a finite-state machine (FSM) that acts as the architectural decision engine. The FSM-guided control plane selects between local and offloaded inference tiers, trading accuracy, latency, and energy under intermittent connectivity and motion-dependent compute budgets. A multi-model YOLO ensemble provides image scores, while diurnal-baseline sensor fusion adjusts caution using environmental anomalies. All decisions are logged per frame, enabling bit-identical hardware-in-the-loop replays. Across ten configurations and sensor variants on identical field sequences with frame-level ground truth, we show that the combination of adaptive tiering, multi-model consensus, and diurnal sensor fusion improves flood-detection performance over static local baselines, uses less energy than a naive always-heavy offload policy, and maintains bounded tail latency in a real agricultural setting.