SODA-CitrON: Static Object Data Association by Clustering Multi-Modal Sensor Detections Online

arXiv cs.RO / 4/29/2026

💬 OpinionDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces SODA-CitrON, an online method for associating and tracking static objects by clustering heterogeneous, multi-modal sensor detections.
  • Unlike classical data association approaches such as JPDA that rely on strong motion models, SODA-CitrON is designed to work well when static objects are observed intermittently with clutter and heterogeneous uncertainty.
  • The approach is unsupervised and fully online, estimating object positions while maintaining persistent tracks even when the number of objects is unknown.
  • The method offers worst-case loglinear computational complexity in the number of detections and aims to provide explainable outputs.
  • In Monte Carlo simulations across multiple scenarios, SODA-CitrON outperforms prior techniques (including POM-based filtering, DBSTREAM clustering, and JPDA) on metrics such as F1 score, position RMSE, MOTP, and MOTA for static mapping.

Abstract

The online fusion and tracking of static objects from heterogeneous sensor detections is a fundamental problem in robotics, autonomous systems, and environmental mapping. Although classical data association approaches such as JPDA are well suited for dynamic targets, they are less effective for static objects observed intermittently and with heterogeneous uncertainties, where motion models provide minimal discriminative power with respect to clutter. In this paper, we propose a novel method for static object data association by clustering multi-modal sensor detections online (SODA-CitrON), while simultaneously estimating positions and maintaining persistent tracks for an unknown number of objects. The proposed unsupervised machine learning approach operates in a fully online manner and handles temporally uncorrelated and multi-sensor measurements. Additionally, it has a worst-case loglinear complexity in the number of sensor detections while providing full output explainability. We evaluate the proposed approach in different Monte Carlo simulation scenarios and compare it against state-of-the-art methods, including POM-based filtering, DBSTREAM clustering, and JPDA. The results demonstrate that SODA-CitrON consistently outperforms the compared methods in terms of F1 score, position RMSE, MOTP, and MOTA in the static object mapping scenarios studied.