FruitProM-V2: Robust Probabilistic Maturity Estimation and Detection of Fruits and Vegetables

arXiv cs.AI / 4/30/2026

💬 OpinionModels & Research

Key Points

  • The paper argues that fruit and vegetable maturity is a continuous biological process, but vision-based methods often force it into hard multi-class boundaries that can misrepresent similar ripening stages.
  • By studying annotation reliability with two independent annotators on a held-out tomato dataset, the authors find that disagreement is concentrated near adjacent maturity stages.
  • To address this, FruitProM-V2 treats maturity as a latent continuous variable and uses a probabilistic “distributional detection head,” converting the learned distribution into class probabilities via the CDF.
  • The probabilistic approach achieves performance comparable to a standard detector on clean labels while better capturing uncertainty, and it shows improved robustness when training with controlled label noise.
  • Overall, explicitly modeling maturity uncertainty is presented as a way to produce more reliable maturity estimation for harvest-timing decisions.

Abstract

Accurate fruit maturity identification is essential for determining harvest timing, as incorrect assessment directly affects yield and post-harvest quality. Although ripening is a continuous biological process, vision-based maturity estimation is typically formulated as a multi-class classification task, which imposes sharp boundaries between visually similar stages. To examine this limitation, we perform an annotation reliability study with two independent annotators on a held-out tomato dataset and observe disagreement concentrated near adjacent maturity stages. Motivated by this observation, we model maturity as a latent continuous variable and predict it probabilistically using a distributional detection head, converting the distribution into class probabilities through the cumulative distribution function (CDF). The proposed formulation maintains comparable performance to a standard detector under clean labels while better representing uncertainty. Furthermore, when controlled label noise is introduced during training, the probabilistic model demonstrates improved robustness relative to the baseline, indicating that explicitly modeling maturity uncertainty leads to more reliable visual maturity estimation.