Robust by Design: A Continuous Monitoring and Data Integration Framework for Medical AI

arXiv cs.CV / 4/13/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes an autonomous continuous monitoring and data integration framework to keep adaptive medical AI models robust against data drift in real clinical settings.
  • Using multi-metric feature analysis plus Monte Carlo dropout uncertainty gating, the method only incorporates new images that appear statistically similar to the training distribution and have low predictive entropy.
  • It supports incremental retraining with safeguards that prevent performance degradation beyond a 5% threshold, aiming to avoid catastrophic forgetting.
  • Experiments on glomerular pathology image classification with a ResNet18 ensemble and a multi-center dataset show sustained performance (AUC ~0.92 and accuracy ~89%) even as new images are added.
  • Overall, the approach targets long-term reliability for medical imaging AI by combining drift detection, selective data ingestion, and controlled continual learning.

Abstract

Adaptive medical AI models often face performance drops in dynamic clinical environments due to data drift. We propose an autonomous continuous monitoring and data integration framework that maintains robust performance over time. Focusing on glomerular pathology image classification (proliferative vs. non-proliferative lupus nephritis), our three-stage method uses multi-metric feature analysis and Monte Carlo dropout-based uncertainty gating to decide when to retrain on new data. Only images statistically similar to the training distribution (via Euclidean, cosine, Mahalanobis metrics) and with low predictive entropy are integrated. The model is then incrementally retrained with these images under strict performance safeguards (no metric degradation >5%). In experiments with a ResNet18 ensemble on a multi-center dataset, the framework prevents performance degradation: new images were added without significant change in AUC (~0.92) or accuracy (~89%). This approach addresses data shift and avoids catastrophic forgetting, enabling sustained learning in medical imaging AI.