Copula-enhanced Vision Transformer for high myopia diagnosis through OU UWF fundus images
arXiv cs.CV / 5/4/2026
💬 OpinionDeveloper Stack & InfrastructureModels & Research
Key Points
- The paper targets AI-assisted myopia screening by jointly performing two tasks from both-eye (OU) ultra-widefield fundus images: diagnosing OU high myopia (a binary outcome) and predicting axial length (a continuous outcome).
- It proposes a Vision Transformer approach that uses residual adapters on a foundation model to capture both inter-ocular similarity and heterogeneity between the two eyes.
- To handle the mixed binary–continuous multitask outputs, the authors introduce a four-dimensional copula loss that models conditional dependence via a Gaussian copula likelihood, implemented in PyTorch.
- They develop a computationally efficient fast Monte Carlo Expectation Maximization (fMCEM) algorithm for estimating copula parameters and show theoretical numerical stability under a multitask overfitting issue they call the stronger covariance phenomenon.
- Experiments on an annotated OU ultra-widefield fundus dataset and on synthetic data show stable improvements in both classification and regression performance with the proposed method.
Related Articles
AnnouncementsBuilding a new enterprise AI services company with Blackstone, Hellman & Friedman, and Goldman Sachs
Anthropic News

Dara Khosrowshahi on replacing Uber drivers — and himself — with AI
The Verge

CLMA Frame Test
Dev.to

Governance and Liability in AI Agents: What I Built Trying to Answer Those Questions
Dev.to

Roundtable chat with Talkie-1930 and Gemma 4 31B
Reddit r/LocalLLaMA