COMO: Closed-Loop Optical Molecule Recognition with Minimum Risk Training
arXiv cs.CV / 4/28/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- Optical chemical structure recognition (OCSR) is difficult in real-world documents due to wide variations, shorthand, and visual noise, and many deep-learning methods use teacher forcing with token-level MLE that creates exposure bias.
- The paper proposes Minimum Risk Training (MRT) for OCSR and introduces COMO, a closed-loop framework that reduces exposure bias by optimizing molecule-level, non-differentiable objectives via iterative sampling and evaluation.
- Experiments on ten benchmarks, including synthetic and real patent/scientific diagrams, show COMO significantly outperforms existing rule-based and learning-based approaches while requiring less training data.
- Ablation results indicate that MRT is architecture-agnostic, suggesting the approach can be broadly applied to end-to-end OCSR systems.
Related Articles

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Same Agent, Different Risk | How Microsoft 365 Copilot Grounding Changes the Security Model | Rahsi Framework™
Dev.to

Claude Haiku for Low-Cost AI Inference: Patterns from a Horse Racing Prediction System
Dev.to

How We Built an Ambient AI Clinical Documentation Pipeline (and Saved Doctors 8+ Hours a Week)
Dev.to

🦀 PicoClaw Deep Dive — A Field Guide to Building an Ultra-Light AI Agent in Go 🐹
Dev.to