ModuSeg: Decoupling Object Discovery and Semantic Retrieval for Training-Free Weakly Supervised Segmentation
arXiv cs.CV / 4/9/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- ModuSeg is a training-free weakly supervised semantic segmentation method that explicitly decouples object discovery (localization) from semantic assignment (category labeling) to avoid models overfitting to sparse discriminative regions.
- It uses a general mask proposer to generate geometric proposals with reliable boundaries, then relies on foundation-model features stored in an offline feature bank and performs segmentation via non-parametric feature retrieval.
- The approach introduces semantic boundary purification and soft-masked feature aggregation to reduce boundary ambiguity and quantization errors, improving the quality of learned category prototypes.
- Experiments on standard benchmark datasets show that the decoupled design preserves fine-grained boundaries while achieving highly competitive performance without parameter fine-tuning, and the code is released publicly.
Related Articles

Black Hat Asia
AI Business

Amazon CEO takes aim at Nvidia, Intel, Starlink, more in annual shareholder letter
TechCrunch

Why Anthropic’s new model has cybersecurity experts rattled
Reddit r/artificial
Does the AI 2027 paper still hold any legitimacy?
Reddit r/artificial

Why Most Productivity Systems Fail (And What to Do Instead)
Dev.to