Rethinking MLLM Itself as a Segmenter with a Single Segmentation Token
arXiv cs.CV / 3/20/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The SELF1E paper investigates decoder-free segmentation for MLLMs by using a single segmentation embedding, aiming to remove the need for external mask decoders.
- It addresses resolution loss by keeping image features at original resolution and refilling them with residuals from LLM-processed compressed features to improve precision.
- It introduces pixel-unshuffle operations and a dual-path attention mask (image-to-image and image-to-segmentation) to enrich feature interaction between pixels and the segmentation token.
- Experiments show SELF1E achieves competitive results with decoder-based methods across multiple segmentation tasks, demonstrating the feasibility of decoder-free segmentation in MLLMs. Project page: https://github.com/ANDYZAQ/SELF1E.
Related Articles

Attacks On Data Centers, Qwen3.5 In All Sizes, DeepSeek’s Huawei Play, Apple’s Multimodal Tokenizer
The Batch

Your AI generated code is "almost right", and that is actually WORSE than it being "wrong".
Dev.to

Lessons from Academic Plagiarism Tools for SaaS Product Development
Dev.to

**Core Allocation Optimization for Energy‑Efficient Multi‑Core Scheduling in ARINC650 Systems**
Dev.to

KI in der amtlichen Recherche beim DPMA: Was Patentanwälte bei Neuanmeldungen jetzt beachten sollten (Stand: März 2026)
Dev.to