PCA-Seg: Revisiting Cost Aggregation for Open-Vocabulary Semantic and Part Segmentation
arXiv cs.CV / 3/19/2026
📰 NewsModels & Research
Key Points
- PCA-Seg introduces parallel cost aggregation to alleviate knowledge interference between class-level semantics and spatial context in open-vocabulary semantic and part segmentation.
- It features an expert-driven perceptual learning (EPL) module with a multi-expert parser to fuse semantic and contextual features and a coefficient mapper that learns pixel-specific weights for adaptive feature integration.
- A feature orthogonalization decoupling (FOD) strategy reduces redundancy between semantic and contextual streams, enabling learning from orthogonalized, complementary knowledge.
- Extensive experiments on eight benchmarks show that each parallel block adds only about 0.35M parameters while delivering state-of-the-art OSPS performance.
- The approach offers a lightweight, scalable path to improved vision-language alignment in open-vocabulary segmentation tasks.




![[Boost]](/_next/image?url=https%3A%2F%2Fmedia2.dev.to%2Fdynamic%2Fimage%2Fwidth%3D800%252Cheight%3D%252Cfit%3Dscale-down%252Cgravity%3Dauto%252Cformat%3Dauto%2Fhttps%253A%252F%252Fdev-to-uploads.s3.amazonaws.com%252Fuploads%252Fuser%252Fprofile_image%252F3833034%252F44fa15e0-8eb9-4843-a424-a4a7b3538f43.jpeg&w=3840&q=75)