Beyond Visual Cues: Semantic-Driven Token Filtering and Expert Routing for Anytime Person ReID
arXiv cs.CV / 4/17/2026
📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsModels & Research
Key Points
- The paper introduces STFER (Semantic-driven Token Filtering and Expert Routing) for any-time person re-identification under large modality shifts (RGB/IR) and major clothing changes.
- STFER uses Large Vision-Language Models (LVLMs) to generate identity-consistency semantic text that encodes identity-discriminative, biometric-constant information.
- It applies this semantic text in two mechanisms: Semantic-driven Visual Token Filtering (SVTF) to emphasize informative visual regions while suppressing background noise, and Semantic-driven Expert Routing (SER) to improve multi-scenario gating.
- Experiments on the AT-USTC dataset show state-of-the-art performance, and a model trained on AT-USTC generalizes strongly to five widely used ReID benchmarks.
- The authors state that the code will be released soon, enabling further research and replication.
Related Articles
langchain-anthropic==1.4.1
LangChain Releases

🚀 Anti-Gravity Meets Cloud AI: The Future of Effortless Development
Dev.to

Stop burning tokens on DOM noise: a Playwright MCP optimizer layer
Dev.to

Talk to Your Favorite Game Characters! Mantella Brings AI to Skyrim and Fallout 4 NPCs
Dev.to

AI Will Run Companies. Here's Why That Should Excite You, Not Scare You.
Dev.to