Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement
arXiv cs.CV / 3/23/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces HRNet, a Hybrid Registration Network that combines representation disentanglement with hybrid parameter prediction for multimodal image registration.
- It uses a shared backbone with Modality-Specific Batch Normalization and a Cross-scale Disentanglement and Adaptive Projection module to suppress modality-private cues and stabilize the shared feature space.
- A Hybrid Parameter Prediction Module performs non-iterative coarse-to-fine estimation of global rigid parameters and deformation fields, yielding a coherent deformation field.
- The method achieves state-of-the-art performance on both rigid and non-rigid registration across four multimodal datasets, with the code released on the project website.
Related Articles
Is AI becoming a bubble, and could it end like the dot-com crash?
Reddit r/artificial

Externalizing State
Dev.to

I made a 'benchmark' where LLMs write code controlling units in a 1v1 RTS game.
Dev.to

My AI Does Not Have a Clock
Dev.to
How to settle on a coding LLM ? What parameters to watch out for ?
Reddit r/LocalLLaMA